High Fidelity Imagineering With AI | A Beginner’s Journey

After about 20 minutes of reading and prompt writing, these are the first images I created on Midjourney, the AI-assisted image-creation platform.

The fidelity of these creations is incredible. Check out these closeups:

As a further experiment and using one of the already created Celtic warriors as the basis for the design, I went on to develop some potential covers for a fiction book I’m writing:

First set of images from prompt.

I’m exploring this new platform for two reasons: to understand its potential for creating images for my own creative projects, and to see whether it’s something to build into the public speaking masterclasses I deliver for clients.

In a part of these masterclasses I talk about and show how to craft kick-ass presentation decks. In doing so I utilise a vast array of open-source and Creative Commons-licensed content available on the web, via a sweet bevy of resources amassed over the years. This means that within twenty minutes or so I can find some wonderful items to embellish any story being told and build out a pretty wicked looking deck.

However, after just an hour or so playing on Midjourney, I see it becoming a powerful part of a storyteller’s toolkit. The business case will be: why trawl through different sites if you can very quickly state your needs and get a high-fidelity version of a specific image which serves the need?

Examples

Whether you’re an architectural student talking about the parametric abilities of your discipline:

A team leader wanting to represent a metaphoric path relating to a bold business strategy needing to be pursued:

A human resources lead presenting to the board and needing a stock-style image of people laughing in a corporate setting around a computer to demonstrate the aim for a conducive work culture:

Or just having fun turning one of your favourite cartoon characters from childhood into a photo-realistic portrait:

First set of images from prompt.
Second iteration with version 4 (bottom right) being my fav, see below:
Boom, Count Duckula in the proverbial flesh!

The only restriction is your own imagination (which is why I like the term Augmented Imagination for AI)!

Limitations

It’s not a perfect platform.

Here’s what I got when I asked for a simple yellow post-it note with the word ‘help’ written on it:

There’s also a barrier to entry, from both a digital interface perspective and an image literacy angle.

Access to Midjourney (which is still positioned as being in beta) is via the Discord platform. This means the creation and indeed craft of producing these images is done out in the open in the semi-public hosted forums. I say semi-public because without first having a Discord account and accepting a few T&Cs to get access, you can’t play.

Then there’s the literacy step. Knowing about cameras, lenses, exposures, angles and lighting will give your images the edge above others. You have to be open to the messy process of nudging the system in the right way, and nuances of language along with scene setting or even world building are needed.

Whether it be creating cute little monsters:

An atmospheric and rainy backstreet in an Asian city:

To a toy figure of Michael Jordan:

…you have to guide the output heavily.

The Trick

It can all sound too good to be true or, for some people I spoke to before I started playing, a little too confusing as well.

The interface is quite jaggedy in its forum style: think ‘Slack channel peppered with a cacophony of people you don’t know saying wild and wonderful things with images blooming to life in between your prompts’. It scrolls without you, and finding your creations sometimes becomes hard work. Once you get your head around it, though, I found the trick to creating impressive images is to balance simplicity and complexity.

Laying down the basics along with the intricate details of how to construct the output you want to see yields high-end results. For example, to build the image above I stipulated the scene, the person, the emotion displayed, the specific laptop being used, then the angle, the camera, the lens, the exposure, the lighting and the general ‘feel’—there’s a lot steering the production of what you see but it’s worth the time.
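
To make that list a bit more concrete, here’s a minimal sketch in Python of how those ingredients could be lined up in order before pasting the result into Midjourney. The PromptParts class, the build_prompt function and every example value are made up for illustration; none of them are part of Midjourney, and the wording is an assumption, not the actual prompt behind the image.

```python
from dataclasses import dataclass, fields


@dataclass
class PromptParts:
    """Hypothetical container for the prompt ingredients listed in the post."""
    scene: str
    person: str = ""
    emotion: str = ""
    prop: str = ""       # e.g. the specific laptop being used
    angle: str = ""
    camera: str = ""
    lens: str = ""
    exposure: str = ""
    lighting: str = ""
    feel: str = ""


def build_prompt(parts: PromptParts) -> str:
    """Join the non-empty ingredients, in field order, into one comma-separated prompt."""
    values = (getattr(parts, f.name).strip() for f in fields(parts))
    return ", ".join(v for v in values if v)


if __name__ == "__main__":
    # Illustrative values only; swap in your own scene and camera details.
    example = PromptParts(
        scene="people gathered around a computer in a bright corporate office",
        emotion="laughing",
        angle="eye-level shot",
        lens="50mm lens",
        lighting="soft natural window light",
        feel="stock photo style",
    )
    print(build_prompt(example))
```

Starting with the basics (scene, person, emotion) and only then layering on the camera details keeps the simple parts readable while still leaving room for the intricate ones.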

Built into the Midjourney service are lots of ‘commands’ to get your head around, and an interesting one is the “/describe” command, which writes four example prompts based on an image you upload. So if you’re already using an image that you own or created, you can upload it and use the manufactured text prompt to get a sense of how you can create another one like it.

This and many other commands are part of the paid-for service (through the Midjourney subscription plans). I’m still using the free service, but I point it out as a great way to get started and to find inspiration in prompt-writing approaches.

Ownership

The current T&Cs of Midjourney say you keep what you create:

“…You own all Assets You create with the Services, to the extent possible under current law.”

…although they also state that Midjourney owns it all as well (including the prompts and any reference material you upload), along with some other funky stuff in there, like anyone else in the community being able to use and remix your creations.

It’s a bit confusing.

Just be aware of the classic adage: if it’s free or cheap you’re probably the product at this stage.

Again, if you want to create in the dark (without being in the public forum) you have to pay for that privilege.

Ethics

Like most, though, I’m deeply worried about the ethical nature of these creative engines due to questions on where they sourced their data (the large datasets used to ‘train’ the AI; I recommend reading A Completely Non-Technical Explanation of AI and Deep Learning if all this is gobbledygook). Plus, if any output is being influenced by specific artists or other creatives, where are the attribution and citations for them? Surely that should be there as standard and would be easy to code in if you’ve come this far…?

There’s also a question of fabricating ‘alternative’ realities (like the recent Pope in a Prada coat). But what about making different historical narratives?

Like in the latter years of his life, Steve Jobs spent most of his time in his small home office working on old school computers:

He didn’t, I made this scene and story up.

Or photos of Elvis when he was a baby already playing a guitar:

Again, not true, all made up by me!

Then there are other questions, including environmental ones, as there are valid concerns around the growing carbon footprint of these platforms. Plus, as mentioned, there are ethical concerns regarding image scrapers who took 30 billion images from Facebook and other social media sites and gave them to cops: it puts everyone into a ‘perpetual police line-up’.

Conclusion

I’m dazzled by this!

It’s captivating to be generating such quality with little effort. And therein lies the danger. What I fear people will miss here is respect for the background nature of the inventive act of creativity. I’m lucky enough to be able to navigate through my understanding and respect for what informs these constructions (I mean the influences from photographers and my basic knowledge of cameras plus image form as well as the technological interface), and yet the ease with which impressive results can be achieved is so liberating.

With all technologies, I always experience them through the lens of augmentation. Does this aid what I do and offer into the world, with kindness and deep consideration of the impact?

Midjourney and the larger AI suite of tools (currently) do nothing without us. They are value-neutral. However, it’s simple to see how much negative impact they will have in the future, having us all question what we see whilst bad actors look to exploit this to their advantage. And I mean much more than cheeky photographers winning prizes using AI.

So for now I invite you to take some time and look at the public Midjourney showcase to see the possibilities of what people are producing. It’s very hard not to raise your eyebrows in awe.

What about you, what potential do you see in platforms like this? Where will you use it? Why won’t you?

Because you never know when you’ll need to create a shocked-looking, blue-faced pink sheep in a meadow:

All images in this blog post were produced by Midjourney from prompts I came up with.

Nature x Humanity | A Lesson In Adaptive Architecture

Radical material varietals which replace the current extractive system, buildings that react in bio-sensitive ways, and cities which heal our planet.

“The documentary debuts at a critical juncture when the anthropomass—the mass produced by humans—has exceeded that of the living biomass on our planet.”

Neri Oxman is an impressive human.

I watched her give her Design at the intersection of technology and biology TED talk whilst attending TEDActive in 2015 and we connected virtually shortly after. Ever since, I’ve been trying to bring her out to this fair land to speak at either TEDxWellington or Creative Leadership NZ (however, diaries never aligned).

She is the founder of The Mediated Matter Group at the Massachusetts Institute of Technology and the originator of Material Ecology: exploring / teaching at the intersection of technology, biology and sustainable design. The above is a retrospective of sorts relating to her work and an invitation to explore what is being done beyond one’s own level of literacy.

I watched the short documentary hanging on by my cognitive-fingernails, as the conceptual and material potential shared realigned with something precious which has been lost of late when speaking to local leaders and industry professionals about their approach to city planning / building / construction industry practices, and that is, hope.

There is a vast number of latent possibilities in the projects explored: whether it be 3D printing optically aligned glass with a molten calligraphy pen for high-efficiency solar-energy capture, adaptable bio-based structural materials with programmed decomposition, co-designing with silkworms to inform future architectural forms and processes, synthesising pigments to enhance structures with their embodied properties, or building autonomously at large scale for urban settings. One ends the viewing experience seriously considering either retraining into this emerging field or at least supporting it the best way one can, to allow it to shape future discussions and positively rebalance the world back to bio-equanimity*.

It’s all about making nature your / our client.

Thank you, Neri!

*my work as MC and podcast producer for the wonderful teulo.co platform is playing some small part in this.

Creative Welly Episode #2 | More Courageous Conversations

Another two good souls exploring all manner of wonderful topics, ideas, stories, experiences etc.

We curate and collide intrepid talent from the coolest most creative little capital in the world (with a few out-of-city friends as well from time-to-time).

Subscribe and catch the first episode via Creative Welly.

Again, this project is totally self-funded and wouldn’t be possible without these amazing collaborators:

All episodes are shot and edited by the wonderfully talented Jono Tucker, Empire Films. An extremely diligent and personable soul who has added a polish to the resulting video which I never could’ve achieved, thank you Jono.

Hosted at Xequals, a centrally based web development agency who provide us with a kick-ass office which totally gets kitted out for the shoot. Thank you Alex Matthews for being so gracious with your space.

Learn more about the background in this ‘Creative Welly Launched | Learning Out Loud’ post.


The Future Of Now | Designing Social

“Let’s just forget about the future, And get on with the past” Sting

Imagine designing a classroom. A place for learning and the cultivation of curiosity.

Four walls. Ceiling and a floor. Some windows and a door. Other elements like power sockets, furniture, projector, whiteboards, light switches. Focus on that light switch. It could be a dimmer or maybe a couple of configurations laid out as buttons in a vertical line. It’s usually just on or off.

Replace it with a camera. Now, with existing gestural technology and software, the users of the room have the potential to wave their hand or hold up a certain number of fingers to make it work.

What if the camera was ‘broken’? Left open for the students to decide how it will function and, better still, to learn how to programme it to make it work. Maybe they replace it with a microphone as they want voice commands (and it changes to recognise different languages depending on what is being taught that day in class). Or the camera recognises a colour and in turn lights the room in that same colour.

Now, not only is the classroom designed as a place to learn but also a space to learn how to use.


‘The Future Of Now’ was the title of a talk / workshop developed and delivered to the wonderful souls at DOWA-IBI Group Architects, Portland, Oregon (during my stateside trip in July).

The above was a response I gave when one of the architects asked for a very specific application of some of the social media / technologies in their future designs.

The official line:

DK was engaging, informative and thoughtful. He challenged us to think differently. The take away was: what is has already become what was and we should consider what will be with the opportunities available today.

For a firm like us we welcome that challenge.
John Weekes, Co-Founder, DOWA-IBI Group Architects
