Midjourney V6: Bridging Art and Text with Promising Yet Imperfect Results

Evelyn Galindo
4 min read · Jan 11, 2024
Image by Evelyn Galindo using Midjourney v6

Yes, I know. “Illusion” is spelled wrong, but it’s almost right and that’s the point.

The release of Midjourney v6 has been a much-anticipated event in AI and creative technology. This latest version promises a host of new features, including faster upscales, superior image quality, refined aesthetics, enhanced sensitivity to prompts, and the feature I’m most excited about: a significant leap in text generation. The update is particularly exciting for those yearning for a tool capable of seamlessly blending text into images, potentially heralding a new era in AI-assisted artistry. However, the question remains: does Midjourney v6 deliver?

To put this new tool to the test, I embarked on a series of image and text experiments. The results were a mixed bag, demonstrating both the potential and the current limitations of Midjourney v6.

The first trial involved creating a black and white stencil image of Mexican icon La Catrina, incorporating a sandwich board sign with the word “Esprit”. The artistic style was to be reminiscent of Banksy, miniaturecore, Jan Toorop, and Diego Rivera, using a heavy impasto technique against a stark white background. Impressively, Midjourney v6 fared well with this setup, effectively capturing the essence of stencil art and integrating the text in a Banksy-esque style. This success indicated that the tool might be more adept at handling simpler textual integrations, especially when the text is minimal and the artistic style simple and distinct.

Image by Evelyn Galindo using Midjourney v6

The second experiment aimed to capture the vibrancy of Spanish flamenco culture. The prompt described a Carmen Amaya-inspired woman in a business-style flamenco suit, deeply engrossed in her performance. The phrase “El baile es la poesía oculta del cuerpo” (Dance is the body’s hidden poetry) was to be included. Here, the AI struggled to balance the rich cultural elements with the textual component. While the image captured the flamenco spirit, the text integration was less successful, revealing the challenges Midjourney v6 faces in marrying complex visuals with text. A second image, using only the single word “love”, gave better results.

Image by Evelyn Galindo using Midjourney v6
Image by Evelyn Galindo using Midjourney v6

In the third and most emotive test, the phrases “joy is an act of resistance” and “joy is resistance” were used across four different scenarios: a boy playing soccer, a street performer, street graffiti, and a bouquet of flowers. Each image aimed to represent joy uniquely, challenging the AI to weave the inspiring text seamlessly into the visual narrative. The results varied: some images eloquently captured joy, while others faltered in text integration.

Image by Evelyn Galindo using Midjourney v6
Image by Evelyn Galindo using Midjourney v6
Image by Evelyn Galindo using Midjourney v6
Image by Evelyn Galindo using Midjourney v6

These experiments, though they reveal some shortcomings, underscore the significant progress Midjourney v6 represents, especially for those interested in the nuanced interplay of image and text. Its enhanced sensitivity to linguistic subtleties is noteworthy, even though it does not yet achieve seamless image-text integration.

Image by Evelyn Galindo using Midjourney v6

In summary, Midjourney v6 still needs work, but it marks a significant step forward in AI-driven creative tools. While it hasn’t fully mastered the art of blending text and imagery, its advancements in handling language and aesthetics are commendable. As AI technology continues to evolve, we are definitely closer to realizing the dream of tools that can truly enhance our creative expressions, seamlessly combining visual art with powerful textual narratives. For now, Midjourney v6 offers a promising glimpse into what the future holds for AI in the creative domain.
