Tag: Multimodal Gemini
-
The Ladder of Understanding: Imagen 3 Drew a Comic with Multimodal Gemini (II)
Part 2: Beyond the Recipe: Exploring the Mind of an AI Artist In this part we talk less about the creativity process, but more about creative psychology – about the mind of an AI artist. In Part 1, Refining the Recipe, Step-by-Step, we chronicled our iterative journey of crafting text prompts to guide Imagen 3,…
-
The Ladder of Understanding: Imagen 3 Drew a Comic with Multimodal Gemini (I)
General Introduction – For the Entire Two-Part Series This two-part blog post series about the ladder of understanding continues our exploration of AI’s capabilities in the realm of visual understanding and creation. In our previous post, Can AI Interpret a Comic Strip?, we saw how Multimodal Gemini could analyze and interpret a complex comic. In…