Tag: Skeleton claw
-
The Ladder of Understanding: Imagen 3 Drew a Comic with Multimodal Gemini (I)
General Introduction – For the Entire Two-Part Series This two-part blog post series about the ladder of understanding continues our exploration of AI’s capabilities in the realm of visual understanding and creation. In our previous post, Can AI Interpret a Comic Strip?, we saw how Multimodal Gemini could analyze and interpret a complex comic. In…
-
Can AI Interpret a Comic Strip? Exploring the Capabilities of Multimodal Gemini
Prologue: This is a meta piece about Can AI interpret a comic strip? A comic strip about Scott Adam’s Talent stack is interpreted by a multimodal AI. This interpretation is later discussed by the blogger and Gemini 2 LLM. Have you ever wondered, “Can AI interpret a comic strip?”. That question sparked a fascinating journey…