TLDR: ThematicPlane is a novel AI-powered image editing system that allows users to manipulate images based on high-level semantic concepts like mood or style, rather than technical prompts. It bridges the gap between a user’s tacit creative intent and the AI’s capabilities, fostering more intuitive and iterative design workflows, as demonstrated in an exploratory study.
Generative AI has revolutionized how we create images, making sophisticated tools accessible to many. However, a persistent challenge remains: aligning the AI’s output precisely with a user’s nuanced creative vision, especially for those without technical expertise. Current tools often demand users to translate their abstract ideas into concrete prompts or reference images, which can hinder the natural flow of creative exploration.
Imagine wanting to make an image feel ‘warmer’ or ‘more dramatic’ without knowing the exact technical parameters or complex prompts required. This gap between a user’s high-level semantic intent and the AI system’s low-level controls is a significant barrier. To address this, researchers have introduced ThematicPlane, an innovative system designed to bridge this very gap.
ThematicPlane allows users to navigate and manipulate high-level semantic concepts—such as mood, style, or narrative tone—within an interactive ‘thematic design plane.’ This approach moves beyond conventional prompting, enabling a more tacit and intuitive way to edit images that aligns with a user’s inherent understanding of a concept. It acts as a scaffolding layer, connecting a user’s unspoken creative intent directly with the system’s capabilities, fostering creativity without the need for explicit externalization.
How ThematicPlane Works
The system begins by taking an input image. Using advanced AI, it extracts thematic keywords from the image, focusing on elements like mood or style rather than specific objects. For each identified theme, ThematicPlane generates multiple thematic variations, positioning them along a semantic axis. It then uses sophisticated embedding techniques to compare the input image and textual themes, ranking them by similarity. These ranked descriptors are then fed into a powerful image generation model, like Imagen 3, to produce transformed images. Users can then choose any newly generated image as their primary reference to continue their editing process, allowing for iterative refinement.
Insights from an Exploratory Study
An exploratory study involving six participants, all with prior experience in image generation tools, provided valuable insights into ThematicPlane’s effectiveness. Participants engaged in tasks that required them to modify images to match specific targets, comparing ThematicPlane against other AI-based image generation methods.
The study revealed that ThematicPlane supports both open-ended discovery and goal-oriented refinement in creative tasks. While participants didn’t always anticipate the exact outcome of their edits, they reported high satisfaction with the results. Unexpected generations often served as inspiration, leading to new design directions or enabling iterations where themes blended cohesively.
However, the study also highlighted varying perceptions of how themes mapped to outputs. Some participants expected a linear relationship between their input and the resulting image, while others found the connections less clear. This suggests a future need for more explainable controls within such systems, helping users better understand the underlying steering mechanisms.
Also Read:
- Explanatory AI: A New Approach to Making AI Understandable for Everyone
- CLIPin: Boosting Image-Text Understanding with a New Non-Contrastive Approach
Looking Ahead
ThematicPlane represents a significant step towards more intuitive and semantics-driven interaction in generative design tools. Future work aims to enhance the system by introducing multidimensional representations, allowing users to manipulate multiple attributes simultaneously—for example, both mood and style—through more dynamic gestures. This could lead to even more nuanced control over the AI’s creative output, further bridging the gap between human imagination and artificial intelligence.
For more details, you can read the full research paper: ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Editing.


