For a Semiotic Approach to Generative Image AI

On Compositional Criteria

Authors

  • Enzo D'Armenio Author
  • Maria Giulia Dondero Author
  • Adrien Deliège Author
  • Alessandro Sarti Author

DOI:

https://doi.org/10.71743/ee5nrx33

Keywords:

semiotics, generative artificial intelligence, intersemiotic translation, visual studies, composition, enunciation

Abstract

This article analyzes the semiotic functioning of Midjourney and DALL•E, two generative AI models capable of producing images out of natural language prompts. The theoretical assumption of this article is that the images produced by these AIs are the results of a particular intersemiotic translation, realized through the collaboration of human and computational operators. Our research will show the specificity of the intersemiotic translation realized by AIs vis-à-vis more classical kinds of translation (e.g., from a novel to a movie) and will also analyze the different kinds of “visual reasoning” characterizing Midjourney and DALL•E. Our article has two goals: first, to study how these models perform intersemiotic translations; namely, what choices they make in order to translate the generality of the symbolic (and indexical) signs of verbal languages into the specificity of the visual composition. Second, we intend to verify the degree of control that one can have over the visual composition. Following this, we present the results of the tests carried out on Midjourney and DALL•E pertaining to two semiotic macro-criteria: plastic categories (eidetic, chromatic, and topological) and visual enunciation (gaze relations, visual translation of verbalized actions, temporality, and aspectuality). These criteria were developed by Paris School semiotics in order to analyze artistic images. Here, they will be used as principles of composition and parameters for controlling the results. At the end, we demonstrate that through this experimentation with elementary parameters of visual composition, semiotics can provide an epistemological and analytical framework for understanding and assessing the intersemiotic translations realized through generative AIs. Reciprocally, these tests on AIs aid our understanding of the two semiotic macro-criteria used, notably leading to a multiplication of enunciative instances in image production. Databases, algorithms, prompts, and aleatoric elements act as discursive agents.
a sequence of three two-dimensional geometric objects: a red circle, a green triangle and an orange square crossed horizontally by a white line

Downloads

Published

2025-04-22

Issue

Section

Articles

How to Cite

D'Armenio, E., Dondero, M. G., Deliège, A., & Sarti, A. (2025). For a Semiotic Approach to Generative Image AI: On Compositional Criteria. Semiotic Review, 9. https://doi.org/10.71743/ee5nrx33