Prompt: Difference between revisions

From CTPwiki

Line 26: Line 26:
File:Image-man-washing-dishes-2.png|Image generated with the prompt Man washing the dishes
File:Image-man-washing-dishes-2.png|Image generated with the prompt Man washing the dishes
File:Image-man-doing-the-dishes.png|Image generated with the prompt Man washing the dishes
File:Image-man-doing-the-dishes.png|Image generated with the prompt Man washing the dishes
</gallery>Humour, political critique or political hegemony. Trumpian visual politics is written through prompts.
</gallery>Sensitivity, brittleness. Humour, political critique or political hegemony. Trumpian visual politics is written through prompts.


== How does it relate to autonomous infrastructure? ==
== How does it relate to autonomous infrastructure? ==
The prompt becomes related to autonomous infrastructure through the practice of annotating LoRAs for instance. LoRA's annotation is a form of pre-prompting, embedding prompts in the model. Thinking about images infrastructurally. Not for one image but for a whole genre or character.
The prompt becomes related to autonomous infrastructure through the practice of annotating LoRAs for instance. LoRA's annotation is a form of pre-prompting, embedding prompts in the model. Thinking about images infrastructurally. Not for one image but for a whole genre or character.
[[Category:Objects of Interest and Necessity]]
[[Category:Objects of Interest and Necessity]]

Revision as of 09:59, 16 July 2025

Negative prompt

What is the network that sustains this object?

  • How does it move from person to person, person to software, to platform, what things are attached to it (visual culture)
  • Networks of attachments
  • How does it relate / sustain a collective? (human + non-human)

Prompts can be shared or kept private. Help others to reproduce. Economy of sharing. List of best prompts and tutorials. Groups of people with same interest in CivitAI.

How does it evolve through time?

Evolution of the interface for these objects. Early chatgpt offered two parameters through the API: prompt and temperature. Today extremely complex object with all kinds of components and parameters. Visually what is the difference? Richness of the interface in decentralization (the more options, the better...). Ye the prompt remains very central. Break from previous experiments with GANs which remained confined to a technically skilled audience. Promise of democratization.

Diverging tendencies in the ways users are invited to prompts in AI systems. One philosophy is to ask as little as possible from the user. With only a few words, the user get what they supposedly want. The system has to make up for all the bits that are missing, context, etc. This involves prompt augmentation on the server side. Also a lot of implicit assumptions.

The other approach is to give the user all the means to prompt as an expert. Prompt expansion is visible to the user, providing tools to improve the prompt, offer context, continuous chat, etc. Image generation in Stable Horde is stateless. Meaning that every prompt is considered in isolation. The system can't infer anything from past prompts, no concepts of sessions. Much harder to create this sense of continuity than with a centralized service such as OpenAI's chatGPT.

Evolution of prompting in Flux models. They integrate the advances in LLMs to add a more refined semantic understanding of the prompt.

(When does the negative prompt appear? )

How does it create value? Or decrease / affect value?

The quality of a model is evaluated to how well it responds to prompts. Prompt adherence.

What is its place/role in techno cultural strategies?

The fantasy behind the system is that by interpreting the prompt, it "reads" what is in the user's mind. The interpreting the prompt involves much more than a literal translation of a string of words into pixels. It is the interpretation of the meaning of these words. As historically prompts were limited in size, this work of interpretation would be performed on the basis of a very minimal description. Even often with a syntax reduced to a comma-separated list or a string of tags. This had for effect that the model was tasked to fill the blanks. As the model tried to make do, it would inevitably reveal its own bias. If a prompt mentions an interior, the model would generate an image of a house that reflects the dominant trends in its training data. Prompting is therefore half ideation and half search: the visualisation of an idea (what the user wants to see) and the visualisation of the model's worldview. After a few prompts, a user understands that each model has its own singularity. The model is trained with particular data. Further, it synthesizes these data in its own way. Through prompting the user gradually develops a feel for the model's singularity. Elaborate semantics to work around perceived limitations. Midjourney prompt "The viking telling a secret in the mouth of another" to generate an image of two vikings kissing on the mouth. A way to outsmart the system or to bypass censorship. But also an understanding of how to navigate latent space. Guiding the denoising process through prompting.

Sensitivity, brittleness. Humour, political critique or political hegemony. Trumpian visual politics is written through prompts.

How does it relate to autonomous infrastructure?

The prompt becomes related to autonomous infrastructure through the practice of annotating LoRAs for instance. LoRA's annotation is a form of pre-prompting, embedding prompts in the model. Thinking about images infrastructurally. Not for one image but for a whole genre or character.