What question did this study set out to answer?

This research analyzes the role of vision language models in the realm of art and authorship.

February 14, 2026

On Stochastic Picassos and Why Vision Language Models Cannot Replace Artists

Puntos clave

This research analyzes the role of vision language models in the realm of art and authorship.
Critical analysis of existing literature on vision language models and artistry
Examination of agency and its implications for authorship in artistic creation
Discussion of examples like DALL-E and Midjourney
VLMs can generate high-quality images but lack true authorship and agency
Only biological agents are currently responsible for genuine artistic creation
Concerns about VLMs undermining artistic tradition may be exaggerated

Resumen

Abstract Vision Language Models (VLMs), like DALL-E, Midjourney, and Stable Diffusion, have raised significant concerns regarding authorship and whether AI-generated images devalue artistic practices and traditions. Recently, some have argued that VLMs should be viewed as another tool that artists use to generate their creative outputs. I defend this position and expand on it by introducing an account of agency that demonstrates that only biological agents, at least for now, possess the necessary powers to be responsible for the act of creation (for example, the capacity to realize autonomous goal-directed actions and manipulate affordances). I ultimately argue that although VLMs afford artists the ability to output high-quality images with minimal technical skill, creating artworks that are artistically valued using VLMs will require significant ingenuity. Therefore, in my view, concerns that this new tool will blur the lines of authorship and undermine artistic practices and traditions are unwarranted.

Me gusta

Guardar

Me gusta

Guardar

On Stochastic Picassos and Why Vision Language Models Cannot Replace Artists

Puntos clave

Resumen

Cite This Study