Human sketches exhibit substantial variability across individuals in terms of line style, abstraction level and drawing conventions. Unlike realistic images, they provide limited contextual information and rely on highly simplified concept representations. Recognizing and generating sketches therefore requires efficient use of the available information, identification of the most informative local features, interpretation of their meaning within a minimal context, and understanding of the spatial relationships that define the overall structure. In this study, we introduce ViSketch-GPT, a representation and model that can extract these local features, contextualize them within the sketch and encode spatial relationships, thereby enabling a deeper understanding of the sketch structure. Guided by the intuition of the void as information, we leverage Signed Distance Functions (SDF) to reveal this potentially hidden information, organizing it via quadtree decomposition and processing it with a hierarchical Transformer to capture multi-scale dependencies. This structured representation allows the model to support both high-fidelity generation and accurate classification. Experiments on the QuickDraw and TU-Berlin datasets demonstrated that the model classifies sketches with high accuracy while generating outputs that preserve structural coherence, respect part relationships, and capture essential conceptual patterns despite the scarcity of information in the original sketches.
Building similarity graph...
Analyzing shared references across papers
Loading...
Giulio Federico
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo"
Giuseppe Amato
University of Salerno
Fabio Carrara
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo"
SHILAP Revista de lepidopterología
IEEE Access
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo"
Building similarity graph...
Analyzing shared references across papers
Loading...
Federico et al. (Thu,) studied this question.
synapsesocial.com/papers/69a75f7ec6e9836116a2ae71 — DOI: https://doi.org/10.1109/access.2026.3659732