This paper presents a human-centered, AI-driven framework for font design that reimagines typography generation as a collaborative process between humans and large language models (LLMs). Unlike conventional pixel- or vector-based approaches, our method introduces a Continuous Style Projector that maps visual features from a pre-trained ResNet encoder into the LLM’s latent space, enabling zero-shot style interpolation and fine-grained control of stroke and serif attributes. To model handwriting trajectories more effectively, we employ a Mixture Density Network (MDN) head, allowing the system to capture multi-modal stroke distributions beyond deterministic regression. Experimental results show that users can interactively explore, mix, and generate new typefaces in real time, making the system accessible for both experts and non-experts. The approach reduces reliance on commercial font licenses and supports a wide range of applications in education, design, and digital communication. Overall, this work demonstrates how LLM-based generative models can enhance creativity, personalization, and cultural expression in typography, contributing to the broader field of AI-assisted design.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yuexi Dong
Mingyong Gao
Information
University of Science and Technology of China
Beijing Jiaotong University
Building similarity graph...
Analyzing shared references across papers
Loading...
Dong et al. (Tue,) studied this question.
www.synapsesocial.com/papers/698435aaf1d9ada3c1fb4cc9 — DOI: https://doi.org/10.3390/info17020150