Research of biology and genomics inspired from linguis7cs is a dangerous territory, for several reasons. Once these are iden7fied, we can then dis7ll the shared concerns and common features between linguis7cs and gene7cs. This will be the introduc7on to my talk. I will address the ques7on of what is the nature of human language to finish with well a defined no7on of interest in the genera7ve modeling of the regula7on of gene expression, including a brief discussion on tokeniza7on of large language models (LLMs) of genomes. I will close my talk with what I consider a remarkable symbolic discovery.
Julio Collado Vides (Fri,) studied this question.