Los puntos clave no están disponibles para este artículo en este momento.
Abstract We introduce Finenzyme , a Protein Language Model (PLM) that employs a multifaceted learning strategy based on transfer learning from a decoder-based Transformer, conditional learning using specific functional keywords, and fine-tuning to model specific Enzyme Commission (EC) categories. Using Finenzyme , we investigate the conditions under which fine-tuning enhances the prediction and generation of EC categories, showing a two-fold perplexity improvement in EC-specific categories compared to a generalist model. Our extensive experimentation shows that Finenzyme generated sequences can be very different from natural ones while retaining similar tertiary structures, functions and chemical kinetics of their natural counterparts. Importantly, the embedded representations of the generated enzymes closely resemble those of natural ones, thus making them suitable for downstream tasks. Finally, we illustrate how Finenzyme can be used in practice to generate enzymes characterized by specific functions using in-silico directed evolution, a computationally inexpensive PLM fine-tuning procedure significantly enhancing and assisting targeted enzyme engineering tasks.
Building similarity graph...
Analyzing shared references across papers
Loading...
M. Nicolini
University of Milan
Emanuele Saitto
University of Milan
Ruben Emilio Jimenez Franco
Fundação Getulio Vargas
University of Milan
Royal Holloway University of London
Berlin Institute of Health at Charité - Universitätsmedizin Berlin
Building similarity graph...
Analyzing shared references across papers
Loading...
Nicolini et al. (Sat,) studied this question.
synapsesocial.com/papers/68e5cc6bb6db6435875629f6 — DOI: https://doi.org/10.1101/2024.08.10.607430