Long-term preservation of biological information against degradation is a dominant constraint on cellular biology. We derive the architectural specification this constraint requires of a primordial stem cell (PSC) - the lineage configured to carry archival content across geological time and deploy it at threshold events - and search extant biology for the match. The dinoflagellates satisfy it comprehensively. Computational analysis of the Breviolum minutum genome recovers a previously unrecognized genetic language: a vocabulary of 1,819 fixed-width 9 nt type identifiers (modulons) composing combinatorially into 12,983 recipes that specify the cell's operational genes. The vocabulary is densely reused (each modulon participates in dozens of distinct recipes) and topologically organized for error tolerance (47.9× neighbor enrichment at one substitution); the recipes are bordered by the ATCATCATC envelope motif, oriented by side-exclusive directional flank families, and predominantly unique (7,382 of 8,757 recipes occur exactly once). The genome match anchors a wider one: the dinoflagellate protection profile, tandem-array organization, retroposition-via-DinoSL deployment tag, nuclear-net spatial scaffolding, dark proteome, broad operational capability, century-scale dormancy, and present-day deployment signatures map one-to-one onto the framework's PSC specification. The combined match shifts the burden of evidence onto accounts that read the dinoflagellate profile any other way.
Kovács et al. (Sun,) studied this question.