This paper examines what happens when legacy Excel artifacts are treated not as static lesbut as repositories of executable business logic and are transformed into a structural intermediate representation (IR) and an executable codebase. Its unit of analysis is the document-to-codetransformation architecture itself, interpreted together with repository-level automation literature and o cial product documentation. In addition, a limited internal operationalexample is used to inspect how structured repository representations are consumed in a live LLMwork ow.The central claim is threefold. First, document codi cation turns a document-managementproblem into a software-engineering problem. Second, that shift externalizes tacit operationalknowledge into structural addresses, dependency edges, regeneration procedures, and durablechange history. Third, once the asset becomes a repository object rather than a raw document, repository-oriented automation tools gain materially stronger leverage for search, multile change planning, impact analysis, veri cation, and documentation. More speci cally, thevalue of Excel codi cation lies in producing a structured repository surface, and that structureimproves LLM work quality by improving retrieval relevance, grounding traceability, and outputstability. The paper analyzes that leverage along ve dimensions: structural addressability, dependency visibility, executable reproducibility, change traceability, and executable veri cation.
D Kim (Tue,) studied this question.