MARVIN — short for Materials Autonomous Research Validation and Inference Network — is a software-led infrastructure for materials discovery that treats the CALPHAD-derived Gibbs energy function as the universal currency tying every step of an autonomous loop together. The motivation is a concrete gap in the current self-driving-laboratory (SDL) literature: systems like the A-Lab, Ada, and Burger's mobile robotic chemist propose and synthesize candidate compounds without explicit reference to assessed thermodynamic data, which makes it hard to tell whether a product is a genuinely stable equilibrium phase, a kinetically trapped artifact, or simply unreacted precursor. Leeman et al. and Cheetham MARVIN is a direct response to it. The architecture is a four-layer pipeline. The first layer ingests data from literature, DFT, NIST-JANAF, and (eventually) live experiments. The second is the CALPHAD core itself — Bayesian thermodynamic database construction with ESPEI, multicomponent equilibrium calculations through pycalphad, precipitation kinetics via kawin, and the self-contained SGTE Gibbs evaluator built for this work. The third is a reasoning engine: a SQLite + NetworkX Knowledge Graph that records every datum with full provenance, and an MCP-tooled multi-agent layer that draws on the Coscientist / ChemCrow / SciAgents lineage for closed-loop reasoning. The fourth layer is the decision-action surface — a Digital Twin dashboard, an RL synthesis optimizer, an Economic Resilience module for supply-chain criticality, and the experiment-dispatch interface. Solid borders in Fig. 1 mark what's actually implemented; dashed borders mark the components that remain designed but not yet operational. The case study is garnet-type Li₇La₃Zr₂O₁₂ (LLZO), a leading solid-state-electrolyte candidate that braids together four classic problems — phase metastability, dopant-vacancy charge compensation, surface-chemistry contamination, and supply-chain vulnerability of tantalum. Section 7 reports actual computational results from the implemented half of the architecture: a two-panel Li-O-H Kellogg atmosphere diagram (computed from the Chang the work is candid that the falling branch beyond x ≈ 0.125 is driven by tetragonal ordering and grain-boundary effects and could be captured within CALPHAD by introducing a subregular L₁ interaction parameter on the octahedral sublattice — a natural target for the production ESPEI assessment. What sets the manuscript apart is its transparency about the gap between implemented and designed components, and a CC-BY 4.0 Zenodo deposit that ships the assessed TDB, ESPEI YAML run-config, four ESPEI JSON datasets (Chang-Hallstedt Li-O ZPF, Rettenwander 2016 site occupancies, Allen 2012 conductivity, Bolech 1996 pyrochlore), the self-contained SGTE Gibbs evaluator and CEF Gibbs solver, the entire figure pipeline, and a real Metropolis-Hastings MCMC trace with proper Gelman-Rubin diagnostics (R̂ < 1.05 across all parameters, ~20% acceptance) on a 3-parameter LiOH thermochemistry demonstration model. The garnet phase is deliberately kept out of the v1 deposit — the (Li,Al,Ga,Va)₃(Li,Al,Ga,Va)₄(La)₃(Zr,Al,Ga,Ta)₂(O)₁₂ model is presented as a proposal awaiting the converged production assessment, with an explicit strategy for managing its 64 endmember corners (eight realizable Ta-system corners + 16 Al/Ga analogues fitted to ND data, 40 fictive corners constrained by reciprocal relations or suppressed by a +500 kJ/mol charge-impossibility penalty). The result is a paper that anchors every autonomous decision to an assessed Gibbs energy function with full provenance — turning autonomous materials discovery from a black-box synthesis campaign into a thermodynamically auditable workflow — while being explicit about what's done, what's deferred, and exactly what reproducible artifacts back the claim.
Bustamante et al. (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: