LLM agents typically rely on a single model for multi-step tool-using tasks, creating a tension between required capability breadth and individual model limitations. We introduce LLM-Skill Orchestration, a three-layer architecture where: (1) a reasoning model generates orchestration rules from system constraints alone; (2) a planning model decomposes tasks into skill graphs with explicit dependencies; and (3) heterogeneous LLM-Skills — both pure-text and tool-equipped — execute in parallel through a shared context pool. We evaluate 50 agentic tasks across five types (information retrieval, code construction, cross-system analysis, multi-step reasoning, compound decision-making). Each task has 4–6 binary checklist items, totaling 202 items. The rule-augmented system (Hb) achieves 202/202 completion and 17.5/20 average quality (LLM-as-Judge, σ=2.0), compared to 137/202 (68%) and 7.4/20 for the single-model baseline (A), and 166/202 (82%) and 13.7/20 for static-rule orchestration (C). Key findings: (i) same-model decomposition (D: 8/22) performs worse than no decomposition (A: 13/22), proving that model diversity, not parallelism, drives collaborative gains; (ii) rule-blind generation (Hb: 96/100) outperforms rule-informed generation (Hi: 76/100), demonstrating that deductive reasoning from system invariants generalizes better than inductive learning from failure cases; (iii) 34 of 227 skills (15%) produced 0-byte output due to API anomalies, yet all were autonomously compensated by the synthesis stage — an emergent architectural resilience not designed into the system. This is the second paper in a three-part series on multi-model orchestration ('AI Managing AI'). Part 1 (DOI: 10.5281/zenodo.19387375) addresses knowledge synthesis; this paper addresses agentic task execution; Part 3 (in preparation) develops the theoretical framework including hallucination duality. Part 2 of 3 in the 'AI Managing AI' paper series. Part 1: Dimension-Direct Routing — knowledge synthesis via multi-model orchestration (DOI: 10.5281/zenodo.19387375). Part 2 (this paper): LLM-Skill Orchestration — agentic task execution via rule-augmented multi-model collaboration. Part 3 (in preparation): AI Managing AI — a dual-mode framework for deterministic execution and creative exploration, including hallucination duality theory.
Building similarity graph...
Analyzing shared references across papers
Loading...
Tao Rui
Building similarity graph...
Analyzing shared references across papers
Loading...
Tao Rui (Sat,) studied this question.
www.synapsesocial.com/papers/69dc88d83afacbeac03ea9a5 — DOI: https://doi.org/10.5281/zenodo.19503674
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: