The rapid progress of large language models (LLMs) has fostered the development of domain-specific variants in law, medicine, and finance. However, existing legal LLMs still struggle to generate contextually grounded and regulation-compliant responses in complex scenarios of fair competition review. To address this, we present FairAgent, a collaborative multi-agent framework that unifies data refinement and reinforcement learning for legal reasoning. FairAgent integrates two core modules: (1) EchoCourt, a closed-loop data generation and refinement pipeline that constructs high-quality question–answer pairs through generation, critique, and optimization guided by a hierarchical Fairness Knowledge Forest; and (2) a two-stage outcome-based reinforcement learning mechanism that progressively teaches the model to invoke and integrate external retrieval in reasoning. We further enhance learning stability through a RAG-based rollout and retrieval-mask loss. Extensive evaluations demonstrate that FairAgent significantly improves reasoning accuracy, interpretability, and compliance in fair competition review compared with state-of-the-art baselines, establishing a scalable framework for retrieval-augmented legal intelligence.
Mao et al. (Mon,) studied this question.