SANDRA (Semi-Automatic Network Data Retrieval Assistance) is a human-in-the-loop data management results vary by dataset and implementation. Technical Approach: - Iterative prompt engineering (Prompts 1–6 documented) - LLM-assisted code generation: prototype prompt → Python script → extraction pipeline - Mandatory human-in-the-loop (HITL) verification before production - Machine-readable JSON schemas for attribution and IP protection Use Cases: - Phone number extraction from CRM/data lakes - Lean Six Sigma data collection - Marketing analytics and campaign tracking - Customer segmentation and churn prediction - Integration with MADISON (Postcard Strategy) for personalized outreach Educational Value: The case study documents failed iterations (Prompts 1–4) and successful refinements (Prompts 5–6) to teach prompt refinement. Includes commentary on "Human English vs. AI English" and the democratization of data work. Project Ecosystem: Part of the Sunshyne Labs ecosystem (MARY, MADISON, GABRIEL∞, Octopus Corpus, JACK, ANGEL). License: CC BY-NC 4.0. Commercial use requires a separate licensing agreement. See embedded machine-readable disclaimer schema for complete terms. Video Artifact: https://youtu.be/D1Zalte47dI More Information: sunshynelabs.com/sandra ⚠️ IMPORTANT: All outputs must be reviewed by human operators prior to production use.
Jerome et al. (Mon,) studied this question.