SSIR
Structured Symptom Retrieval Intelligence

Structured Symptom Retrieval Intelligence (SSIR)
Deterministic, High-Confidence Medical Triage via Retrieval-First Architecture
Christopher Da Silva
Symptomloop Inc.
2025
Structured Symptom Retrieval Intelligence (SSIR) is a deterministic retrieval framework for medical triage that replaces the generative unpredictability of large language models (LLMs) with pre-engineered, clinically validated protocols and similarity search over precomputed embeddings. This design ensures every decision is auditable, reproducible, and cost-predictable—critical requirements for regulated healthcare environments.
SSIR is the healthcare-specific implementation of the broader Structured Intelligence Retrieval (SIR) paradigm, a domain-agnostic framework for retrieval-first, generation-last reasoning in high-stakes sectors. At its core, SSIR embeds a curated library of clinician-reviewed triage protocols into a vector space, processes patient narratives into canonical symptom terms via an NLP preprocessing layer, and iteratively refines the patient context until a confidence threshold is reached.
Structured Symptom Retrieval Intelligence (SSIR): Deterministic, High-Confidence Medical Triage via Retrieval-First Architecture
Abstract
Structured Symptom Retrieval Intelligence (SSIR) is a deterministic retrieval framework for medical triage that replaces the generative unpredictability of large language models (LLMs) with pre-engineered, clinically validated protocols and similarity search over precomputed embeddings. This design ensures every decision is auditable, reproducible, and cost-predictable—critical requirements for regulated healthcare environments.
SSIR is the healthcare-specific implementation of the broader Structured Intelligence Retrieval (SIR) paradigm, a domain-agnostic framework for retrieval-first, generation-last reasoning in high-stakes sectors. At its core, SSIR embeds a curated library of clinician-reviewed triage protocols into a vector space, processes patient narratives into canonical symptom terms via an NLP preprocessing layer, and iteratively refines the patient context until a confidence threshold (
) is reached.
The architecture includes multi-path orchestration to handle divergent symptom clusters, a controlled single-shot LLM fallback for resolving stalled reasoning loops, and strict guardrails to ensure that no diagnosis is generated at runtime. Retrieval is strictly non-generative—LLMs are used only for constrained clarifier generation from a pre-approved vocabulary.
SSIR's theoretical foundation, formalized mathematically in this paper, demonstrates convergence properties and introduces the Stochastic Structuring Effect (SSE)—the emergent tendency of unstructured patient input to resolve into structured, canonical symptom sets through iterative interaction. The framework delivers superior retrieval accuracy compared to traditional Retrieval-Augmented Generation (RAG) pipelines while operating at a fraction of the cost and with zero risk of hallucinated medical advice.
The medical triage technology stack is at an inflection point. Generative LLM-first approaches—while impressive in unconstrained domains—struggle to meet the reproducibility, safety, and cost thresholds demanded by regulated clinical practice.
The problem is threefold:
Safety & Compliance Risk
Operational Cost & Scalability
Accuracy & Multi-Symptom Complexity
SSIR solves these constraints by design.
It is a retrieval-first, generation-last system where the terminal output is always a clinician-validated protocol, never a model-invented one. The user's query is the only element embedded at runtime, compared against a fixed, embedded library of protocols. An iterative context refinement loop progressively improves retrieval confidence until it crosses the acceptance threshold, ensuring both determinism and convergence.
The framework introduces:
Why this matters now:
Healthcare AI is moving rapidly toward regulation-grade AI pipelines. SSIR is engineered to be certifiable under MDR/FDA frameworks, economically viable for nationwide deployment, and adaptable across domains that share similar constraints (finance, legal, logistics). By separating reasoning (retrieval) from articulation (optional LLM), it achieves safety without sacrificing user experience.
Most medical AI systems today fall into one of two categories:
End-to-End Generative LLMs
Retrieval-Augmented Generation (RAG)
Both approaches share weaknesses:
Structured Intelligence Retrieval (SIR) flips the paradigm:
SSIR, as SIR's medical instantiation, brings these advantages to triage:
In regulated healthcare and similarly constrained domains, retrieval-first is not just an optimization—it's a necessity. SSIR represents a production-grade embodiment of that principle.
© 2025 Symptomloop Inc. All rights reserved.
This document contains proprietary and confidential information. Unauthorized reproduction or distribution is strictly prohibited.