Retrieval
A custom TF-IDF retriever in pure TypeScript. We tokenize, drop stopwords, build uni+bi-gram features, weight by inverse document frequency, and rank by cosine similarity. Deterministic, fast, dependency-free.
const retriever = new Retriever(corpus);
const top = retriever.retrieve(text, 5);
// → [{ id, text, label, similarity }, ...]- · Deterministic per corpus
- · ≤ 5 ms per query
- · No network
- · Stateless
- · Tiny corpora (≤ 3)
- · OOV jargon
- · Cross-language queries