Sources

Grounding, citations, and further reading for When Unstructured Search Isn't Enough.

All of this is optional. The article itself is the tutorial. This page exists for readers who want to follow the citation trail back to the primary sources, see the textbook grounding behind each claim, and read deeper into the literature on Text-to-SQL, knowledge graphs, and query routing.

Nothing on this page is required reading, and you do not need to purchase any of these books. The numbered references in the article hyperlink to the corresponding entries here, so you can jump in at the point of interest and follow the back-to-article link to return.

About the Sources

SLP3: Jurafsky & Martin

Jurafsky, Daniel & James H. Martin. Speech and Language Processing, 3rd ed. (draft).

The standard academic textbook for NLP. Freely available in draft form at web.stanford.edu/~jurafsky/slp3/. Chapter 11 covers information retrieval and question answering, including the formal RAG pipeline, the bag-of-words model, dense retrieval architectures, ColBERT, and the precision/recall/MAP evaluation framework that the article repeatedly cites.

Widdows & Cohen: Large Language Models: How They Work and Why They Matter

Widdows, Dominic & Trevor Cohen. SemanticVectors Publishing, 2025.

Accessible and mathematically grounded survey of LLM architecture and behavior. Chapter 2 traces the mathematical foundations of vector-space retrieval (cosine similarity, tf-idf, PageRank). Chapter 5.3.3 frames RAG as a "computational compromise." Chapter 6 covers hallucination/confabulation, guardrails, and the historical separation of fact stores and language models.

Alammar & Grootendorst: Hands-On Large Language Models

Alammar, Jay & Maarten Grootendorst. O'Reilly Media, 2024.

Practitioner-oriented survey. Chapter 5 covers BERTopic and the c-TF-IDF approach that motivates LLM-driven structure extraction from prose. Chapter 8 covers RAG end-to-end and recommends hybrid search across semantic and keyword retrieval, the precursor to the multi-backend router pattern this article describes.

Rajkumar et al.: Text-to-SQL evaluation

Rajkumar, N., Li, R., & Bahdanau, D. (2022). arXiv:2204.00498.

Evaluates the Text-to-SQL capabilities of large language models, with emphasis on how schema representation affects accuracy. Finds that schema descriptions enriched with column comments and sample values substantially improve generation quality. Available at arxiv.org/abs/2204.00498.

Yu et al.: Spider benchmark

Yu, T., Zhang, R., Yang, K., et al. (2018). EMNLP 2018. arXiv:1809.08887.

Introduces the Spider benchmark, a large-scale human-labeled dataset for complex and cross-domain Text-to-SQL evaluation. The reported 80-85% exact-match accuracy figure that this article cites is measured against Spider. Available at arxiv.org/abs/1809.08887.

Pan et al.: Unifying LLMs and Knowledge Graphs

Pan, S., Luo, L., Wang, Y., et al. (2024). IEEE Transactions on Knowledge and Data Engineering.

Comprehensive survey of LLM-based knowledge graph construction and completion techniques, including entity extraction, relation extraction, and entity resolution. The reference for the article's discussion of automated KG construction. Available at arxiv.org/abs/2306.08302.

Li et al.: TAGe (Table-Augmented Generation)

Li, Z., Zhang, W., Zhang, C., & Song, D. (2024). arXiv:2408.14717.

Explores how LLMs can reason over both textual and tabular data without an explicit routing step. Sits at the frontier between text RAG and structured-data RAG. Available at arxiv.org/abs/2408.14717.

Woods: LUNAR

Woods, W. A. (1973). AFIPS Conference Proceedings, Vol. 42.

Early natural-language interface to a database, demonstrated on a corpus of lunar geology samples returned from the Apollo missions. Historical anchor for the long lineage of NL-to-database work that Text-to-SQL extends. Available at doi.org/10.1016/S0019-9958(73)90507-4.

Warren & Pereira: CHAT-80

Warren, D. H. D. & Pereira, F. C. N. (1982). Computational Linguistics, 8(3-4).

A landmark NL-to-database system, written in Prolog, that demonstrated interpretable query translation on a world-facts dataset. The second historical reference point for the Text-to-SQL lineage. Available at doi.org/10.1016/0004-3702(82)90013-X.

Hogan et al.: Knowledge Graphs (survey)

Hogan, A., Blomqvist, E., Cochez, M., et al. (2021). ACM Computing Surveys, 54(4).

The canonical survey of knowledge graphs, covering data models, representation, querying, refinement, and applications. The reference for the article's working definition of a property graph with typed nodes and edges. Available at arxiv.org/abs/2003.02320.

Lewis et al.: original RAG paper

Lewis, P., Perez, E., Piktus, A., et al. (2020). NeurIPS 2020. arXiv:2005.11401.

The 2020 paper that named retrieval-augmented generation as an architecture. Background reading for the textual-RAG happy path that this article extends with structured backends. Available at arxiv.org/abs/2005.11401.

The Happy Path and the Unhappy Path

8Lewis et al. on the original RAG architecture

Lewis and colleagues introduce retrieval-augmented generation as a named architecture in their 2020 paper. The work defines the two-stage retriever-plus-generator framing and demonstrates that grounding a generator in retrieved evidence improves performance on knowledge-intensive tasks. Every subsequent extension of RAG, including the structured-backend variants this article describes, builds on that two-stage decomposition.

Lewis et al. (2020), Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv:2005.11401

Sources

About the Sources

SLP3: Jurafsky & Martin

Widdows & Cohen: Large Language Models: How They Work and Why They Matter

Alammar & Grootendorst: Hands-On Large Language Models

Rajkumar et al.: Text-to-SQL evaluation

Yu et al.: Spider benchmark

Pan et al.: Unifying LLMs and Knowledge Graphs

Li et al.: TAGe (Table-Augmented Generation)

Woods: LUNAR

Warren & Pereira: CHAT-80

Hogan et al.: Knowledge Graphs (survey)

Lewis et al.: original RAG paper

The Happy Path and the Unhappy Path

8Lewis et al. on the original RAG architecture

9The formal RAG pipeline

10RAG as customization, primarily over text

The Limits of Vector Search

11Vector-space retrieval foundations

12What "close to the query embedding" means

13The bag-of-words limitation

14The Romeo ambiguity

15Bi-encoders versus relational composition

16RAG as a computational compromise

Text-to-SQL: Querying Databases with Natural Language

5Woods: LUNAR (1973)

6Warren & Pereira: CHAT-80 (1982)

17Six decades of natural-language database interfaces

18Why modern LLMs make Text-to-SQL practical

1Rajkumar et al. on schema-aware Text-to-SQL

19The vocabulary-mismatch problem in a new guise

20Inverted index as a structured access path

21Guardrails and the layered-defense logic

2Yu et al.: the Spider benchmark

22Confabulated SQL as a wrong answer with authority

23LLM calibration and the missing verification step

Knowledge Graphs: When Relationships Are the Answer

24ColBERT and the limits of token-level similarity

7Hogan et al. canonical KG survey

25PageRank as an early graph-traversal precedent

26BERTopic and structure extraction from prose

27The symbol-grounding problem and entity resolution

3Pan et al. on LLM-driven KG construction

Hybrid Architectures: Routing Queries to the Right System

28Alammar & Grootendorst on hybrid search

29The classical IR architecture, generalized

Practical Considerations

30Separation of facts and language as the provenance argument

31Evaluation across multiple retrieval modes

Where This Is Heading

4Li et al. on Table-Augmented Generation

32Chain-of-thought, test-time scaling, and the agentic approach