Embedding Models Compared: Retrieval Quality, Cost, and Latency
A practical embedding model comparison for retrieval quality, vector size, latency, cost, and self-hosting tradeoffs.
Embeddings coverage in this archive spans 3 posts from Apr 2023 to Jul 2023 and treats embeddings as a production discipline: evaluation loops, tool boundaries, escalation paths, and cost control. The strongest adjacent threads are ai, go, and search. Recurring title motifs include embedding, models, compared, and retrieval.
A practical embedding model comparison for retrieval quality, vector size, latency, cost, and self-hosting tradeoffs.
A hands-on walkthrough of building semantic search with Go, OpenAI embeddings, and pgvector. Includes chunking strategies, hybrid retrieval, and the gotchas I hit along the way.
A practical guide to vector databases -- what they store, how similarity search works, and the architectural decisions that matter in production.