Building a Local Semantic Search Engine - Part 4: Caching for Speed
Bart Gottschalk Bart Gottschalk

Building a Local Semantic Search Engine - Part 4: Caching for Speed

First search on a new directory: wait for every chunk to embed. A hundred chunks? A few seconds. A thousand? You're waiting—and burning electricity (or API dollars if you're using a cloud service). Second search: instant. The difference? A JSON file storing pre-computed vectors. Caching turned "wait for it" into "already done."

Read More