As an aside, there's now Lance data format support in DuckDB through their extension. It has Lance's vector search support available among other things:
I just noticed this, and your post, and haven't yet checked neither (sorry). I'm however doing some vector search benchmarking soon, with DuckDB's options alongside others. So your work caught my attention here.
wolfgangK 14 hours ago [-]
Nice !
My most pressing request for VSS would be efficient binary vectors : is this on the table ?
cigrainger 14 hours ago [-]
I haven't given binary vectors a lot of thought, but I'm exploring RaBitQ[1].
https://github.com/lance-format/lance-duckdb/tree/main?tab=r...
I just noticed this, and your post, and haven't yet checked neither (sorry). I'm however doing some vector search benchmarking soon, with DuckDB's options alongside others. So your work caught my attention here.
[1] https://arxiv.org/abs/2405.12497