Skip to content
Open
Show file tree
Hide file tree
Changes from 19 commits
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
3c4f073
feat: Integrate Solr as a supported database in VSB
cwaddingham Sep 9, 2025
e2571b3
Added two utilities for handling bulk import into Pinecone. One to co…
cwaddingham Sep 10, 2025
61ebdfd
Corrected distance and dimensions for YFCC dataset.
cwaddingham Sep 11, 2025
eec3bdc
Added additional libraries to pyproject.toml.
cwaddingham Sep 12, 2025
463ac73
Added additional command line arguments for Solr, specifically around…
cwaddingham Sep 12, 2025
75d3793
Additional error logging on Pinecone index creation.
cwaddingham Sep 12, 2025
33a02ac
refactor(solr): overhaul client/DB; retries, schema ensure, dedupe, r…
cwaddingham Sep 12, 2025
6a76055
Corrected linter errors.
cwaddingham Sep 16, 2025
a864f06
Updated formatting to clear linter errors.
cwaddingham Sep 16, 2025
e1cc061
Updated test files to resolve linter errors.
cwaddingham Sep 16, 2025
8b3ed85
- Updated docker-compose.yaml to set reasonable minimums for system r…
cwaddingham Sep 17, 2025
f61e583
Added new command line option to limit how many queries are run (usef…
cwaddingham Sep 17, 2025
b96f0b1
- Updated README.md with instructions on using the Docker container
cwaddingham Sep 17, 2025
d19ce3b
Added support for query limits and ensured proper closing of database…
cwaddingham Sep 17, 2025
32121a6
Ensured vector IDs are compared with ground truth consistently by cas…
cwaddingham Sep 17, 2025
1482ec2
Updates to allow for query limits from the command line.
cwaddingham Sep 17, 2025
53028d0
Updates to comply with black linter report.
cwaddingham Sep 17, 2025
9ad8863
Regenerate poetry.lock after pyproject.toml changes
cwaddingham Sep 18, 2025
63224ca
Changed hard coded user/password for Solr test to use environment var…
cwaddingham Sep 18, 2025
66a786b
Added yfcc-test as a workload to resolve some Python import errors du…
cwaddingham Sep 19, 2025
ccdc5bb
Added spawn-test-solr as a test to resolve some Python import errors …
cwaddingham Sep 19, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -111,6 +111,7 @@ The following databases are currently supported by VSB:
* [Pinecone](vsb/databases/pinecone/README.md)
* [pgvector](vsb/databases/pgvector/README.md)
* [OpenSearch](vsb/databases/opensearch/README.md)
* [Solr](vsb/databases/solr/README.md)

> [!TIP]
> You can also display the list of supported databases using the following command:
Expand Down
8 changes: 8 additions & 0 deletions docker/solr/configsets/_default/conf/lang/contractions_ca.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
# Set of Catalan contractions for ElisionFilter
# TODO: load this as a resource from the analyzer and sync it in build.xml
d
l
m
n
s
t
15 changes: 15 additions & 0 deletions docker/solr/configsets/_default/conf/lang/contractions_fr.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
# Set of French contractions for ElisionFilter
# TODO: load this as a resource from the analyzer and sync it in build.xml
l
m
t
qu
n
s
j
d
c
jusqu
quoiqu
lorsqu
puisqu
5 changes: 5 additions & 0 deletions docker/solr/configsets/_default/conf/lang/contractions_ga.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
# Set of Irish contractions for ElisionFilter
# TODO: load this as a resource from the analyzer and sync it in build.xml
d
m
b
23 changes: 23 additions & 0 deletions docker/solr/configsets/_default/conf/lang/contractions_it.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
# Set of Italian contractions for ElisionFilter
# TODO: load this as a resource from the analyzer and sync it in build.xml
c
l
all
dall
dell
nell
sull
coll
pell
gl
agl
dagl
degl
negl
sugl
un
m
t
s
v
d
5 changes: 5 additions & 0 deletions docker/solr/configsets/_default/conf/lang/hyphenations_ga.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
# Set of Irish hyphenations for StopFilter
# TODO: load this as a resource from the analyzer and sync it in build.xml
h
n
t
6 changes: 6 additions & 0 deletions docker/solr/configsets/_default/conf/lang/stemdict_nl.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# Set of overrides for the dutch stemmer
# TODO: load this as a resource from the analyzer and sync it in build.xml
fiets fiets
bromfiets bromfiets
ei eier
kind kinder
Loading
Loading