Skip to content

Conversation

msuozzo
Copy link
Member

@msuozzo msuozzo commented Aug 14, 2025

The full crates.io index corpus is too large and growing too fast to
realistically assume we can have access to it all at once for inference
purposes. This new component enables us to keep a fixed-size cache of
snapshots on disk while rotating the least used snapshots out when other
snapshots are requested. We unconditionally keep the current (i.e.
non-archived) index cloned given it is the newest and most likely to be
requested during inference.

The interface also supports loading index portions from an existing
filesystem which means it can also be used transiently to mediate access
to these resources.

@msuozzo msuozzo requested a review from wbxyz August 14, 2025 03:10
@msuozzo msuozzo force-pushed the push-wytvzntoryuz branch 4 times, most recently from 074b92b to 3c65cd3 Compare August 14, 2025 16:28
Copy link
Member

@wbxyz wbxyz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Baring race conditions in the IndexManager, this lgtm.

@msuozzo msuozzo force-pushed the push-wytvzntoryuz branch 3 times, most recently from 3a34638 to 408f0b2 Compare August 28, 2025 20:20
@msuozzo msuozzo force-pushed the push-wytvzntoryuz branch 2 times, most recently from 9297d2b to 1dcf6d2 Compare September 2, 2025 20:59
@msuozzo msuozzo force-pushed the push-wytvzntoryuz branch 3 times, most recently from e2c097f to 9e8e725 Compare September 12, 2025 20:07
@msuozzo msuozzo force-pushed the push-wytvzntoryuz branch 4 times, most recently from dd57d58 to 7f00563 Compare September 30, 2025 16:39
The full crates.io index corpus is too large and growing too fast to
realistically assume we can have access to it all at once for inference
purposes. This new component enables us to keep a fixed-size cache of
snapshots on disk while rotating the least used snapshots out when other
snapshots are requested. We unconditionally keep the current (i.e.
non-archived) index cloned given it is the newest and most likely to be
requested during inference.

The interface also supports loading index portions from an existing
filesystem which means it can also be used transiently to mediate access
to these resources.
@msuozzo msuozzo merged commit 7df1ba2 into google:main Sep 30, 2025
4 checks passed
@msuozzo msuozzo deleted the push-wytvzntoryuz branch September 30, 2025 17:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants