Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

P1-Retrieved Results Cache #33

Open
ronakice opened this issue Jan 22, 2024 · 6 comments
Open

P1-Retrieved Results Cache #33

ronakice opened this issue Jan 22, 2024 · 6 comments
Assignees
Labels

Comments

@ronakice
Copy link
Member

Provide important cached retrieve results as well as rerank results hosted elsewhere but documented here. I can perhaps do this sometime.

@ronakice ronakice self-assigned this Jan 22, 2024
@ronakice ronakice added help appreciated :D enhancement New feature or request labels Jan 22, 2024
@sahel-sh sahel-sh changed the title Retrieved Results Cache P3-Retrieved Results Cache Jan 29, 2024
@sahel-sh
Copy link
Member

@ronakice it seems like you are looking for volunteers for this one, I unassigned it so that people can pick

@ronakice
Copy link
Member Author

Updating this with the following link: https://github.com/castorini/rank_llm_data

@ronakice
Copy link
Member Author

Please interface with this!

@AndreSlavescu
Copy link
Contributor

interested

@ronakice
Copy link
Member Author

Something like:

If it exists and matches md5 use it, else get it from rank_llm_data if there, else run retrieve.

Eventually, the rerank_results can be added to rank_llm_data too for verification. But not priority for now. This will likely fall into place after we have a nice 2CR after #32 is mature

@ronakice ronakice changed the title P3-Retrieved Results Cache P1-Retrieved Results Cache Feb 15, 2024
@ronakice
Copy link
Member Author

This is super important so bumping to P1. More people want to repro the baselines before jumping into their own dataset and making them download SPLADE indexes every time is probably not optimal

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants