Retrieving by BM25 becomes slower when there are many queries #1704
-
I tried BM25 baseline for MSMARCO passage ranking and succeeded. The retrieving speed is about But when I was going to retrieve more queries (all the |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
You're probably running out of memory. Since the Try running on smaller batches of queries. |
Beta Was this translation helpful? Give feedback.
You're probably running out of memory. Since the
SearchCollection
implementation is multi-threaded, it keeps the hits in memory until all the queries are processed, and the writes out to disk all at once. This simplifies thread synchronization.Try running on smaller batches of queries.