Benchmarks for compute resources #41

awgymer · 2024-11-13T12:01:24Z

Hi, I have looked through the documentation but I can't find any speed benchmarks or recommended compute resources for achieving a given throughput.

Since we run jobs through a scheduler that requires explicit resource requests, could you shed any light on what you would consider sensible defaults for a process, specifically:

--threads argument
cpus/cores to give a job
memory (overall or per-thread) to give a job
what sort of runtime you might expect for a typical sample with these settings

aquaskyline · 2024-11-15T01:48:12Z

Line 680 of the ClairS preprint gives some figures for running ClairS on a whole genome. If you are distributing ClairS jobs across multiple nodes by setting intervals, you will need to adjust --chunk_size accordingly. For example, say you set --threads 32 for each 5Mbp interval on a single computing node. The best chunk_size is then calculated as 5Mbp/32*4; the constant 4 is there because ClairS uses 4 threads for each chunk.
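The arithmetic above can be sketched in a few lines of Python. This is only an illustration of the rule of thumb in the reply, not part of ClairS: the function name `suggested_chunk_size` and its parameters are hypothetical, and it assumes the expression is read as interval length divided by the number of chunks that can run at once (threads / 4).

```python
def suggested_chunk_size(interval_bp: int, threads: int, threads_per_chunk: int = 4) -> int:
    """Return a chunk size (in bp) following the rule of thumb interval / threads * 4.

    Hypothetical helper for illustration only; the default of 4 threads per
    chunk is taken from the reply above.
    """
    if interval_bp <= 0 or threads <= 0 or threads_per_chunk <= 0:
        raise ValueError("all arguments must be positive")
    # Multiply before dividing so the integer division does not lose precision.
    return interval_bp * threads_per_chunk // threads


if __name__ == "__main__":
    # The example from the reply: a 5 Mbp interval processed with 32 threads.
    print(suggested_chunk_size(5_000_000, 32))
```

With these inputs, 32 threads allow 8 chunks to run concurrently, so the 5 Mbp interval is split into chunks of 625,000 bp, which is the value you would pass to --chunk_size under this reading.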
