-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize VCF import #87
Comments
@Arkanosis : You will find several annotated VCF on Brownie, in the /var/regovar/files directory |
The new importer should be wrap in a pipeline to become the new official vcf importer : https://github.com/REGOVAR-Pipelines/VCFImporter |
Ok great! :) |
Which means by the way that we really have no reason to use python more than anything else, right? |
yes |
And in addition to this task, we should have a reflexion on the DB schema, I have the intuition that we can remove the "variant id". I thinks that we need it only in the "working table" but if we can avoid to use it ("insert or update" query) it at the step of the import, we will be able to increase a lot performance. see : https://github.com/REGOVAR/Regovar/blob/master/regovar/core/managers/imports/vcf_manager.py#L569 |
Import in
regovar/core/managers/imports/vcf_manager.py
is slow. Therefore:@Oodnadatta @ikit : do you please have a relevant, annotated VCF for me, that is slow enough to import? The slower the better (I can shorten it as needed if that's really too slow for me). Thanks!
The text was updated successfully, but these errors were encountered: