Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get a dump of everything but scientific articles #2

Open
wetneb opened this issue Dec 18, 2019 · 5 comments
Open

Get a dump of everything but scientific articles #2

wetneb opened this issue Dec 18, 2019 · 5 comments

Comments

@wetneb
Copy link

wetneb commented Dec 18, 2019

Thanks for this great tool!

I would be interested in generating a dump of all wikidata items, except those which have P31:Q13442814. It's not clear to me if this is doable yet?

@WolfgangFahl
Copy link

Scholarly articles are at https://www.wikidata.org/wiki/Q13442814
https://tools.wmflabs.org/scholia/ has statistics about the amount of triples you'd save on excluding them. It would be only 3% of all triples ... - Still a feature to filter out certain entities might be worthwhile.

@wetneb
Copy link
Author

wetneb commented Jun 15, 2020

It would be only 3% of all triples ...

Are you sure about this? Where do you see this figure? Scholia does announce 11,186,800,006 Wikidata triples but I don't see a figure for the number of triples for scientific articles? I expect that to be much more than 3%…

@WolfgangFahl
Copy link

35718600 | Scholarly articles it says... - yes you are right the number of triples with all properties will be higher than 3% then.

@danbri
Copy link

danbri commented Nov 11, 2020

Any progress on this?

@danbri
Copy link

danbri commented Nov 13, 2020

Some more stats links:

ScholarlyArticle and Astronomical object are interesting subsets, both to extract and keep, or to exclude, depending on purpose.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants