Add sqlite version of the corpus #8

Open
eseiver opened this issue Sep 25, 2017 · 2 comments

@eseiver
Collaborator

eseiver commented Sep 25, 2017

Make a SQLite version of the corpus.
The Rabble routers team at PLOS is interested in this!

@sbassi
Collaborator

sbassi commented Sep 27, 2017

@eseiver:
I've been talking with Simon about this, and I got a lot of clues to start researching. The only thing I need from you is the source of the XML files. Do they come from Rhino? I was told that Rhino XML has more information because it includes the subject areas, which can be used as a search parameter. When I look at the XML I can see the subjects, so I presume these are XML files from Rhino. Can you confirm?

@eseiver
Collaborator Author

eseiver commented Oct 3, 2017

I'm not sure which system Rhino is, but I'm pulling them from either content-repo or the journal pages directly. Those two XML forms are equivalent.
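
For anyone picking this up, here is a minimal sketch of what a SQLite version of the corpus with searchable subject areas could look like. It is not the project's implementation. It assumes the article XML is JATS-style, with the DOI in `article-id[@pub-id-type='doi']` and subject areas in `<subject>` elements under `article-meta`. The table names, the function names, and the sample article (including its DOI) are made up for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample article; the DOI and subjects are invented for this sketch.
SAMPLE_XML = """<article>
  <front>
    <article-meta>
      <article-id pub-id-type="doi">10.0000/example.0001</article-id>
      <article-categories>
        <subj-group subj-group-type="Discipline-v3">
          <subject>Biology and life sciences</subject>
          <subject>Genetics</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>An example article</article-title>
      </title-group>
    </article-meta>
  </front>
</article>"""


def create_schema(conn):
    """Create one table for articles and one for their subject areas."""
    conn.executescript(
        """
        CREATE TABLE IF NOT EXISTS articles (
            doi   TEXT PRIMARY KEY,
            title TEXT
        );
        CREATE TABLE IF NOT EXISTS subjects (
            doi     TEXT NOT NULL REFERENCES articles(doi),
            subject TEXT NOT NULL,
            UNIQUE (doi, subject)
        );
        CREATE INDEX IF NOT EXISTS idx_subjects_subject ON subjects(subject);
        """
    )


def add_article(conn, xml_text):
    """Parse one article XML string and store its DOI, title, and subjects."""
    root = ET.fromstring(xml_text)
    meta = root.find("./front/article-meta")
    doi = meta.findtext("article-id[@pub-id-type='doi']")
    title_el = meta.find("title-group/article-title")
    # itertext() also collects text inside inline markup such as <italic>.
    title = "".join(title_el.itertext()).strip() if title_el is not None else None
    subjects = {s.text.strip() for s in meta.iter("subject") if s.text}
    with conn:  # commit on success, roll back on error
        conn.execute("INSERT OR REPLACE INTO articles VALUES (?, ?)", (doi, title))
        conn.executemany(
            "INSERT OR IGNORE INTO subjects VALUES (?, ?)",
            [(doi, s) for s in sorted(subjects)],
        )
    return doi


def find_by_subject(conn, subject):
    """Return the DOIs of all articles tagged with the given subject area."""
    rows = conn.execute(
        "SELECT a.doi FROM articles a JOIN subjects s ON s.doi = a.doi "
        "WHERE s.subject = ? ORDER BY a.doi",
        (subject,),
    )
    return [row[0] for row in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_schema(conn)
    add_article(conn, SAMPLE_XML)
    print(find_by_subject(conn, "Genetics"))
```

Keeping subjects in their own table, one row per article and subject, lets a subject be used as a search parameter with a plain indexed lookup. Whether the real corpus XML carries the subjects in this shape would need to be checked against the actual files.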
