-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue running demos on Ubuntu 14 #14
Comments
The file corpus.txt should have been automatically extracted from corpus.db when you download the dataset. Run the following command to generate the file: You might want to remove the following folder, so that you have a clean start when you train an LDA in MALLET. Then, try running the following again. |
The file corpus.txt is being created. Here is a longer extract of the error message: Copying [data/demo/infovis/corpus/corpus.db] --> [apps/temp_20140503_190729_997992_3269/data/corpus.db] The error occurs for mallet and gensim. |
There must be other issues that cause these files to be missing. Regenerating these files doesn't fix the root problem. Could you remove the data/demo/infovis and apps/infovis_mallet folders, and run "./demo.py infovis mallet"? What is the full console output? |
Here is the full console output: oliver@ubuntu: Build a topic model (mallet) using a demo dataset (20newsgroups) Setting up the 20newsgroups dataset... Training an LDA topic model using MALLET... Importing a file into MALLET: [data/demo/20newsgroups/corpus/corpus.txt] --> [data/demo/20newsgroups/model-mallet/corpus.mallet] Import a MALLET LDA topic model as a web2py application... Creating app: 20newsgroups_mallet [apps/temp_20140518_102959_056298_2262] |
Ran into some of the same problems here.
curl --insecure --location http://homes.cs.washington.edu/~jcchuang/termite-datasets/$DEMO.zip > $DOWNLOAD_PATH/$DEMO.zip located in fetch_dataset.sh cannot execute
|
Hi there,
I get the following error message on all demos:
Importing a file into MALLET: [data/demo/20newsgroups/corpus/corpus.txt] --> [data/demo/20newsgroups/model-mallet/corpus.mallet]
Traceback (most recent call last):
File "bin/train_mallet.py", line 42, in
main()
File "bin/train_mallet.py", line 39, in main
TrainMallet( args.corpus_path, args.model_path, args.token_regex, args.topics, args.iters, args.quiet, args.overwrite )
File "bin/train_mallet.py", line 25, in TrainMallet
BuildLDA( corpus_filename, model_path, tokenRegex = token_regex, numTopics = num_topics, numIters = num_iters )
File "/home/oliver/Desktop/termite-data-server-master/bin/modellers/MalletLDA.py", line 31, in init
importer.ImportFileOrFolder( tokenRegex )
File "/home/oliver/Desktop/termite-data-server-master/bin/modellers/MalletLDA.py", line 76, in ImportFileOrFolder
self.Shell( command )
File "/home/oliver/Desktop/termite-data-server-master/bin/modellers/MalletLDA.py", line 44, in Shell
p = subprocess.Popen( command, stdout = subprocess.PIPE, stderr = subprocess.STDOUT )
File "/usr/lib/python2.7/subprocess.py", line 710, in init
errread, errwrite)
File "/usr/lib/python2.7/subprocess.py", line 1327, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
Best,
Oliver
The text was updated successfully, but these errors were encountered: