You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, I am using Nutch 1.10 version, Selenium 2.44.0 and Firefox 40.0.3 . I wanted to crawl dynamic contents of web pages. I have followed the instructions given in this link https://github.com/momer/nutch-selenium .
but when I execute the Nutchcrawl the process is executing. But when I try to take a dump from the segments it doesn't contain any data content. I am facing this issue only when I include the "protocol-selenium" plugin. Without this plugin I can able to crawl and I am getting the data content while dumping it. I don't know where am I going wrong please correct me and help me in this regard.
I am using the below command to start nutch to crawl:
$ bin/crawl /home/yoganandh/yoga/testnutch/apache-nutch-1.10/runtime/local/urls/seed.txt /home/yoganandh/yoga/testnutch/apache-nutch-1.10/runtime/local/crawl 2
Hi, I am using Nutch 1.10 version, Selenium 2.44.0 and Firefox 40.0.3 . I wanted to crawl dynamic contents of web pages. I have followed the instructions given in this link https://github.com/momer/nutch-selenium .
but when I execute the Nutchcrawl the process is executing. But when I try to take a dump from the segments it doesn't contain any data content. I am facing this issue only when I include the "protocol-selenium" plugin. Without this plugin I can able to crawl and I am getting the data content while dumping it. I don't know where am I going wrong please correct me and help me in this regard.
I am using the below command to start nutch to crawl:
$ bin/crawl /home/yoganandh/yoga/testnutch/apache-nutch-1.10/runtime/local/urls/seed.txt /home/yoganandh/yoga/testnutch/apache-nutch-1.10/runtime/local/crawl 2
dump command:
$ bin/nutch readseg -dump crawl/segments/20151006174816 dumpData1 -nocontent -nofetch -nogenerate -noparse -noparsedata
Thanks in advance.
The text was updated successfully, but these errors were encountered: