I cannot crawl the links in Facebook's API response. Everything works fine when I crawl other web pages. I'm using Nutch 2.2.1, HBase 0.9 for storage and Solr for indexing. As the seed I'm using:
https://graph.facebook.com/v2.10/me?fields=friends%7Bfeed%7Bpermalink_url%7D%2Cname%7D&access_token=<MY_ACC_TOKEN>
Injecting works fine, and at the end of the crawl cycle my seed is saved in my database. But during fetching, Nutch doesn't see any URLs:
Fetcher: throughput threshold: -1
-finishing thread FetcherThread49, activeThreads=0
Fetcher: throughput threshold sequence: 5
0/0 spinwaiting/active, 0 pages, 0 errors, 0.0 0 pages/s, 0 0 kb/s, 0 URLs in 0 queues
I've tried editing the URL filter file that discards URLs containing characters typical of queries, but nothing changed. I've also already enabled HTTPS support, since it does not work by default.
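To make clear which rule I mean, here is a minimal Python sketch of how I understand that filter to behave. It is not Nutch's code. It assumes the rule in question is the stock `-[?*!@=]` line from `regex-urlfilter.txt`, that the first matching rule decides, and that a URL matching no rule is dropped. The names `DEFAULT_RULES` and `accepts` are made up for this illustration, and `TOKEN` stands in for my access token.

```python
import re

# Assumption: the stock rule that skips URLs containing characters
# typical of queries is "-[?*!@=]", followed by a catch-all "+." rule.
DEFAULT_RULES = [("-", r"[?*!@=]"), ("+", r".")]


def accepts(url, rules=DEFAULT_RULES):
    """Return True if the URL is kept; the first matching rule decides."""
    for sign, pattern in rules:
        if re.search(pattern, url):
            return sign == "+"
    return False  # no rule matched, so the URL is dropped


seed = (
    "https://graph.facebook.com/v2.10/me"
    "?fields=friends%7Bfeed%7Bpermalink_url%7D%2Cname%7D"
    "&access_token=TOKEN"
)

print(accepts(seed))                  # with the query-character rule in place
print(accepts(seed, [("+", r".")]))   # with that rule removed
```

Under these assumptions the seed is rejected only because of the `?` and `=` in its query string, which is why I removed that rule.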
How can I solve this?
user_posts permission, before you can access their feed, right? - CBroe