I am a beginner of web-crawling, I had tried crawler4j for static web.
And now, I would like to try crawling this website (https://weedmaps.com/brands) via Nutch+hbase+solr, but I can't even go further.
I had tried other website such as http://sports.sina.com.cn, I can actually index the information to solr.
I wanna know for https://weedmaps.com/brands, the source page doesn't have the explicit out links, how can I crawl it? Can any body suggest the tools or articles? or explain the reason why nutch doesn't work?
Thank you so much.