I'm using Scrapy's SgmlLinkExtractor to parse specific URLs.
I override the start_requests function to crawl dynamic URLs.
It looks like this:
def start_requests(self): ..... yield Request(url.strip(), callback=callbackA)
callbackA does nothing right now.
I also implemented process_value for the SgmlLinkExtractor, but it is never called.
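For context, here is a minimal sketch of the kind of process_value hook I mean. It is illustrative only: the regex and the javascript:goToPage(...) link format are assumptions, not my real code. The callable receives each extracted attribute value and returns the value to use, or None to drop the link.

```python
import re

# Hypothetical pattern: matches values such as
# javascript:goToPage('../other/page.html')
_JS_LINK = re.compile(r"javascript:goToPage\('(.*?)'")


def process_value(value):
    """Return the URL to follow for an extracted attribute value."""
    match = _JS_LINK.search(value)
    if match:
        # Pull the real target out of the JavaScript wrapper.
        return match.group(1)
    # Ordinary links pass through unchanged.
    return value
```

It would be passed to the extractor as SgmlLinkExtractor(allow=(), process_value=process_value).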
This is the rule I'm using:
rules = [Rule(SgmlLinkExtractor(allow=()), callback=callbackB, follow=True),]
Again, callbackB is never called.