A single-threaded crawler can crawl the pages, but a multithreaded crawler cannot open the URLs?

As the title says: a single-threaded crawler can crawl the site, but the multithreaded version cannot open the URLs. Is the interval between requests from the multithreaded crawler too short, so that it triggers the website's anti-crawler mechanism?

May.22,2021

Pay attention to the delay between requests. I usually only use batch downloading when I am downloading images.
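To make the delay idea concrete, here is a minimal sketch of a throttle shared by all worker threads, so that requests start at least `min_interval` seconds apart no matter how many threads are running. The names `Throttle`, `fetch`, and `crawl`, the `User-Agent` string, and the default interval are assumptions for illustration, not something from the original question; it uses only the Python standard library.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from urllib.request import Request, urlopen


class Throttle:
    """Space out request starts by at least `min_interval` seconds, across all threads."""

    def __init__(self, min_interval):
        self.min_interval = min_interval
        self._lock = threading.Lock()
        self._next_slot = 0.0

    def wait(self):
        # Reserve the next free time slot under the lock, then sleep outside it
        # so other threads can reserve their own slots in the meantime.
        with self._lock:
            now = time.monotonic()
            slot = max(now, self._next_slot)
            self._next_slot = slot + self.min_interval
        delay = slot - now
        if delay > 0:
            time.sleep(delay)


def fetch(url, throttle, timeout=10):
    # Every thread waits for its slot before sending the request.
    throttle.wait()
    req = Request(url, headers={"User-Agent": "example-crawler/0.1"})  # hypothetical UA
    with urlopen(req, timeout=timeout) as resp:
        return resp.status, resp.read()


def crawl(urls, workers=4, min_interval=1.0):
    # One throttle instance is shared by every worker thread.
    throttle = Throttle(min_interval)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda u: fetch(u, throttle), urls))
```

Calling `crawl(["https://example.com/a", "https://example.com/b"])` would still use several threads, but the site sees roughly one request per second. If the site still refuses connections, try raising `min_interval` before adding more workers.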


You can try switching to a different proxy for each request. Most likely the IP was put on a blacklist because the access rate was too high.
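A rough sketch of per-request proxy rotation, assuming you already have a list of proxies you are allowed to use. The addresses in `PROXIES` are placeholders from a documentation-only IP range, and `ProxyRotator` and `fetch_via_proxy` are hypothetical names; only standard-library calls are used.

```python
import itertools
import threading
from urllib.request import ProxyHandler, build_opener

# Placeholder addresses (documentation-only range); replace with real proxies.
PROXIES = [
    "http://192.0.2.10:8080",
    "http://192.0.2.11:8080",
    "http://192.0.2.12:8080",
]


class ProxyRotator:
    """Hand out proxies in round-robin order; safe to share between threads."""

    def __init__(self, proxies):
        self._cycle = itertools.cycle(proxies)
        self._lock = threading.Lock()

    def next_proxy(self):
        with self._lock:
            return next(self._cycle)


def fetch_via_proxy(url, rotator, timeout=10):
    # Build a fresh opener per request so each request goes out through the next proxy.
    proxy = rotator.next_proxy()
    opener = build_opener(ProxyHandler({"http": proxy, "https": proxy}))
    with opener.open(url, timeout=timeout) as resp:
        return proxy, resp.read()
```

Each worker thread would call `fetch_via_proxy(url, rotator)` with one shared `ProxyRotator(PROXIES)`. Rotation spreads requests over several IPs, but it is still worth keeping a delay between requests.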
