Ask about the choice of reptiles.

now there is a need to crawl an article from a website, including all the js,css.html files, and then save it to become your own article, which is loaded asynchronously through ajax. So I would like to ask, this kind of demand, which way to achieve better, scrapy splash and puppeteer seem to be similar in principle. In addition to the above two, there is no other framework for my current needs, the language is selected in node and ptyhon for advice.

Html5 node.js python

Apr.19,2022

selenium is good, although inefficient

articles are obtained through ajax , why don't you just use this interface?

finally, I chose puppeteer

. I think that the retro combination of scrapy and bs4 will not fail to apply

dynamic web pages loaded through ajax. It is recommended to use selenium

Previous: Is there a conflict in the submit form validation rules when antd Tab is switched?

Next: About Community Profiles on github

Chrome debugging
Please tell me why chrome debugging js file breakpoint, then why walk to other js files inside? how to avoid this? ...

Css html5 node.js python javascript

Mar.01,2021
How to judge how to pass this verification rule
now there is a requirement that begins with 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 145, 147, 150, 151, 152, 153, 155, 156, 157, 158, 159, 173, 175, 176, 177, 178, 180, 181, 182, 183, 184, 184, 185, 186, 187, 188, 189, 166, 198, 199 ...

Javascript html html5 node.js python

Dec.20,2021

MySQL Query : SELECT * FROM `codeshelper`.`v9_news` WHERE status=99 AND catid='6' ORDER BY rand() LIMIT 5
MySQL Error : Disk full (/tmp/#sql-temptable-64f5-3447e1c-1c275.MAI); waiting for someone to free some space... (errno: 28 "No space left on device")
MySQL Errno : 1021
Message : Disk full (/tmp/#sql-temptable-64f5-3447e1c-1c275.MAI); waiting for someone to free some space... (errno: 28 "No space left on device")
Need Help?