The page my crawler downloads comes back full of blanks.

I'm crawling an internship website. The screenshots show how the page looks in the browser and what the crawler returns: clipboard.png

Any advice would be appreciated.

Apr.05,2021

Try a different approach:

#!/usr/bin/env python
# -*- coding: utf-8 -*-

import requests

# Fetch the page HTML and print it
if __name__ == "__main__":
    url = 'https://www.shixiseng.com/interns/c-110100_st-intern_?k=Python'
    # Send a browser-like User-Agent header with the request
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.12 Safari/537.36'
    }
    response = requests.get(url, headers=headers)
    html_text = response.text
    print(html_text)

If you need to parse the HTML you get back, add `from bs4 import BeautifulSoup` and process it with BeautifulSoup.
Hope this helps.
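For the parsing step, here is a minimal sketch of what that could look like. It runs on a made-up `SAMPLE_HTML` string rather than the real page, so the tag names, class names, and the `extract_links` helper are all assumptions for illustration; you would adapt the selectors to whatever the site actually returns. It also collapses extra whitespace in the extracted text, which may help if the raw text is padded with spaces and newlines.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the fetched html_text;
# the real page's tags and class names will differ.
SAMPLE_HTML = """
<html><body>
  <div class="job">
    <a class="title" href="/intern/1">  Python Intern  </a>
  </div>
  <div class="job">
    <a class="title" href="/intern/2">
      Backend   Developer
    </a>
  </div>
</body></html>
"""


def extract_links(html_text):
    """Return (text, href) pairs for every <a> tag that has an href."""
    # "html.parser" is the parser bundled with Python, so no extra install is needed.
    soup = BeautifulSoup(html_text, "html.parser")
    results = []
    for a in soup.find_all("a", href=True):
        # Collapse runs of whitespace so stray spaces and newlines disappear.
        text = " ".join(a.get_text().split())
        results.append((text, a["href"]))
    return results


if __name__ == "__main__":
    for text, href in extract_links(SAMPLE_HTML):
        print(text, href)
```

To use it with the request above, you would pass the downloaded `html_text` to `extract_links` instead of `SAMPLE_HTML`.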
