How to use Beautiful Soup to scrape the movie names and links from the source code of the following web page

I am using Python 3 and bs4 to scrape the latest movies from Movie Paradise (http://www.dytt8.net/), but what comes back is raw page data, which is very messy. How can I use soup.findAll to find the link tags directly and extract them?

import urllib.request
from bs4 import BeautifulSoup

html = urllib.request.urlopen("http://www.dytt8.net/")
bsObj = BeautifulSoup(html, "html.parser")
# Find every div whose class is co_content8
a = bsObj.findAll("div", {"class": "co_content8"})
list1 = []
for i in a:
    # Collect all link tags inside this div
    j = i.findAll("a")
    print(type(j))
    print("###")
    list1.append(str(j))

print("list1 is:", list1)
print(type(list1))
print(len(list1))
for n in list1:
    print(n.split(","))

Part of the source code of the web page is as follows:

Windows python

Mar.11,2021

CSS selectors, XPath, or pyquery are recommended; these traditional approaches work well.
findAll is also acceptable, but the results are not as good.
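As a rough illustration of the CSS selector approach, here is a minimal sketch using BeautifulSoup's select method. The sample markup and the extract_links helper are hypothetical, made up for this example; only the co_content8 class name comes from the question, and the real page's structure may differ.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup standing in for the real page source.
SAMPLE_HTML = """
<div class="co_content8">
  <ul>
    <li><a href="/html/gndy/dyzz/1.html">Movie A</a></li>
    <li><a href="/html/gndy/dyzz/2.html">Movie B</a></li>
  </ul>
</div>
<div class="other"><a href="/index.html">Home</a></div>
"""


def extract_links(html):
    """Return (name, href) pairs for links inside div.co_content8."""
    soup = BeautifulSoup(html, "html.parser")
    # One selector: every <a> with an href inside a div of class co_content8.
    anchors = soup.select("div.co_content8 a[href]")
    return [(a.get_text(strip=True), a["href"]) for a in anchors]


movies = extract_links(SAMPLE_HTML)
for name, href in movies:
    print(name, href)
```

The selector does the filtering in one step, so there is no need to convert tags to strings and split them afterwards.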

First locate the ul under the div, then use findAll to extract each item under the ul, and then extract the href attribute of each link.
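The steps above might look something like the following sketch. It assumes the links sit in a ul inside a div with class co_content8, as in the question's code; the sample markup and the links_under_ul helper are invented for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; the second div has no <ul> on purpose.
PAGE = """
<div class="co_content8">
  <ul>
    <li><a href="/html/gndy/dyzz/1.html">Movie A</a></li>
    <li><a href="/html/gndy/dyzz/2.html">Movie B</a></li>
    <li>No link here</li>
  </ul>
</div>
<div class="co_content8"><p>No list in this block</p></div>
"""


def links_under_ul(html):
    """Walk div -> ul -> a and collect (name, href) pairs."""
    soup = BeautifulSoup(html, "html.parser")
    results = []
    for div in soup.find_all("div", class_="co_content8"):
        ul = div.find("ul")
        if ul is None:
            # Skip blocks that do not contain a list.
            continue
        # href=True keeps only <a> tags that actually have an href attribute.
        for a in ul.find_all("a", href=True):
            results.append((a.get_text(strip=True), a["href"]))
    return results


found = links_under_ul(PAGE)
for name, href in found:
    print(name, href)
```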
You can take a look at the article, which is about the use of BeautifulSoup. I hope it will be helpful.
Try whether this code works. It uses soup.find_all to find the a tags whose href contains /html:

# -*- coding: utf-8 -*-
import re
import urllib.request
from bs4 import BeautifulSoup

html = urllib.request.urlopen('http://www.dytt8.net/')
bsObj = BeautifulSoup(html, 'html.parser')
# Keep only the links whose href matches /html
bsObj1 = bsObj.find_all('a', href=re.compile('/html'))
for i in bsObj1:
    print(i['href'], i.string)
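If the href values turn out to be relative paths, they can be joined with the site address to get full links. The sketch below assumes that is the case; the sample markup and the absolute_movie_links helper are hypothetical. It also uses get_text instead of .string, because .string returns None when a tag has more than one child.

```python
import re
from urllib.parse import urljoin

from bs4 import BeautifulSoup

BASE_URL = "http://www.dytt8.net/"

# Hypothetical sample markup mixing relative, absolute, and unrelated links.
SNIPPET = """
<a href="/html/gndy/dyzz/1.html">Movie A</a>
<a href="http://www.dytt8.net/html/tv/2.html"><b>Show</b> B</a>
<a href="/index.htm">Home</a>
"""


def absolute_movie_links(html, base_url=BASE_URL):
    """Return (absolute_url, name) pairs for links whose href contains /html."""
    soup = BeautifulSoup(html, "html.parser")
    links = []
    for a in soup.find_all("a", href=re.compile("/html")):
        # urljoin leaves absolute URLs unchanged and resolves relative ones.
        links.append((urljoin(base_url, a["href"]), a.get_text(strip=True)))
    return links


full_links = absolute_movie_links(SNIPPET)
for url, name in full_links:
    print(url, name)
```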