How does a crawler determine when to stop?

I'm writing a crawler and want to know: how do I tell when the crawler should stop?
The initial state is a single URL, and then there is a loop like:

while(isNotEmpty(urlList)){
    // do something
}

My idea is the loop above, but the rate at which URLs are enqueued can't keep up with the rate at which they are consumed, so urlList empties and the crawler stops early. I'd like to ask anyone who has written a crawler framework: under what conditions should the crawler stop running?
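For reference, here is a minimal sketch of the loop in the question, assuming a hypothetical `fetch_links(url)` function that returns the URLs found on a page. With a visited set for deduplication, the loop terminates naturally once every reachable URL has been crawled, because no URL is ever enqueued twice:

```python
from collections import deque

def crawl(start_url, fetch_links):
    visited = {start_url}          # dedup: never enqueue a URL twice
    url_list = deque([start_url])  # the frontier ("urlList" in the question)
    order = []
    while url_list:                # while(isNotEmpty(urlList))
        url = url_list.popleft()
        order.append(url)          # "do something" with the page here
        for link in fetch_links(url):
            if link not in visited:
                visited.add(link)
                url_list.append(link)
    return order                   # loop exits when the frontier drains

# Usage with a tiny in-memory "site" instead of real HTTP:
site = {"/": ["/a", "/b"], "/a": ["/b"], "/b": ["/"]}
print(crawl("/", lambda u: site.get(u, [])))  # → ['/', '/a', '/b']
```

Because the crawled URL space is finite and deduplicated, the frontier must eventually drain; an empty urlList then genuinely means "done", not "the producer fell behind".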

Apr.02,2021

The idea is a little strange: the links in urlList are ones you put there yourself — put one in, crawl one. The crawler stops when you stop putting links into urlList; once no new links are discovered, the list drains and the loop exits.


It depends on the specific circumstances of what is being crawled:

1:
2: a Kafka topic
How the crawler stops depends on your own business requirements.
As long as the crawler deduplicates URLs, it will eventually finish.
If the crawler needs to be controllable, use a single process instead of multithreading; then you can stop the crawler by killing the process.
Deploy the crawler project under screen.
