How can a Java crawler get the body content of a page whose body is built by a with(document)with(body) script?

Requesting the link over HTTP returns the following content:

<!DOCTYPE html>
<html>
<head>
  <meta charset="utf-8">
  <meta http-equiv="X-UA-Compatible" content="IE=edge">
  <meta http-equiv="Cache-Control" content="no-siteapp" />
  <meta name="data-spm" content="a21ag"/>
  <meta name="renderer" content="webkit">
  <meta name="description" content="">
  <meta name="viewport" content="width=device-width, initial-scale=1">
  <title></title>

    
  
  
      <link rel="stylesheet" href=//g.alicdn.com/dt/op-ebase/3.4.41/op-ebase.css />
    <link rel="stylesheet" href=//g.alicdn.com/dt/op-eportal/1.12.20/login.css />
    <!--[if lte IE 9]>
  <script>if (!/update\.htm/.test(location.href)) window.location = "//sycm.taobao.com/portal/update.htm"; </script>
  <![endif]-->
</head>
<body class="esycm-wrapper" data-spm="8365346"><script>
with(document)with(body)with(insertBefore(createElement("script"),firstChild))setAttribute("exparams","category=&userid=&aplus&yunid=&&asid=AQAAAADhb8xakfQWKgAAAABSM9G79i330w==",id="tb-beacon-aplus",src=(location>"https"?"//g":"//g")+".alicdn.com/alilog/mlog/aplus_v2.js")
</script>
  <div id="app"></div>
      <script type="text/javascript" src=//g.alicdn.com/dt/op-ebase/3.4.41/op-ebase.js crossorigin></script>
    <script type="text/javascript" src=//g.alicdn.com/dt/op-eportal/1.12.20/login.js crossorigin></script>
  </body>
</html>

The content of the body doesn't show up in the response. How can I get it?

Mar.04,2021

You need a headless browser in this situation. The page content is only produced once its JavaScript has run, so you have to wait for the scripts to execute before you parse the content.
If the crawler is written in Node, this may be more convenient, since PhantomJS is easy to use there.
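
To see why a plain HTTP fetch is not enough here, the following is a minimal sketch that checks whether a static response is only a JavaScript shell, that is, whether its mount element (the `<div id="app"></div>` in the response above) is still empty. The class name `ShellPageCheck` and the method `hasEmptyMountPoint` are made up for illustration, and the regex is a rough heuristic rather than a real HTML parser.

```java
import java.util.regex.Pattern;

public class ShellPageCheck {
    // Returns true if the HTML contains an element with the given id whose
    // content is empty or whitespace only, e.g. <div id="app"></div>.
    // Heuristic only: a real crawler would use a proper HTML parser.
    public static boolean hasEmptyMountPoint(String html, String id) {
        Pattern emptyElement = Pattern.compile(
                "<(\\w+)[^>]*\\bid=[\"']?" + Pattern.quote(id)
                        + "[\"']?(?=[\\s>])[^>]*>\\s*</\\1>",
                Pattern.CASE_INSENSITIVE);
        return emptyElement.matcher(html).find();
    }

    public static void main(String[] args) {
        // Shortened copy of the static response shown in the question.
        String staticHtml = "<body class=\"esycm-wrapper\"><div id=\"app\"></div>"
                + "<script src=//g.alicdn.com/dt/op-eportal/1.12.20/login.js crossorigin></script>"
                + "</body>";
        // An empty mount point means the visible content is rendered later by login.js.
        System.out.println(hasEmptyMountPoint(staticHtml, "app"));
    }
}
```

If the check returns true, the crawler has only received the shell, and the page has to be rendered with a JavaScript-capable client before parsing.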

However, HtmlUnit in a Java environment is slightly less compatible, but you can give it a try.
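
If HtmlUnit cannot run the page's scripts, one possible fallback from Java is to launch a real headless browser as a subprocess and read the rendered DOM from its output. This is a swapped-in technique, not something proposed in the answer above. The sketch assumes a Chrome or Chromium binary is installed; the binary name `chromium`, the class name `HeadlessDump`, and the example URL are placeholders. `--headless` and `--dump-dom` are Chrome command-line flags, and pages that render very late may still need additional waiting.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

public class HeadlessDump {
    // Builds the command line for a headless run that prints the serialized
    // DOM to stdout. The binary name differs per system (assumption).
    public static List<String> buildCommand(String browserBinary, String url) {
        return List.of(browserBinary, "--headless", "--disable-gpu", "--dump-dom", url);
    }

    // Starts the browser, collects everything it writes to stdout, and
    // fails if the process exits with a non-zero status.
    public static String renderedHtml(String browserBinary, String url)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(browserBinary, url))
                .redirectError(ProcessBuilder.Redirect.DISCARD) // drop browser log noise
                .start();
        // readAllBytes blocks until the browser closes its stdout.
        String html = new String(process.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("headless browser exited with code " + exitCode);
        }
        return html;
    }

    public static void main(String[] args) throws Exception {
        // Placeholder URL; pass the real page as the first argument.
        String url = args.length > 0 ? args[0] : "https://example.com/";
        String html = renderedHtml("chromium", url);
        System.out.println(html.length() + " characters of rendered HTML");
    }
}
```

The returned string is the DOM after script execution, so it can be handed to whatever HTML parser the crawler already uses.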