首页 > 开发 > PHP > 正文

php基于Snoopy解析网页html的方法

2024-05-04 22:35:58
字体:
来源:转载
供稿:网友

本文实例讲述了php基于Snoopy解析网页html的方法。分享给大家供大家参考。具体实现方法如下:

set_time_limit(0);$user = array('20517', '20518');header("content-Type: text/html; charset=utf-8");require_once './Snoopy.php';$snoopy = new Snoopy();//$uri = 'http://www.juzimi.com/meitumeiju';$uri = 'http://www.juzimi.com/meitumeiju?page=1';$snoopy->fetch($uri);$con = $snoopy->getResults();$doc = new DOMDocument();$doc->loadHTML($con);/* make a result array ... */$result = [];/* go through all nodes which have class="baby" ... */$i = 0;foreach( ( new DOMXPath( $doc ) )->query( '//*[@id="bdshare"]' ) as $element ) {  $row = $doc->saveHTML($element);  $start = stripos($row, '{');  $end = stripos($row, '}');  $len = $end - $start + 1;  $json = substr($row, $start, $len);  $json = str_replace("'", '"', $json);  $obj = json_decode($json);  $pic = $obj->pic;  $desc = $obj->desc;  $filename = '/Uploads/meiju/card' . date('ymdhis') . $i . '.png';  $file = '/alidata/www/test' . $filename;  $image = file_get_contents($pic);  file_put_contents($file, $image);}

希望本文所述对大家的php程序设计有所帮助。

发表评论 共有条评论
用户名: 密码:
验证码: 匿名发表