Date: 2021-07-01 10:21:17
This article walks through an example of parsing a web page's HTML in PHP with Snoopy. The implementation is as follows:
set_time_limit(0);
$user = array('20517', '20518'); // not used in this snippet
header("Content-Type: text/html; charset=utf-8");
require_once './Snoopy.php';

$snoopy = new Snoopy();
//$uri = 'http://www.juzimi.com/meitumeiju';
$uri = 'http://www.juzimi.com/meitumeiju?page=1';
$snoopy->fetch($uri);
$con = $snoopy->getResults();

$doc = new DOMDocument();
$doc->loadHTML($con);

/* make a result array ... */
$result = [];

/* go through all nodes which have id="bdshare" ... */
$i = 0;
foreach ((new DOMXPath($doc))->query('//*[@id="bdshare"]') as $element) {
    // serialize the node, then cut out the {...} object embedded in it
    $row = $doc->saveHTML($element);
    $start = stripos($row, '{');
    $end = stripos($row, '}');
    $len = $end - $start + 1;
    $json = substr($row, $start, $len);
    // the embedded object uses single quotes; JSON requires double quotes
    $json = str_replace("'", '"', $json);
    $obj = json_decode($json);
    if ($obj === null) {
        continue; // skip nodes whose embedded object cannot be decoded
    }
    $pic = $obj->pic;
    $desc = $obj->desc;

    // download the image and save it under a timestamped file name
    $filename = '/Uploads/meiju/card' . date('ymdhis') . $i . '.png';
    $file = '/alidata/www/test' . $filename;
    $image = file_get_contents($pic);
    file_put_contents($file, $image);
    $i++; // keep file names unique within the same second
}
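The step that cuts the single-quoted {...} object out of the serialized node can be isolated and tried without any network access. The following is a minimal sketch, not part of Snoopy or of the original script: the helper name extractQuotedJson() and the sample markup are hypothetical, chosen only for illustration. It assumes the fragment contains a single object and that no value contains an apostrophe, since every single quote is replaced with a double quote.

```php
<?php
/**
 * Extract a single-quoted, JSON-like object from an HTML fragment.
 * Hypothetical helper for illustration only.
 *
 * Returns the decoded data as an associative array, or null when no
 * object is found or it cannot be decoded.
 */
function extractQuotedJson(string $html): ?array
{
    $start = strpos($html, '{');   // first opening brace
    $end   = strrpos($html, '}');  // last closing brace
    if ($start === false || $end === false || $end < $start) {
        return null;
    }

    $json = substr($html, $start, $end - $start + 1);
    // JSON requires double-quoted strings; this breaks on apostrophes in values
    $json = str_replace("'", '"', $json);

    $data = json_decode($json, true);
    return is_array($data) ? $data : null;
}

// Sample markup is an assumption about what the serialized node might look like
$sample = "<div id=\"bdshare\" data=\"{'pic':'http://example.com/a.jpg','desc':'hello'}\"></div>";
$info = extractQuotedJson($sample);
if ($info !== null) {
    echo $info['pic'], "\n";
}
```

Returning null instead of an object with missing properties lets the calling loop skip a node with a simple check before it tries to download anything.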
Summary: That is all for this article. I hope it helps with your learning.