drsxzut183207938 2012-04-29 16:59
浏览 44
已采纳

在div标签之间提取文本 - Simple Html Dom Parser [关闭]

Code :

$html = file_get_html('http://url.com');
$ret = $html->find('div[samplediv]');
echo $ret;

The output I get is just Array. that means it is empty. I am sure the div is preset on the page I am scraping.

Also, another thing I am trying to achieve is, take the text from the html. when I simply convert it to plaintext, it results in lot of unwanted numbers and stuff. So what I am trying to do is, get the text that I see in the browser. (Instead of getting the whole text from the html).

All suggestions are welcome.

  • 写回答

1条回答 默认 最新

  • dtll2016 2012-04-29 17:04
    关注

    Looks like you're outputting the whole document. Try

    echo $ret->innertext;
    

    to just output the contents of the div.

    PS: I just looked this up at on google and found http://simplehtmldom.sourceforge.net/manual.htm

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 mmocr的训练错误,结果全为0
  • ¥15 python的qt5界面
  • ¥15 无线电能传输系统MATLAB仿真问题
  • ¥50 如何用脚本实现输入法的热键设置
  • ¥20 我想使用一些网络协议或者部分协议也行,主要想实现类似于traceroute的一定步长内的路由拓扑功能
  • ¥30 深度学习,前后端连接
  • ¥15 孟德尔随机化结果不一致
  • ¥15 apm2.8飞控罗盘bad health,加速度计校准失败
  • ¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
  • ¥15 谁有desed数据集呀