简单的html dom解析器表到数组（扩展）

There is this website

http://www.oxybet.com/france-vs-iceland/e/5209778/

What I want is to scrape not the full table but PARTS of this table.

For example to only display rows that include sportingbet stoiximan and mybet and I don't need all columns only 1 x 2 columns, also the numbers that are with red must be scraped as is with the red box or just display an asterisk next to them in the scrape can this be done or do I need to scrape the whole table on a database first then query the database?

What I got now is this code I borrowed from another similar question on this forum which is:

<?php

require('simple_html_dom.php');


$html = file_get_html('http://www.oxybet.com/france-vs-iceland/e/5209778/');

$table = $html->find('table', 0);
$rowData = array();


foreach($table->find('tr') as $row) {
// initialize array to store the cell data from each row
$flight = array();

foreach($row->find('td') as $cell) {
    // push the cell's text to the array

    $flight[] = $cell->plaintext;
}
$rowData[] = $flight;
}

echo '<table>';
foreach ($rowData as $row => $tr) {
echo '<tr>'; 
foreach ($tr as $td)
    echo '<td>' . $td .'</td>';
echo '</tr>';
}
echo '</table>';

?>

which returns the full table. What I want mainly is somehow to detect the numbers selected in the red box (in 1 x 2 areas) and display an asterisk next to them in my scrape, secondly I want to know if its possible to scrape specific columns and rows and not everything do i need to use xpath?

I beg for someone to point me in the right direction I spent hours on this, the manual doesn't explain much http://simplehtmldom.sourceforge.net/manual.htm

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doumen6605 2016-10-09 12:54
关注
Link is dead. However, you can do this with xPath and reference the cells that you want by their colour and order, and many more ways too.

This snippet will give you the general gist; taken from a project I'm working on atm:

function __construct($URL) { // make new DOM for nodes $this->dom = new DOMDocument(); // set error level libxml_use_internal_errors(true); // Grab and set HTML Source $this->HTMLSource = file_get_contents($URL); // Load HTML into the dom $this->dom->loadHTML($this->HTMLSource); // Make xPath queryable $this->xpath = new DOMXPath($this->dom); } function xPathQuery($query){ return $this->xpath->query($query); }

Then simply pass a query to your DOMXPath, like //tr[1]
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

具有多个表的PHP简单HTML DOM解析器 html json php
2018-01-21 23:51

回答 1 已采纳 Found the answer to my question with help from user sms who commented above. This php pulls the da
PHP简单的HTML DOM解析器“字符问题 html php
2015-09-08 08:45

回答 1 已采纳 If i escape the characters, i lose them. But you can use addslashes() method for removing them. H
Xpath循环问题，用于将简单的HTML表解析为php数组 html php
2019-02-27 07:51

回答 1 已采纳 $strhtml=' <table id="Details" class="DATA_TABLE DATA_TABLE_WO_TOTAL"> <tr> <
simple php dom清空,php html解析器Simple HTML Dom使用说明
2021-03-25 10:04

luckyton的博客本文章来给大家介绍一下关于Simple HTML Dom解析器的使用方法详解，有需要了解的同学不防进入参考。1. 开始使用首先下载解压缩，然后将simple_html_dom.php文件包含进要编写的脚本文件中，加载要处理的html，支持三...
PHP简单的HTML DOM解析器：保存Dom树 html javascript jquery php
2013-12-27 11:55

回答 1 已采纳 I'd still use ->outertext, but simply save the content to an array, and then you can use file_p
PHP简单HTML DOM解析器在有效URL上返回false html5 php
2017-04-22 17:00

回答 4 已采纳 It looks like HTML DOM parser is failing because the HTML file size is greater than the library's
简单的html dom过滤器表单获取名称和值为php数组 html php
2016-10-15 10:22

回答 1 已采纳 this should do the trick: <?php include_once('simple_html_dom.php'); $url = '<!DOCTYPE ht
php网页解析器,适用于PHP的parse and process的HTML解析器
2021-03-26 14:37

可妈聊育儿的博客 DOMDOM扩展使您可以使用PHP 5通过DOM API通过XML文档进行操作。它是W3C的Document Object Model Core Level3的实现，它是一种平台和语言无关的界面，允许程序和脚本动态访问和更新。文件的内容，结构和样式。DOM能够...
PHP - 通过DOM解析html表 php
2013-05-05 10:50

回答 2 已采纳 There you go (you have to play with the attributes a bit to get your desire output): In this solut
PHP简单的HTML DOM解析器 html php
2014-09-05 08:02

回答 6 已采纳 In this case you can directly point it out with children() method. Example: foreach($html->fin
使用PHP简单的HTML DOM解析器代理 html php
2014-07-21 22:14

回答 2 已采纳 I only changed some parts, but clearly, the proxy example you provided it isn't working. Try this
php抓取dom处理后数据,写爬虫时PHP解析HTML最高效的方法那就是用DomCrawler!
2021-03-26 10:03

学徒MJ的博客需求来源,需要用PHP解析HTML提取我想要的数据用PHP写网站爬虫的时候,需要把爬取的网页进行解析,提取里面想要的数据,这个过程叫做网页HTML中数据结构化。很多人应该知道用phpQuery像JQuery一样的语法进行网页处理,...
PHP DOM解析HTML表 php
2011-11-13 23:46

回答 1 已采纳 Use DOMNodelist->item() (item() expects as argument the index, it's zero-based so 1 will return
php xmldom扩展,PHP_用PHP读取和编写XML DOM的实现代码，用 PHP 读取和编写可扩展标记 - phpStudy...
2021-04-23 05:57

马向文的博客用PHP读取和编写XML DOM的实现代码用 PHP 读取和编写可扩展标记语言(XML)看起来可能有点恐怖。实际上，XML 和它的所有相关技术可能是恐怖的，但是用 PHP 读取和编写 XML 不一定是项恐怖的任务。首先，需要学习一点...
php dom扩展 linux,Linux_Prototype框架是怎样扩展DOM的，Prototype框架最大的部分就是对D - phpStudy...
2021-05-06 06:39

拯救大兵张嘎的博客 Prototype框架是怎样扩展DOM的Prototype框架最大的部分就是对DOM的扩展。Prototype框架里的$()函数返回一个网页DOM元素，框架给这个元素添加了很多方便的方法。举个例子：你可以写这样的代码 $('comments')....
没有解决我的问题, 去提问

悬赏问题

¥15 基于卷积神经网络的声纹识别
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？
¥15 有偿求跨组件数据流路径图
¥15 写一个方法checkPerson，入参实体类Person，出参布尔值
¥15 我想咨询一下路面纹理三维点云数据处理的一些问题，上传的坐标文件里是怎么对无序点进行编号的，以及xy坐标在处理的时候是进行整体模型分片处理的吗
¥15 CSAPPattacklab
¥15 一直显示正在等待HID—ISP
¥15 Python turtle 画图
¥15 stm32开发clion时遇到的编译问题

简单的html dom解析器表到数组（扩展）

1条回答 默认 最新

悬赏问题

1条回答默认最新