创建DOMDocument：匹配PHP解析器中的某个元素

good evening dear Community,

Well first of all: felize Navidad - I wanna wish you a Merry Christmas!! In my season-break i am workin on a little parser-script.

Today i'm trying to debug a little DOMDocument object in php. Ideally it'd be nice if I could get DOMDocument to output in a array-like format, to store the data in a database!

My example: head over to the url - see the example: the target

I want to filter out the data in the block:

Schulart: BBS
Schulnummer:60119
Anschrift: Berufsbildende Schule Boppard Antoniusstr. 21; 56154 Boppard
Telefon: (0 67 42) 80 61-0
Telefax: (0 67 42) 80 61-29
E-Mail: sekretary@bbs-boppard.de
Internet: website 
Träger:Kreisverwaltung Rhein-Hunsr&#65533;ck-Kreis
letzte Änderung: 08 Feb 2010 14:33:12 von 60119

I have investigated the sourcecode - and found out that the attribute of interest should be this one: class="content"div class="content"> or even better: wfqbeResults

So if i run the DOMDucument way i can use this like so:

$dom->getElementById('wfqbeResults');

here the code is: - my trails

<?php

$dom = new DOMDocument();
@$dom->loadHTMLFile(' -> here the website goes in<- ');
$divElement = $dom->getElementById('wfqbeResults');

$innerHTML= '';
$children = $divElement->childNodes;
foreach ($children as $child) {
   $innerHTML .= $child->ownerDocument->saveXML( $child );
} 
echo $innerHTML;

<?

Duhh: this outputs lot of garbage. The code spits out a lot of html anyway. I have to overhaul the code a bit to get the wanted 9 lines out of the parser:

what is aimed: i want to get out the following:

a. 9 lines with nine labels and nine values. b. I want to prepare the output to store it in a MySQL-DB!

Look forward to some hints greetings zero

展开全部

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

duan19750503 2010-12-24 21:34

关注

Here is the solution return the labels and values in a formatted array ready for input to mysql!

<?php

$dom = new DOMDocument();
@$dom->loadHTMLFile('http://schulen.bildung-rp.de/gehezu/startseite/einzelanzeige.html?tx_wfqbe_pi1%5buid%5d=60119');
$divElement = $dom->getElementById('wfqbeResults');

$innerHTML= '';
$children = $divElement->childNodes;
foreach ($children as $child) {
$innerHTML = $child->ownerDocument->saveXML( $child );

$doc = new DOMDocument();
$doc->loadHTML($innerHTML);
//$divElementNew = $dom->getElementsByTagName('td');
$divElementNew = $dom->getElementsByTagname('td');

    /*** the array to return ***/
    $out = array();
    foreach ($divElementNew as $item)
    {
        /*** add node value to the out array ***/
        $out[] = $item->nodeValue;
    }

echo '<pre>';
print_r($out);
echo '</pre>';

} 

?>

展开全部

本回答被题主选为最佳回答 , 对您是否有帮助呢?

编辑

预览

报告相同问题？

关注问题

php 使用xpath_在PHP中使用XPath
2020-06-20 13:06

cuyi7076的博客 CRUD：创建，读取，更新和删除 CSS：级联样式表 DOM：文档对象模型 JSON：JavaScript对象表示法 RDF：资源描述框架 REST：代表性状态转移 RSS：真正简单的联合 SKU：库存单位 URI：统一资源标识符 ...
php信息采集源码
2018-03-29 08:57

- PHP信息采集的核心在于解析HTML文档，通常会使用如DOMDocument或SimpleXMLElement类来解析HTML结构。 - 通过XPath或CSS选择器，可以定位到目标元素。XPath是一种在XML文档中查找信息的语言，而CSS选择器则更易于...
一个php站点采集工具
2017-12-03 13:00

2. **DOM解析**：PHP的DOMDocument和DOMXPath库用于解析HTML文档，通过XPath表达式定位到目标元素，提取数据。DOM是文档对象模型（Document Object Model）的缩写，它将HTML或XML文档转换为结构化的节点树，方便程序...
大数据可视化vue,node,npm笔记
2024-04-04 16:46

晨光—全栈学习者的博客 大数据可视化 vue3+echarts5 1、前端的跨域请求拿到数据，详见第8篇 2、javascript或其他办法处理接口返回的json数据，计算所需要的结果并且格式化为echarts图表需要的数据结构 post请求后端接口地址返回的json数据...
【burpsuite安全练兵场-客户端15】基于DOM的漏洞-7个实验（全）
2024-06-19 09:21

baimao__沧海的博客目录一、基于DOM的漏洞1、DOM2、污染流漏洞3、共同来源4、会导致基于DOM漏洞的汇点5、防止基于DOM的污染流漏洞二、反射型DOM、存储型DOM三、控制Web消息源1、简述：2、影响：3、使用Web消息作为攻击源构建攻击实验1...
PHP采集程序大全菜鸟必看包含思路小偷程序
2009-08-01 06:27

- **DOMDocument**：PHP提供的DOM解析器，用于加载HTML或XML文档并进行操作。 - **XPath**：基于路径表达式的查询语言，用于在XML文档中定位节点。 - **CSS选择器**：借鉴CSS的语法，方便地选取HTML元素，如`div....
IT技术分类.docx
2023-03-11 14:02

XML用于结构化数据存储和交换，DTD、XML DOM、XSLT、XPath等相关技术用于XML的解析和转换。 9. 其他技术 ASP、AppML、VBScript、Servlet、JSP、Lua、Scala等是其他编程语言和技术，正则表达式用于字符串匹配和处理...
PHP面试题大全
2019-12-20 01:54

Rudon滨海渔村的博客系统限制，只显示了2902行，请下载完整版： ...回答：PHP全称：Hypertext Preprocessor，是一种用来开发动态网站的服务器脚本语言。问题：什么是MVC？回答：MVC由Model（模型）, View（视图...
PHP学习笔记
2019-06-25 01:10

AOP_LIU的博客 - 变量名、常量名、元素下标：区分大小写 /* 可变标识符 */ 可变变量 $i = 3; $k = 'i'; echo $$k; //输出3 可变函数 function func() {echo 'hello!';} $i = 'func'; $i(); //输出hello 可变下标 $i = '...
php 基础代码大全（不断完善中）
2018-07-15 02:42

weixin_30764883的博客变量名、常量名、元素下标：区分大小写 /* 可变标识符 */ 可变变量 $i = 3; $k = 'i'; echo $ $k ; // 输出3 可变函数 function func() { echo 'hello!';} $i = 'func'; $i (); // 输出hello ...
没有解决我的问题, 去提问

码龄粉丝数原力等级 --

创建DOMDocument：匹配PHP解析器中的某个元素

1条回答默认最新

码龄粉丝数原力等级 --

创建DOMDocument：匹配PHP解析器中的某个元素

1条回答 默认 最新

1条回答默认最新