douhe4336 2014-12-24 19:10
浏览 168
已采纳

从HTML文本中解析并提取数据

I have the below HTML text stored in a variable $domText

<TR class="tableclass">
  <TD>Veteran Job Information</TD>
  <TD>9.00</TD>
  <TD>1.2</TD>
  <TD><INPUT type = "text" name = "notes"></TD>
</TR>

I want to check if the Text content in the First (Here "Veteran Job...") is equal to "Benefit Job" then need to store the values in second and third (Here 9.00 and 1.2) tags to a PHP Variable.

Below is what I have tried to do, But I am getting errors and my code is not working at all. "Invalid argument supplied for foreach()"

        $dom_ChangeResults = new DOMDocument();
        $dom_ChangeResults->loadHTML($domText); //Load the current changes as HTML String
        $dom_TableTags = $dom_ChangeResults->getElementsByTagName("TR"); //Check table data tags for Full time to PartTime Change
        $rows = $dom_TableTags->item(0)->getElementsByTagName('TD');

        /*** loop over the table rows ***/
        foreach ($rows as $row)
        {
            /*** get each column by tag name ***/
            $cols = $row->getElementsByTagName('td');
            /*** echo the values ***/
            echo $cols->item(0)->nodeValue.'<br />';
            echo $cols->item(1)->nodeValue.'<br />';
            echo $cols->item(2)->nodeValue;
            echo '<hr />';
        }
  • 写回答

2条回答 默认 最新

  • duanliaozhi2915 2014-12-24 19:28
    关注

    Don't use capitalized tags with PHP DOMElement. Don't know if this is all code that You gave us, but capitalization in PHP script is main issue, getElementsByTagName('TD') will return empty list, where getElementsByTagName('td') will return populated list.

        $dom_TableTags = $dom_ChangeResults->getElementsByTagName("TR"); //Check table data tags for Full time to PartTime Change
        $rows = $dom_TableTags->item(0)->getElementsByTagName('TD');
    

    shouldn't this be (or You have just one row?):

        $dom_TableTags = $dom_ChangeResults->getElementsByTagName("table"); //Check table data tags for Full time to PartTime Change
        $rows = $dom_TableTags->item(0)->getElementsByTagName('tr');
    

    well here is working sample code

    $domText = <<<DOM
            <TABLE>
            <TR class="tableclass">
              <TD>Veteran Job Information</TD>
              <TD>9.00</TD>
              <TD>1.2</TD>
              <TD><INPUT type = "text" name = "notes"></TD>
            </TR>
            <TR class="tableclass">
              <TD>Veteran Job Information</TD>
              <TD>9.00</TD>
              <TD>1.2</TD>
              <TD><INPUT type = "text" name = "notes"></TD>
            </TR>
            <TR class="tableclass">
              <TD>Veteran Job Information</TD>
              <TD>9.00</TD>
              <TD>1.2</TD>
              <TD><INPUT type = "text" name = "notes"></TD>
            </TR>
            </TABLE>
    DOM;
    
        $dom_ChangeResults = new DOMDocument();
        $dom_ChangeResults->loadHTML($domText); //Load the current changes as HTML String
        $dom_TableTags = $dom_ChangeResults->getElementsByTagName("table"); //Check table data tags for Full time to PartTime Change
        $rows = $dom_TableTags->item(0)->getElementsByTagName('tr');
    
        /*** loop over the table rows ***/
        foreach ($rows as $row)
        {
                /*** get each column by tag name ***/
                $cols = $row->getElementsByTagName('td');
                /*** echo the values ***/
                echo $cols->item(0)->nodeValue.'<br />';
                echo $cols->item(1)->nodeValue.'<br />';
                echo $cols->item(2)->nodeValue;
                echo '<hr />';
        }
    

    EDIT

    To work with data with one <tr> element:

    $dom_ChangeResults = new DOMDocument();
    $dom_ChangeResults->loadHTML($domText); //Load the current changes as HTML String
    //$dom_TableTags = $dom_ChangeResults->getElementsByTagName("tr"); //Check table data tags for Full time to PartTime Change
    $rows = $dom_ChangeResults->getElementsByTagName('tr');
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 R语言Rstudio突然无法启动
  • ¥15 关于#matlab#的问题:提取2个图像的变量作为另外一个图像像元的移动量,计算新的位置创建新的图像并提取第二个图像的变量到新的图像
  • ¥15 改算法,照着压缩包里边,参考其他代码封装的格式 写到main函数里
  • ¥15 用windows做服务的同志有吗
  • ¥60 求一个简单的网页(标签-安全|关键词-上传)
  • ¥35 lstm时间序列共享单车预测,loss值优化,参数优化算法
  • ¥15 Python中的request,如何使用ssr节点,通过代理requests网页。本人在泰国,需要用大陆ip才能玩网页游戏,合法合规。
  • ¥100 为什么这个恒流源电路不能恒流?
  • ¥15 有偿求跨组件数据流路径图
  • ¥15 写一个方法checkPerson,入参实体类Person,出参布尔值