douyanguo7964 2010-08-26 17:12
浏览 30
已采纳

需要正则表达式来解析HTML标签[重复]

This question already has an answer here:

Regular expressions are not my forte and could really do with assistance on matching and replacing the following:

In a HTML file I have many instances of HTML like this:

<font class=font8>text text text</font>

The font tag can have different content in either single word or multiple word with spaces and maybe numbers.

I need to find all instances of this and replace with:

<span class="bold">(text that was there)</span>

Thanks James

PS: the HTML was generated from word that is why it is so bad :o)

</div>
  • 写回答

1条回答 默认 最新

  • douzhang2680 2010-08-26 17:17
    关注

    Use getElementsByTagName('font') and the DOMDocument::loadHTML method and iterate through the nodelist based in the ->length, then createElement('span') and setAttribute for the class name value of bold, do a replaceChild to replace it.

    Reference for DOM: http://php.net/manual/en/book.dom.php

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 关于#matlab#的问题:期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707,使系统具有较小的超调量
  • ¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
  • ¥30 截图中的mathematics程序转换成matlab
  • ¥15 动力学代码报错,维度不匹配
  • ¥15 Power query添加列问题
  • ¥50 Kubernetes&Fission&Eleasticsearch
  • ¥15 報錯:Person is not mapped,如何解決?
  • ¥15 c++头文件不能识别CDialog
  • ¥15 Excel发现不可读取的内容
  • ¥15 关于#stm32#的问题:CANOpen的PDO同步传输问题