在HTML中搜索和替换单词

what I'm trying to do is make a 'jargon buster'. Basically I have some html and some glossary terms in a database. When the person clicks on jargon buster it replaces the words in the text with a nice tooltip (wztooltip) which shows them the meanings.

I've been trying hard on this one and been looking heavily at this question Regex / DOMDocument - match and replace text not in a link

and it seems like the answer lies in the simple_html_dom libs but I'm having trouble getting it to work. Obviously any words already linked don't get touched. Here is a strip down of what I've got.

$html = str_get_html($article['content']);

$query_glossary = "SELECT word,glossary_term_id,info FROM glossary_terms WHERE status = 1  ORDER BY LENGTH(word) DESC";
$result_glossary = mysql_query_run($query_glossary);

while($glossary = mysql_fetch_array($result_glossary)) {
    $glossary_link = SITEURL.'/glossary/term/'.string_to_url($glossary['word']).'-'.$glossary['glossary_term_id'];
    if(strlen($glossary['info'])>400) {
        $glossary_info = substr(strip_tags($glossary['info']),0,350).' ...<br /> <a href="'.$glossary_link.'">Read More</a>';
    }
    else {
        $glossary_info = $glossary['info'];
    }
    $glossary_tip = 'href="javascript:;" onmouseout="UnTip();" class="article_jargon_highligher" onmouseover="'.tooltip_javascript('<a href="'.$glossary_link.'">'.$glossary['word'].'</a>',$glossary_info,400,1,0,1).'"';
    $glossary_word = $glossary['word'];
    $glossary_word = preg_quote($glossary_word,'/');

    //once done we can replace the words with a nice tip    
    foreach ($html->find('text') as $element) {
        if (!in_array($element->parent()->tag,array())) {
            //problems are case aren't taken into account and grammer
            $element->innertext = str_ireplace(''.$glossary['word'].' ',' <a '.$glossary_tip.' >'.$glossary['word'].'</a> ', $element->innertext);

           //$element->innertext = str_ireplace(''.$glossary['word'].',',' <a '.$glossary_tip.'>'.$glossary['word'].'</a> ', $element->innertext);
           //$element->innertext = preg_replace ("/\s(".$glossary_word.")\s/ise","nothing(' <a'.'$glossary_tip.'>'.'$1'.'</a> ')" , $element->innertext);
          // $element->innertext = str_replace('__glossary_tip_replace__',$glossary_tip, $element->innertext);
        }
    }
}
$article['content'] = $html->save();

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dty3416 2011-07-01 18:19
关注
Use the inverted word character \W to select for any characters other than numbers and letters in your regex pattern. Because this would still fail at the boundaries of the text blob, you would also need to test those conditions as well. Thus using the word 'term' as the text you are searching for:

(^term$)|(^term\W)|(\Wterm\W)|(\Wterm$)

The first condition checks to make sure that term isn't the only contents of the blob, the second checks if its the first word, the third if it contained within the blob, and the last if its the last word.

If you want to consider any other characters as word characters (say a hyphen) you would need to repace the \W with [^\w\-].

Hope this helps. There are probably optimizations that can performed as well, but this should at least be a good starting point.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

在HTML中搜索和替换单词 javascript php
2011-06-29 12:14

回答 3 已采纳 Use the inverted word character \W to select for any characters other than numbers and letters in
在Go字符串中替换HTML实体 html
2017-11-13 18:28

回答 1 已采纳 Use the html.UnescapeString function: fmt.Println(html.UnescapeString("Rø&d grød &a
C#中html中img标签src替换 c# html5
2017-06-02 09:19

回答 2 已采纳 ``` string s = "变成 "; s = Regex.Replace(s, "(" + s + ""); ```
HTML前端常用（必记单词）
2021-11-13 14:58

杨不旧的博客测试鼠标滚轮向上向下弹得数字 oEvent.detail 火狐测试鼠标滚轮向上向下弹得数字 return false 阻止浏览器默认事件但是在事件绑定中失效 oEvent.preventDefoult 在事件绑定中用阻止浏览器默认事件（如果单独用只...
替换HTML代码块中的单词而不更改HTML html php
2014-10-30 14:18

回答 4 已采纳 Don't use regex to find text in HTML use a DOM parser instead: You could use DomDocument but be c
前端图片src 地址替换前缀 vue.js webpack 前端
2022-09-07 22:21

回答 4 已采纳你看下这篇博客吧, 应该有用👉 ：前端通过src路径下载图片
一本通 1406 单词替换 c++
2022-01-26 19:50

回答 2 已采纳 #include <iostream> #include <string> int main() { std::string string, word1, word2
学习前端,需要掌握的单词集汇总
2021-07-14 12:02

MmM豆的博客感觉自己音乐不好就放弃学习前端,其实所有语言,常用单词也就500左右,下面笔者整理前端常用的单词翻译趋向于术语一, html div 分隔, 盒子 p 段 h1~h6 标题1到标题6 a 锚 span strong 加重语义 meta table 表格 th ...
HTML怎样通过按钮把内嵌css替换为外联css css3 html 前端
2023-01-19 14:29

回答 2 已采纳在 HTML 中，可以通过 button 按钮来实现替换内嵌 CSS 文件为外联 CSS 文件的操作。这需要使用 JavaScript 来操作 DOM。创建一个 HTML 文件，在 head 标签中添
string在一个句子中替换多个单词 php
2013-11-23 05:37

回答 2 已采纳 do this foreach($tag_id_explode as $id) { $tag_friend_query=mysqli_query($con,"select f_na
PHP如何实现本地html文件标签中的字符串替换？ html5 php
2018-07-19 07:18

回答 11 已采纳需要知道你采用的那个模板引擎，语法不太一样常规的是 ``` {php str_replace("必选的","","权威医学验光配镜，第一次配镜必选的正规医院");}或{str_repla
前端必备单词
2022-01-05 22:21

Java小朝的博客前端必备单词 absolute 绝对的 active 活动的，激活的 align 对齐 alpha 透明度，半透明 anchor 锚记标记是这个单词的缩写 arrow 箭头 auto 自动 background 背景 border 边框 banner 页面上的一个横条 both 二者都...
替换数组PHP中的一个单词[复制] php
2019-06-19 12:22

回答 2 已采纳 foreach ($ar as &$item) { if ($item['title'] === 'My contracts') { $item['title'] = 'S
前端必备基础单词
2022-11-10 02:17

_自有天意的博客前端必备基础单词
前端之HTML篇(二)——HTML标签详解
2022-10-02 14:02

今晚务必早点睡的博客 HTML标签部分详解
没有解决我的问题, 去提问

悬赏问题

¥15 lammps拉伸应力应变曲线分析
¥15 C++ 头文件/宏冲突问题解决
¥15 用comsol模拟大气湍流通过底部加热（温度不同）的腔体
¥50 安卓adb backup备份子用户应用数据失败
¥20 有人能用聚类分析帮我分析一下文本内容嘛
¥15 请问Lammps做复合材料拉伸模拟，应力应变曲线问题
¥30 python代码，帮调试
¥15 #MATLAB仿真#车辆换道路径规划
¥15 java 操作 elasticsearch 8.1 实现索引的重建
¥15 数据可视化Python

在HTML中搜索和替换单词

3条回答 默认 最新

悬赏问题

3条回答默认最新