Convert disallowed tags to entities in a post while keeping allowed tags

I have a form where a user can post a global notice into the system (for other users to see).
The system outputs HTML directly from the DB (when a user wants to see a notice).
I'd like to allow some HTML tags to stay intact and to have htmlspecialchars() applied to the rest of them.
I already tried the

 str_replace($search, $replace, htmlspecialchars($str))

strategy, but it seems to be really slow. Too slow, actually. It is also not guaranteed to always work. Is there an alternative?
I want something that does the job of strip_tags(), except that instead of stripping the disallowed tags it applies htmlspecialchars() to them.

Added info (by request):

$str can be any size you can think of. I tested with a big string (1M characters, generated randomly with some allowed and some disallowed tags inside; all tags had attributes) to exercise one of the worst-case scenarios, on the logic that if it works for this, it should work for simpler cases.
The server took 5 s to process the complete str_replace() (with htmlspecialchars()). This test was run on my computer, which has a 2 GHz CPU and DDR3 RAM.
Both $search and $replace have a total of 7 replacements. Still, they do not always work: in some cases $search gives false positives or false negatives.
To clarify, I apply these changes while saving to the DB, not while retrieving from the DB.

duanchi6377 2011-06-03 10:46
You might try this code (should be improved):

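    function callback(array $matches) {
        // Turn the matched, already-escaped tag back into real markup.
        return htmlspecialchars_decode($matches[0]);
    }

    $str = 'some <i>string</i> <b>with</b> tags '
         . '<a href="#">some link</a> '
         . '<img alt="" src="http://sstatic.net/stackoverflow/img/favicon.ico"/><hr/>';

    // Escape everything first.
    $str = htmlspecialchars($str);

    // The pattern matches the escaped form (&lt; and &gt;);
    // \2 refers to the tag name captured by (i|a).
    $str = preg_replace_callback(
        '#(&lt;(i|a)(?: .+?)?&gt;.*?&lt;/\2&gt;|&lt;(?:img)(?: .*?)?/&gt;)#',
        'callback',
        $str
    );

    echo $str;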

The regular expression looks (or should look) for two types of strings:

<tag attributes>content</tag>, where the tag name is the same in the opening and closing tags, and attributes and content are optional

<tag attributes/>, where attributes are optional

Tags are listed in the (i|a) part for <tag></tag>-type tags and in the (?:img) part for <tag/>-type tags.

If it finds matching tags, it passes the match to the callback() function, which converts it back using htmlspecialchars_decode(). This is necessary for decoding quotes and other encoded characters in the list of attributes.
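
To illustrate why the decode step also matters for attributes, here is a minimal sketch of the escape/decode round trip on a single tag. The sample tag is hypothetical (it is not from the question), and the flags are passed explicitly so that both calls treat quotes the same way regardless of the PHP version's defaults.

```php
<?php
// Hypothetical sample tag with a quoted attribute and an ampersand.
$tag = '<a href="#" title="Tom & Jerry">';

// Same flags for both directions, so the round trip is symmetric.
$flags = ENT_QUOTES;

// Escaping turns <, >, & and quotes into entities.
$escaped = htmlspecialchars($tag, $flags, 'UTF-8');
echo $escaped, "\n";

// Decoding the same substring restores the original tag, quotes included.
$restored = htmlspecialchars_decode($escaped, $flags);
var_dump($restored === $tag);
```

Without the decode, the attribute quotes would stay as &quot; and the restored tag would not be valid markup.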

I'm not sure it works in all cases, i.e., that it matches all the necessary tags. If it works in general, then the pattern and the callback() function should be improved so that callback() decodes only the < and > characters and the list of attributes; the content of the tags (i.e., the some link part in <a href='#'>some link</a>) must not be decoded.
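
One possible way to make that improvement, as a rough sketch rather than a finished solution: match each escaped opening, closing, or self-closing tag on its own instead of matching whole <tag>...</tag> pairs, so that only the tag itself is decoded and the content in between stays escaped. The function name escapeExceptAllowed() and its parameters are hypothetical, PHP 7 or later is assumed, and restored attributes are not filtered, so this alone does not stop things like onclick handlers.

```php
<?php
/**
 * Escape all markup, then restore only the whitelisted tags.
 * Hypothetical helper; tags are restored one by one, not as pairs.
 */
function escapeExceptAllowed(string $html, array $allowedTags): string
{
    $flags = ENT_QUOTES;
    $escaped = htmlspecialchars($html, $flags, 'UTF-8');

    if ($allowedTags === []) {
        return $escaped; // nothing to restore
    }

    // Build the alternation of allowed tag names, e.g. "i|a|img".
    $names = implode('|', array_map(function ($tag) {
        return preg_quote($tag, '#');
    }, $allowedTags));

    // Escaped opening, closing or self-closing tag with optional attributes:
    // &lt;a href=&quot;#&quot;&gt;   &lt;/a&gt;   &lt;img ... /&gt;
    $pattern = '#&lt;/?(?:' . $names . ')(?:\s.*?)?/?&gt;#is';

    $result = preg_replace_callback($pattern, function (array $m) use ($flags) {
        // Decode only the tag; text between tags is never passed here.
        return htmlspecialchars_decode($m[0], $flags);
    }, $escaped);

    // preg_replace_callback() returns null on a PCRE error;
    // fall back to the fully escaped string in that case.
    return $result ?? $escaped;
}

$out = escapeExceptAllowed('<i>x < y</i> <b>bold</b> <a href="#">link</a>', ['i', 'a', 'img']);
echo $out, "\n";
// <i>x &lt; y</i> &lt;b&gt;bold&lt;/b&gt; <a href="#">link</a>
```

Because tags are handled individually, this sketch does not check that every opening tag has a matching closing tag; pairing would need an extra pass or a real HTML parser.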
This answer was accepted by the asker as the best answer.
