保持现有HTML实体不变，但转换双引号和单引号

I'm using PHP code to generate my meta description tag, like so:

<meta name="description" content="<?php
echo $this->utf->clean_string(word_limiter(strip_tags(trim($paperResult['file_content'])),27));
?>

Here's an example of the meta description output:

<meta name="description" content="blah blah &#182; &#8230; blah blah "words in quotation marks" blah blah "more words in quotation marks" blah blah" />

The two HTML entities in that example meta description are a paragraph sign (¶) followed by an ellipsis (…). They are already in HTML entity form in the source text, so I want them to remain unchanged. The problem is that I also need the quotation marks within the description to convert to " in order to prevent the meta tag from breaking. Every combination/configuration that I try either does not work or breaks my site because I'm getting the code wrong. For example, when I try the following code, the quotation marks convert to their HTML entity, as desired, but the paragraph symbol and ellipsis entities break because the ampersand character at the beginning of the existing HTML entities gets converted to &. That leaves me with a broken ¶ (&#182;) and a broken … (&#8230;) :

 echo $this->utf->clean_string(word_limiter(htmlspecialchars(strip_tags(trim($paperResult['file_content']))),27));

I've been trying—literally, for days—to figure this out. I've searched extensively in Stack Overflow, to no avail. I just need the existing HTML entities to remain unchanged and quotation marks to be converted to their HTML entity ("). I have studied the ENT_QUOTES option and I know that the solution probably exists therein, but I can't figure out how to incorporate it into my particular line of code. I'm hoping that you PHP gurus will have mercy on this tortured soul! I'd truly appreciate your help.

Thank you!

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douyou9923 2018-10-16 19:43
关注
If it's the contents of the "content" attribute you can do this

$str = 'blah blah ¶ … blah blah "words in quotation marks" blah blah "more words in quotation marks" blah blah'; echo htmlentities($str, ENT_QUOTES, "UTF-8", false);

Output

blah blah ¶ … blah blah "words in quotation marks" blah blah "more words in quotation marks" blah blah

Sandbox

The key thing here is the 4th argument

string htmlentities ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") [, bool $double_encode = TRUE ]]] )

Specifically

double_encode When double_encode is turned off PHP will not encode existing html entities. The default is to convert everything.

That way it doesn't double encode the ampersand.

htmlspecialchars also has a double encode argument.

htmlspecialchars ( string $string [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = ini_get("default_charset") [, bool $double_encode = TRUE ]]] )

$str = 'blah blah ¶ … blah blah "words in quotation marks" blah blah "more words in quotation marks" blah blah'; echo htmlspecialchars($str, ENT_QUOTES, "UTF-8", false);

Output

blah blah ¶ … blah blah "words in quotation marks" blah blah "more words in quotation marks" blah blah

Sandbox

If it's the whole tag, then you'll have to pull out the contents and modify it and then replace it so as to preserve the < and >, but it's not clear in the question if that is the case.

PS there is not a whole lot of difference between htmlspecialchars and htmlentities, it mainly has to do with é accute and other accent things like that, htmlentities encodes those too, if I remember correctly.

UPDATE

I need the solution to be incorporated into my particular format of PHP code (i.e., a single line of PHP that maintains my existing functions/functionality), as miken32 brilliantly did above

To put it in your code,

<meta name="description" content="<?=htmlspecialchars(word_limiter(trim($paperResult['file_content']),27),ENT_QUOTES,"UTF-8",false);?>"/>

UPDATE2

With preg_replace('/[ ]+/', ' ', $string) removes or one or more times +. But it may be better to do it this way preg_replace(['/[ ]+/', '/\s+/'], ' ', $string). Which would remove run on spaces too.

<meta name="description" content="<?=htmlspecialchars(word_limiter(preg_replace('/[ ]+/', ' ', trim($paperResult['file_content'])),27),ENT_QUOTES,"UTF-8",false);?>"/>

Basically what it amounts to is anything that makes the text shorter you probably want to do before word_limiter (whatever that is). And any thing that makes it longer, like changing " to &quote; you probably want to do after (maybe). It just seems more logical to me.

Cheers!
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

保持现有HTML实体不变，但转换双引号和单引号 html php
2018-10-16 19:32

回答 2 已采纳 If it's the contents of the "content" attribute you can do this $str = 'blah blah ¶ …
前端框架中，ts为什么有的地方用双引号，有的地方用单引号呢有什么规律吗 typescript 前端前端框架
2022-04-24 13:39

回答 2 已采纳单双引号没啥区别，但是双引号内部只能用单引号来界定字符串了；标签内部属性值习惯用双引号
字符相关问题，关于单引号和双引号的问题 c语言
2022-04-04 11:22

回答 1 已采纳 char b='h' 是字符型，strlen查的是字符串的长度。char a[] = "ghds\n";``strlen查的是字符串的长度是5；输入ghds的长度是4。 #include <io
HTML和CSS笔记
2022-09-14 22:57

wangkay88的博客前端复习查找笔记
vue加单引号还是双引号的规则是啥啊 vue.js
2022-10-10 10:13

回答 1 已采纳正常情况你绑定变量或者在视图上的语句都是用双引号的例如 v-if = "条件"单引号更多是用来避免和避免更双引号冲突比如你用 v-if = "data == 某个字符串"这
mysql使用单引号报错，双引号却正常 mysql sql 数据库
2022-08-17 20:54

回答 5 已采纳你是全角的引号，和你写代码这些一样，得用半角符号
C加加编程中双引号和单引号有什么区别(语言-c++) c++
2023-03-04 15:41

回答 2 已采纳在C++编程中，双引号和单引号用于表示不同类型的字符字面值。双引号用于表示字符串字面值，而单引号用于表示字符字面值。如果使用双引号括起来的内容，将被视为字符串，输出时会输出双引号内的内容。而使用单引
前端学习笔记 - HTML+CSS
2021-10-07 23:16

茗0309的博客基础认知网页什么是网页网站是指在因特网上...通常我们看到的网页，常见以 .htm 或 .html 后缀结尾的文件，因此将其俗称为 HTML 文件。什么是 HTML HTML 指的是超文本标记语言 (Hyper Text Markup Lang
html里的JavaScript的正则表达式匹配字符串怎么写带单引号？双引号？不带引号？ css html5 javascript 有问必答
2021-08-05 20:09

回答 2 已采纳正则没有classList属性。。获取对应的dom移除hidden样式，圈出来那句改成下面的就可以。而且是hidden要用引号扩起，要不是变量了。有帮助麻烦点个采纳【本回答右上角】，谢谢~~有其他问
c#语句的双引号和单引号 c#
2015-07-02 10:50

回答 8 已采纳应该是这样吧： "+ count+ ", 'Name"+ count.ToString()+" ', 'Value" +count.ToString()+"' , " +count.ToStr
Mysql数据库中遇到奇怪的单引号双引号语法问题？ mysql 数据库
2023-03-21 07:18

回答 5 已采纳你想取别名，那应该把前面的加法用括号括起来，后面的别名不要加引号否则应该写as关键字如果你连续写两个单引号，中间不加空格，这是转义了如果中间加空格，那是连续两个字符串，拼接了
【WEB前端开发】基础知识大总结（HTML5+CSS3）
2022-01-26 21:21

我想养只猫 •͓͡•ʔ的博客覆盖HTML5+CSS3基础知识，内容包括：转义字符、表单标签、语义化标签、Head标签、CSS引用方式、CSS背景属性、CSS文本属性、基础选择器、伪类选择器、伪元素选择器、CSS优先级、块级元素与行内元元素、盒子模型、定位...
char数组和int数组初始化双引号问题 c语言
2023-02-17 16:47

回答 3 已采纳 int数组不能这么进行初始化的。字符串只能初始化char数组编译器认为int数组是要输入整数，类型不一致就不让你初始化
【前端】HTML&CSS
2022-08-11 14:16

Uaena.&的博客 HTML的全称为超文本标记语言，是一种标记语言。它包括一系列标签．通过这些标签可以将网络上的文档格式统一，使分散的Internet资源连接为一个逻辑整体。HTML文本是由HTML命令组成的描述性文本，HTML命令可以说明文字...
HTML+CSS基础自学笔记（前端入门）
2021-05-11 15:58

Junfu Chang的博客根据教程内容及查阅W3C相关文档，本文系统介绍了自学的基础Html及CSS相关知识，适合有从事前端开发或学习了解意向的前端小白，仅供参考，若发现错误望及时指正！ word文档下载链接： 1、蓝奏云链接下载（建议） ...
web前端知识总结一（HTMl+CSS）
2022-08-30 21:41

木头的猫.的博客 HTML+CSS知识
前端之JavaScript
2022-05-08 09:16

栋zzzz的博客 1.3 string字符串类型 JS中也是不区分字符串和字符的,剩下的都基本和Java类似了,另外由于这里的字符串不区分单双引号 因此如果一个字符串中有其他单双引号,外面单引号里面可以使用双引号,外面单引号里面可以使用双...
没有解决我的问题, 去提问

悬赏问题

¥15 怎么获取下面的： glove_word2id.json和 glove_numpy.npy 这两个文件
¥15 js调用html页面需要隐藏某个按钮
¥15 ads仿真结果在圆图上是怎么读数的
¥20 Cotex M3的调试和程序执行方式是什么样的？
¥20 java项目连接sqlserver时报ssl相关错误
¥15 一道python难题3
¥15 牛顿斯科特系数表表示
¥15 arduino 步进电机
¥20 程序进入HardFault_Handler
¥15 oracle集群安装出bug

保持现有HTML实体不变，但转换双引号和单引号

2条回答 默认 最新

悬赏问题

2条回答默认最新