字符编码问题UTF-8和ISO-8859-1

I have a web application that I'm having problems getting Japanese/Chinese characters to display properly. The thing being that i can display these characters properly when I am hard coding them into an HTML document.

Characters such as:

アイヌの工芸 : ペンシルバニア大学考古学人類学博物館ヒラーコレクション

But when I grab them out of this proprietary database it comes out as junk:

ã¢ã¤ãã®å·¥è¸ : ãã³ã·ã«ããã¢å¤§å¦èå¤å¦äººé¡å¦åç©é¤¨ãã©ã¼ã³ã¬ã¯ã·ã§ã³

Now i have the html document encoded in utf-8

<meta http-equiv="content-type" content="text/html; charset=utf-8"/>

The actual html file itself is saved as "Encoded in utf-8" and not ISO-8859-1 or Western Latin etc.

So the weird thing is that when I use iconv to take the junk character string and convert it from utf-8 to ISO-8859-1 it displays correctly.

iconv("UTF-8", "ISO-8859-1//TRANSLIT", $junk_string)

It seems like the junk string is UTF-8 and when I convert the string to ISO-8859-1 it then displays the characters correctly. This doesn't make sense to me at all.

So I sort of have an answer to my problem but I do not know why it works. I thought that having encoding in UTF-8 was supposed to fix this kind of thing. And I am using Verdana but have tried a couple of other fonts with no success. And the weird thing being that I can hard code the characters with no problem into the html page and they display fine. But when get the same data from the database it is displayed as junk without me changing the encoding to ISO-8859-1.

Anyone have any insight here? And instead of doing this to every piece of data gotten from the database is there a way I can change this on the individual page level? I also tried to change the encoding to

<meta http-equiv="content-type" content="text/html; charset=ISO-8859-1"/>

And the characters from the database still do not display correctly.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douque2016 2011-12-16 16:41
关注
The answer would be you have wrong data in the database. What probably happened is that you did a conversion ISO-8859-1 -> UTF-8 on data that's already in UTF-8. Therefore, doing a conversion UTF-8 -> ISO-8859-1 gives you the original UTF-8 data back.

Make sure you're not calling utf8_encode (which does an ISO-8859-1 -> UTF-8 conversion) on UTF-8 data!

Since every UTF-8 string is also a valid ISO-8859-1 string (well, not quite, but it's commonly extended so that that's the case), you have no errors on the ISO-8859-1 -> UTF-8 conversion over UTF-8 data.

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

PHP输出编码与MySQL数据库中的UTF-8字符串有关 mysql php
2015-06-23 00:47

回答 3 已采纳 You should not change all the character_set% fields, just the three that are affected by SET NAMES
utf-8 Carbon格式的错误字符 laravel php
2018-05-14 18:37

回答 1 已采纳 I had the same problem when trying to use the sk_SK.UTF-8 locale. What helped me to solve the prob
使用html格式在ISO-8859-1环境中发送波兰字符 html php
2013-05-13 08:14

回答 1 已采纳 You cannot do anything with character encoding conversion here. Polish characters simply cannot be
php iso 8859 1转utf8,Python：从ISO-8859-1 / latin1转换为UTF-8
2021-05-06 09:30

凯然的博客没有前缀的字符串)，必须将本机编码(iso8859-1/ latin1，除非使用enigmaticsys.setdefaultencoding函数进行修改)解码为unicode，然后编码为可以显示所需字符的字符集，在这种情况下，会推荐UTF-8。首先，这是一个...
在电子邮件中编码的随机HTML字符 php
2015-04-29 20:45

回答 2 已采纳 My guess is since that's a closing "tr" that shouldn't be there (you have another right after it),
哪个字符编码是字节0不为空？ php
2013-01-29 20:46

回答 2 已采纳 Any straightforward multibyte encoding (e.g. UTF-16 in all forms) will represent each code point a
使用mysql_real_escape_string编码错误，返回空字符串 mysql php
2013-06-13 12:28

回答 1 已采纳 mysql_real_escape_string considers the current connection character set. Your ISO-8859-1 scripts s
php iso 8859 1 utf8,PHP：从ISO-8859-1到“UTF-8”转换“'”字符时出现问题
2021-05-08 16:24

kkcfhdq的博客我运行下面的代码进行测试：PHP：从ISO-8859-1到“UTF-8”转换“'”字符时出现问题// Connect to a latin1 charset database// and retrieve "Georgia O’Keeffe", which contains a "’" character$connec...
在php中显示字符编码 php
2015-11-09 08:52

回答 2 已采纳 You're specifying the charset as UTF-8 in meta: <meta charset="utf-8"/> But you're specif
jQuery加载字符编码问题 javascript jquery php
2014-06-04 22:23

回答 1 已采纳 Play around with utf8_decode or utf8_encode and wrap it around your DB Output: <p><?php
文件中的file_get_contents包含西里尔字符和未定义的编码 php
2014-04-09 13:01

回答 1 已采纳 The file is encoded in CP1251 a.k.a. MS-CYRL a.k.a. "Cyrillic (Windows)". $string = file_get_cont
php iso 8859 1转utf8,Python:从ISO-8859-1/latin1转换为UTF-8
2021-05-06 09:29

新德里的雨的博客对于非unicode字符串(即那些没有u前缀的字符串，如u'\xc4pple')，必须从本机编码(iso8859-1/latin1，除非modified with the enigmatic ^{}函数)解码到^{}，然后编码到可以显示所需字符的字符集，在这种情况下，我...
通过我的URL解码一个字节编码的字符串 php
2012-09-12 13:18

回答 1 已采纳 This method seems to do what you're looking for: http://li.php.net/manual/en/function.stripcslash
PHP 中文转iso8859-1_如何解决php iso 8859 1乱码问题
2021-03-22 20:18

小方有点小方的博客 phpiso 88591乱码的解决办法：首先使用iconv解码为“ISO-8859-1”；然后通过“iconv("GB18030", "UTF-8", $str));”方法转为“GB18030”；最后再还原到“UTF-8”即可解决乱码问题。解决 UTF 文档中的乱码问题在...
php 数据库 iso8859,php – Utf-8字符显示为ISO-8859-1
2021-04-29 09:53

weixin_39846898的博客从插入/读取数据库中的utf8内容时遇到问题.我正在做的所有验证似乎都指出我的数据库中的内容应该是utf8编码的事实,但它似乎是拉丁编码的.最初从CLI从PHP脚本导入数据.组态：Zend Framework Version: 1.10.5mysql-...
没有解决我的问题, 去提问

悬赏问题

¥20 sub地址DHCP问题
¥15 delta降尺度计算的一些细节，有偿
¥15 Arduino红外遥控代码有问题
¥15 数值计算离散正交多项式
¥30 数值计算均差系数编程
¥15 redis-full-check比较两个集群的数据出错
¥15 Matlab编程问题
¥15 训练的多模态特征融合模型准确度很低怎么办
¥15 kylin启动报错log4j类冲突
¥15 超声波模块测距控制点灯，灯的闪烁很不稳定，经过调试发现测的距离偏大

字符编码问题UTF-8和ISO-8859-1

3条回答 默认 最新

悬赏问题

3条回答默认最新