douluchuo0801 2012-11-30 13:06
浏览 135
已采纳

str_replace不会替换阿拉伯字符

<?php 
$utf8_string = 'مع السلامة مع السلامة مع السلامة مع السلامة مع السلامة مع السلامة مع السلامة السلامة الرائعة على الطويلة ';
echo $utf8_string;
echo'<br/><br/>';

$patterns = array("على", "مع");
$replacements   = array("", "");

$r_string = str_replace($patterns, $replacements, $utf8_string);

//echo $r_string;
print_r ($r_string);
echo'<br/>';
//$words = preg_split( "/ ( |مع|على) /",$r_string);
$words = explode(" ",$r_string);

$num = count($words);
echo 'There are <strong>'.$num.'</strong> words.';
?>

I have this code to count the number of words in an arabic sentence.however i want to remove some words and count the rest.i tried to use str_replace, but this way is counting the number of words of the original sentence. can anyone help me?

  • 写回答

3条回答 默认 最新

  • doumeng1089 2012-11-30 13:11
    关注

    You could use:

    $num = count(
        explode(
            " ", 
            str_replace(
                $word, //Word you want to remove from your text.
                "",
                $string //String you want the word to be removed from.
            )
        )
    );
    

    Or even:

    $num = count(
        explode(
            " ", 
            str_replace(
                array("word1", "word2", [...]), //Words you want to remove from your text.
                "",
                $string //String you want the word to be removed from.
            )
        )
    );
    

    EDIT: As pointed out, the above won't work. I tried pinpointing where the error is, and apparently str_replace can't handle arabic characters, even though explode can. PHP is not reliable with non-ascii characters.

    What you can do, alternatively, is:

    $num = Count(explode(" ", $utf8_string)) - Count(array_intersect(explode(" ", $utf8_string), $patterns))
    

    It should return the value you want.

    You could also try writing your own string replacement function, but I would advice against it, seeing you'd have to manually loop through your array and compare each word. Doing so should take longer to run, and make it much more verbose.


    Coming here to warn yall that the correct way to handle this is with the mbstring extension (http://php.net/manual/en/book.mbstring.php). Please use this extension, no the ugly hack/workaround above.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(2条)

报告相同问题?

悬赏问题

  • ¥15 matlab数字图像处理频率域滤波
  • ¥15 在abaqus做了二维正交切削模型,给刀具添加了超声振动条件后输出切削力为什么比普通切削增大这么多
  • ¥15 ELGamal和paillier计算效率谁快?
  • ¥15 file converter 转换格式失败 报错 Error marking filters as finished,如何解决?
  • ¥15 ubuntu系统下挂载磁盘上执行./提示权限不够
  • ¥15 Arcgis相交分析无法绘制一个或多个图形
  • ¥15 关于#r语言#的问题:差异分析前数据准备,报错Error in data[, sampleName1] : subscript out of bounds请问怎么解决呀以下是全部代码:
  • ¥15 seatunnel-web使用SQL组件时候后台报错,无法找到表格
  • ¥15 fpga自动售货机数码管(相关搜索:数字时钟)
  • ¥15 用前端向数据库插入数据,通过debug发现数据能走到后端,但是放行之后就会提示错误