dongzhijing8202 2018-12-27 14:03
Views: 68

How to extract names from text in PHP

I want to extract names from a paragraph or other text content using PHP. I tried to do this with the library below.

https://packagist.org/packages/php-text-analysis/php-text-analysis

    $text = "my name is maneesh, and my friend name is Paritosh";
    $freqDist = freq_dist(tokenize($text));
    print_r($freqDist); die;

My expected output is: maneesh, Paritosh

The actual result contains only the frequency of each word:

    (
        [my] => 2
        [name] => 2
        [is] => 2
        [maneesh] => 1
        [and] => 1
        [friend] => 1
        [Paritosh] => 1
    )

1 answer

  • duanbo6482 2018-12-27 14:25

    If you are going to use the library you mentioned, you have to train your model. That means feeding it many examples of the ways in which people can state their name. Even then, it won't be perfect; the accuracy depends on how well you trained the model.
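As a rough illustration of "the ways in which people can say their name", here is a minimal rule-based sketch in plain PHP (7.4 or later). It is not a trained model and does not use php-text-analysis at all: the function `extractNamesByCue` and its list of cue phrases are assumptions made up for this example, and the list is far from complete.

```php
<?php
// Hypothetical helper, not part of php-text-analysis: a rule-based
// stand-in for what a trained model would learn from examples.
function extractNamesByCue(string $text): array
{
    // Phrases that often introduce a name (assumed, incomplete list).
    $cues = ['name is', 'i am', "i'm", 'call me', 'named'];
    $alternation = implode('|', array_map(fn($c) => preg_quote($c, '/'), $cues));

    // Capture the single word that directly follows a cue phrase.
    $pattern = '/\b(?:' . $alternation . ')\s+([\p{L}][\p{L}\'-]*)/iu';
    preg_match_all($pattern, $text, $matches);

    // Remove duplicates while keeping first-seen order.
    return array_values(array_unique($matches[1]));
}

$text = "my name is maneesh, and my friend name is Paritosh";
print_r(extractNamesByCue($text)); // contains "maneesh" and "Paritosh"
```

A fixed list like this misfires easily ("I am happy" would yield "happy") and misses any phrasing it does not know, which is why a model trained on many examples is the more robust route.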

    Moreover, you are getting only word frequencies because that is the analysis you requested by calling freq_dist. I think you have to use corpus analysis for what you want.
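To see why a frequency distribution cannot single out names, it may help to look at a plain-PHP approximation of the same kind of count. This is only a sketch: `wordFrequencies` is a hypothetical helper, not the library's implementation, and its tokenization rule (split on anything that is not a letter, digit or apostrophe) is an assumption.

```php
<?php
// Hypothetical helper: counts how often each token occurs.
// This is NOT how php-text-analysis implements freq_dist.
function wordFrequencies(string $text): array
{
    // Split on every run of characters that cannot be part of a word.
    $tokens = preg_split('/[^\p{L}\p{N}\']+/u', $text, -1, PREG_SPLIT_NO_EMPTY);

    // Map each distinct token to its number of occurrences.
    return array_count_values($tokens);
}

$text = "my name is maneesh, and my friend name is Paritosh";
print_r(wordFrequencies($text));
```

The result maps each token to a count and nothing more, so "maneesh" and "and" are indistinguishable apart from their spelling. Telling them apart needs extra information such as context or a trained model.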
