dongzhuo7291 2010-08-17 17:20 采纳率: 100%
浏览 40
已采纳

如何在PHP中检测无意义的文本?

I have comments enabled on my site and I require users to enter at least 30 characters to publish their comments (Just to get some value because they usualy just submitted "I like it") But some users now use simple technique to overcome this and enter e.g.:

"I like it. asdsdf dfdsfsdf tt erretrt re"

As you can see the rest of the text is nonsense. Is there a way (algorithm) how to filter these comments out in PHP ?

  • 写回答

6条回答 默认 最新

  • dongxinyue2817 2010-08-17 17:25
    关注

    Get a dictionary of English words from the net. Check the post has a certain % (maybe 50%? maybe 70%?) of words that are in the dictionary. You can't look for 100%, or names and technical jargon will not be found.

    users will get around this by entering.
    I like it ....................................................
    So then add logic to parse out punctuation.
    Then users will get around it with
    I like it. the the the the the the the the
    Then you will need to parse it for proper English grammer
    Then no one will be able to post on your site becuase it has too many rules.

    Better suggestion: Add comment moderation. Dumb posts get downvoted and go away. Good posts stay.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(5条)

报告相同问题?

悬赏问题

  • ¥15 c语言怎么用printf(“\b \b”)与getch()实现黑框里写入与删除?
  • ¥20 怎么用dlib库的算法识别小麦病虫害
  • ¥15 华为ensp模拟器中S5700交换机在配置过程中老是反复重启
  • ¥15 java写代码遇到问题,求帮助
  • ¥15 uniapp uview http 如何实现统一的请求异常信息提示?
  • ¥15 有了解d3和topogram.js库的吗?有偿请教
  • ¥100 任意维数的K均值聚类
  • ¥15 stamps做sbas-insar,时序沉降图怎么画
  • ¥15 买了个传感器,根据商家发的代码和步骤使用但是代码报错了不会改,有没有人可以看看
  • ¥15 关于#Java#的问题,如何解决?