duanlv1366 2018-06-01 12:55
浏览 44
已采纳

SQL搜索字符串中的#字符问题

I use some Regexp in my mySQL search queries and it seems to work fine unless I have a # character in the query.

The Regexp matches based on word boundaries because the field in which this query is searching is entire resumes / curriculum-vitaes stored in the SQL database.

For instance this works as expected and returns the correct number of results:

SELECT COUNT(*) n FROM candidate c WHERE (c.CV REGEXP '[[:<:]]java[[:>:]]');

However, this doesn't and returns 0 results when it should return a few hundred:

SELECT COUNT(*) n FROM candidate c WHERE (c.CV REGEXP '[[:<:]]c#[[:>:]]');

I understand now that this is because I am matching based on word boundaries and # cannot be the end of the word. Interestingly, "C++" works fine though.

Is there a way of modifying this REGEXP so it also works with a string like "c#"?

  • 写回答

5条回答 默认 最新

  • douke1905 2018-09-04 15:22
    关注

    You might be able to use something like this:

    SELECT 'c#' REGEXP '(^|[^a-zA-Z0-9_])c#($|[^a-zA-Z0-9_])'
    SELECT 'java' REGEXP '(^|[^a-zA-Z0-9_])java($|[^a-zA-Z0-9_])'
    

    In newer MySQL versions (8.0.4+) which support regex through ICU instead of Henry Spencer's implementation you can use \w which looks a bit cleaner:

    SELECT 'c#' REGEXP '(^|[^\w])c#($|[^\w])'
    SELECT 'java' REGEXP '(^|[^\w])java($|[^\w])'
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(4条)

报告相同问题?

悬赏问题

  • ¥15 基于单片机的靶位控制系统
  • ¥15 AT89C51控制8位八段数码管显示时钟。
  • ¥15 真我手机蓝牙传输进度消息被关闭了,怎么打开?(关键词-消息通知)
  • ¥15 下图接收小电路,谁知道原理
  • ¥15 装 pytorch 的时候出了好多问题,遇到这种情况怎么处理?
  • ¥20 IOS游览器某宝手机网页版自动立即购买JavaScript脚本
  • ¥15 手机接入宽带网线,如何释放宽带全部速度
  • ¥30 关于#r语言#的问题:如何对R语言中mfgarch包中构建的garch-midas模型进行样本内长期波动率预测和样本外长期波动率预测
  • ¥15 ETLCloud 处理json多层级问题
  • ¥15 matlab中使用gurobi时报错