douji0588 2012-12-28 16:38

RegEx to remove http:// and www. if present, in PHP and JS

Could someone please help me with a regular expression (I need it in PHP and in JS) that removes http:// and www. from the beginning of a URL string, and also removes the trailing / if it's there?

For example:

  • http://www.google.com/ would be google.com
  • https://yahoo.com?page=1 would be yahoo.com?page=1
  • fancysite.com/articles/2012/ would be fancysite.com/articles/2012

Here's the code I'm using on the JS side:

row.page_href.replace(/^(https?|ftp):\/\//, '')

And here's the code I'm using on the PHP side:

$urlString = rtrim($urlString, '/');
$urlString = preg_replace('~^(?:https?://)?(?:www[.])?~i', '', $urlString);

As you can see, the JS regex currently removes only the scheme (http://, https:// or ftp://), and the PHP version needs two steps to do everything.

2 answers

  • dongru3726 2012-12-28 16:42

    #(https?(://))?(www.?)?(.*)#i

    This worked fine for me. You could replace the final (.*) with a stricter pattern if you need the result to conform to the RFC rules for a URL.

    Output:

    david@david-desktop ~ $ php -a
    Interactive shell
    
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'https://www.google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'https://google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'http://google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > 
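
For the JS side, here is a minimal sketch of a single-pass version that also handles the trailing slash from the question. The function name stripUrlPrefix is made up for illustration and is not from the thread. Compared with the pattern above, it escapes the dot after www (an unescaped `www.?` would also eat the fourth letter of a host such as wwwidgets.com) and it anchors the match at both ends of the string.

```javascript
// stripUrlPrefix is a hypothetical helper name, not part of the original thread.
// Removes a leading http:// or https://, then an optional "www.",
// and one trailing slash if present, in a single replace call.
function stripUrlPrefix(url) {
  // ^(?:https?:\/\/)?  optional scheme at the start of the string
  // (?:www\.)?         optional literal "www." (dot escaped)
  // (.*?)              the part to keep, matched lazily
  // \/?$               optional single trailing slash at the end
  return url.replace(/^(?:https?:\/\/)?(?:www\.)?(.*?)\/?$/i, '$1');
}

console.log(stripUrlPrefix('http://www.google.com/'));
console.log(stripUrlPrefix('https://yahoo.com?page=1'));
console.log(stripUrlPrefix('fancysite.com/articles/2012/'));
```

The three calls cover the examples from the question. The same pattern should carry over to PHP's preg_replace with a delimiter such as ~ so the slashes need no escaping, though I have only traced the JS version here.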
    