douji0588 2012-12-28 16:38

RegEx to remove http:// and www. if present, in PHP and JS

Could someone please help me with a regular expression (I need it in PHP and in JS) that removes http:// and www. from the beginning of a URL string and removes the trailing / if it's there?

For example:

  • http://www.google.com/ would be google.com
  • https://yahoo.com?page=1 would be yahoo.com?page=1
  • fancysite.com/articles/2012/ would be fancysite.com/articles/2012

Here's the code I'm using on the JS side:

row.page_href.replace(/^(https?|ftp):\/\//, '')

And here's the code I'm using on the PHP side:

$urlString = rtrim($urlString, '/');
$urlString = preg_replace('~^(?:https?://)?(?:www[.])?~i', '', $urlString);

As you can see, the JS regex currently removes only the scheme (http://, https://, or ftp://), and the PHP version needs two steps to do everything.
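One way to fold both steps into a single replace is to alternate between a start-anchored prefix and an end-anchored slash. The following is a minimal JavaScript sketch, not a definitive answer: the name `stripUrl` is hypothetical, and it assumes only http:// and https:// need removing (the ftp scheme from the JS snippet above is left out).

```javascript
// Hypothetical helper: removes a leading scheme and "www.",
// plus any trailing slashes, in one replace call.
function stripUrl(url) {
  // Left alternative: optional scheme and optional "www." at the start.
  // Right alternative: one or more slashes at the very end.
  // The g flag lets both alternatives match within the same call.
  return url.replace(/^(?:https?:\/\/)?(?:www\.)?|\/+$/gi, '');
}

console.log(stripUrl('http://www.google.com/'));       // google.com
console.log(stripUrl('https://yahoo.com?page=1'));     // yahoo.com?page=1
console.log(stripUrl('fancysite.com/articles/2012/')); // fancysite.com/articles/2012
```

The same pattern should carry over to PHP's preg_replace with a different delimiter, since preg_replace replaces every match by default.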

2 answers

  • dongru3726 2012-12-28 16:42

    #(https?(://))?(www.?)?(.*)#i

    This worked just fine for me. You could replace the final (.*) with a stricter pattern that follows the RFC rules for a URL.

    Outputs:

    david@david-desktop ~ $ php -a
    Interactive shell
    
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'https://www.google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'https://google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > $str = preg_replace('#(https?(://))?(www.?)?(.*)#i', '$4', 'http://google.ca');
    php > echo $str . PHP_EOL;
    google.ca
    php > 
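Two caveats about the pattern above are worth checking before relying on it: the dot in `www.?` is unescaped, so it matches any character after "www", and nothing in the pattern strips a trailing slash. The JavaScript sketch below ports the pattern as written and compares it with an assumed stricter variant; the names `stripLoose` and `stripStrict` are hypothetical and not part of the answer.

```javascript
// The answer's pattern ported to JavaScript; $4 is the captured remainder.
function stripLoose(url) {
  return url.replace(/(https?(:\/\/))?(www.?)?(.*)/i, '$4');
}

// Assumed variant: anchored at both ends, dot escaped, and the remainder
// captured lazily so that trailing slashes fall outside the capture group.
function stripStrict(url) {
  return url.replace(/^(?:https?:\/\/)?(?:www\.)?(.*?)\/*$/i, '$1');
}

console.log(stripLoose('https://www.google.ca'));   // google.ca
console.log(stripLoose('wwwidgets.com'));           // dgets.com (the "i" is eaten by the bare dot)
console.log(stripLoose('http://www.google.com/'));  // google.com/ (slash kept)

console.log(stripStrict('wwwidgets.com'));          // wwwidgets.com
console.log(stripStrict('http://www.google.com/')); // google.com
```

Whether the stricter variant is appropriate depends on the inputs; it still accepts anything after the prefix and does not validate the URL.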
    