从相对路径解析绝对路径

I'm making a web-crawler and I'm trying to figure out a way to find out absolute path from relative path. I took 2 test sites. One in ROR and 1 made using Pyro CMS.

In the latter one, I found href tags with link "index.php". So, If I'm currently crawling at http://example.com/xyz, then my crawler will append and make it http://example.com/xyz/index.php. But the problem is that, I should be appending to root instead i.e. it should have been http://example.com/index.php. So if I crawl http://example.com/xyz/index.php, I'll find another "index.php" which gets appended again.

While in ROR, if the relative path starts with '/', I could've easily known that it is a root site.

I can handle the case of index.php, but there might be so many rules that I need to take care of if I start doing it manually. I'm sure there's an easier way to get this done.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doupai8533 2015-09-17 06:26
关注
In Go, package path is your friend.

You can get the directory or folder from a path with path.Dir(), e.g.

p := "/xyz/index.php" dir := path.Dir(p) fmt.Println("dir:", dir) // Output: "/xyz"

If you find a link with root path (starts with a slash), you can use that as-is.

If it is relative, you can join it with the dir above using path.Join(). Join() will also "clean" the url:

p2 := path.Join(dir, "index.php") fmt.Println("p2:", p2) p3 := path.Join(dir, "./index.php") fmt.Println("p3:", p3) p4 := path.Join(dir, "../index.php") fmt.Println("p4:", p4)

Output:

p2: /xyz/index.php p3: /xyz/index.php p4: /index.php

The "cleaning" tasks performed by path.Join() are done by path.Clean() which you can manually call on any path of course. They are:

Replace multiple slashes with a single slash.

Eliminate each . path name element (the current directory).

Eliminate each inner .. path name element (the parent directory) along with the non-.. element that precedes it.

Eliminate .. elements that begin a rooted path: that is, replace "/.." by "/" at the beginning of a path.

And if you have a "full" url (with schema, host, etc.), you can use the url.Parse() function to obtain a url.URL value from the raw url string which tokenizes the url for you, so you can get the path like this:

uraw := "http://example.com/xyz/index.php" u, err := url.Parse(uraw) if err != nil { fmt.Println("Invalid url:", err) } fmt.Println("Path:", u.Path)

Output:

Path: /xyz/index.php

Try all the examples on the Go Playground.
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

oracle相对路径绝对路径 oracle sql
2021-07-05 22:12

回答 3 已采纳都是绝对路径，但有没有效跟绝对相对没关系。 DIR_EXCEL是数据库里的逻辑目录，/opt/leasing，D:\TEST 对应的是数据库服务器上的物理目录，语句都能创建上，但只有物理目录存在时，
绝对路径怎么改成相对路径 python
2022-11-15 00:17

回答 2 已采纳没法改，建议将这个脚本和图片文件放到同一个目录打包发过去，然后脚本里相对路径写当前图片的路径就行，例如./img.png，到时候运行就可以。
python可以使用相对路径创建文件吗 python
2022-01-14 11:08

回答 3 已采纳可以，使用 isExists 判断是否存在这个文件夹，若没有则 makedirs 创建
include 绝对路径 php,require和include相对路径和绝对路径详解
2021-04-12 21:42

吴聊创业金融的博客开头的路径，例如:./a/a.php (相对当前目录)../common.inc.php (相对上级目录)相对路径需要一个参考目录才能确定文件的最终路径，在包含解析中，不管包含嵌套多少层，这个参考目录是程序执行入口文件所在目录。...
Java web中相对路径和绝对路径。 java
2015-09-12 12:01

回答 4 已采纳以盘符开始的才是绝对路径，以 /开头的是相对web根目录的相对路径，没/的是相对当前路径的相对路径。
C# 如何获取两个路径的相对路径？开发语言
2020-07-22 16:38

回答 1 已采纳 Uri url = new Uri("c:\\a\\b\\c"); Uri relativeUrl = url.MakeRelativeUri(new Uri("c:\\a\\d
Java相对路径和绝对路径 java
2022-02-21 11:44

回答 4 已采纳 1、相对路径：就是相对于自己的目标文件的位置。2、绝对路径：是指文件在硬盘上真正存在的路径。可以这样理解，比如你的文件夹a中有一个test1.txt文件，a的上一级是文件夹b，里面有一个文件test
java相对的路径_Java相对路径总结
2021-03-04 00:53

乐干面的博客 1.基本概念的理解绝对路径：绝对路径就是你的主页上的文件或目录在硬盘上真正的路径，(URL和物理路径)例如：C:xyzest.txt 代表了test.txt文件的绝对路径。http://www.sun.com/index.htm也代表了一个URL绝对路径。...
如何在golang中解析相对路径到绝对路径？
2017-11-13 10:06

回答 1 已采纳 Resolving ~ (denoting the user home) is a different story, and usually it's the shell that resolve
php 文件上传路径问题 php
2021-06-28 08:20

回答 2 已采纳 upload后面再加一个 \
Qt打不开相对路径文件 c++
2019-11-22 18:18

回答 1 已采纳 相对路径打开文件，跟你用什么开发语言、开发框架，没有任何关系。跟你的程序当前运行目录有关。例如双击一个exe，当前运行目录就是exe所在目录 cmd里输入运行一个exe，当前运行目录
java 相对路径转绝对路径_Java相对路径/绝对路径总结(转）
2021-02-27 21:55

大鹏侃金的博客 1.基本概念的理解绝对路径：绝对路径就是你的主页上的文件或目录在硬盘上真正的路径，(URL和物理路径)例如：C:xyz est.txt 代表了test.txt文件的绝对路径。http://www.sun.com/index.htm也代表了一个URL绝对路径。...
php怎么post数据给相对路径文件？ php
2015-06-17 08:51

回答 1 已采纳是系统内部的吗？直接用函数调用不就行了？为什么要用http的方式呢？
java得到相对路径_[Java]JAVA获取相对路径问题的解决
2021-02-26 08:33

weixin_39525307的博客 1.基本概念的理解绝对路径:绝对路径就是你的主页上的文件或目录在硬盘上真正的路径,(URL和物理路径)例如:C:xyz est.txt 代表了test.txt文件的绝对路径.http://www.sun.com/index.htm也代表了一个URL绝对路径.相对...
java获取项目的相对路径_在JAVA文件中获取该项目的相对路径
2021-02-26 10:09

weixin_39633781的博客 1.基本概念的理解绝对路径：绝对路径就是你的主页上的文件或目录在硬盘上真正的路径，(URL和物理路径)例如：C:\xyz\test.txt 代表了test.txt文件的绝对路径。http://www.sun.com/index.htm也代表了一个URL绝对路径。...
没有解决我的问题, 去提问

悬赏问题

¥15 IAR程序莫名变量多重定义
¥15 (标签-UDP|关键词-client)
¥15 关于库卡officelite无法与虚拟机通讯的问题
¥15 qgcomp混合物线性模型分析的代码出现错误：Model aliasing occurred
¥100 已有python代码，要求做成可执行程序，程序设计内容不多
¥15 目标检测项目无法读取视频
¥15 GEO datasets中基因芯片数据仅仅提供了normalized signal如何进行差异分析
¥100 求采集电商背景音乐的方法
¥15 数学建模竞赛求指导帮助
¥15 STM32控制MAX7219问题求解答

从相对路径解析绝​​对路径

1条回答 默认 最新

悬赏问题

从相对路径解析绝对路径

1条回答默认最新