douyi9447 2012-03-31 13:03

Which specific sanitizing functions do Wikimedia URLs use?

I'm writing a simple query to find URLs on commons.wikimedia.org, but I can't figure out which specific sanitizing rules I should use to get the exact file names used there.

E.g., the flag of Ivory Coast is listed in French as Drapeau_de_la_Côte_d%27Ivoire, so I gather that apostrophes are being escaped but the plain ô isn't. I've seen a lot of other file names with special characters preserved.

Is it safe to assume that all special characters are preserved and all punctuation and/or non-letters are sanitized?


1 answer

  • dqvy87517 2012-03-31 13:11

    Wikipedia's URLs are all percent-encoded in %XX form (two hex digits per byte of the UTF-8 encoding, as the URL RFCs specify). Your browser does the final step for you and decodes them for display, just to make the URLs friendlier.

    So even though Chrome shows the URL as http://en.wikipedia.org/wiki/Flag_of_Côte_d'Ivoire, the original was http://en.wikipedia.org/wiki/Flag_of_C%C3%B4te_d'Ivoire
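As a rough illustration of the point above, here is a minimal Python sketch of percent-encoding and decoding with the standard library's `urllib.parse`. The helper names `encode_title` and `decode_title` are hypothetical, and `safe=""` is an assumption made for the example. This is not Wikimedia's own escaping code, and the site may leave some characters (such as the apostrophe) unescaped in the URLs it emits. The practical takeaway is to decode both sides before comparing file names.

```python
from urllib.parse import quote, unquote


def encode_title(title: str) -> str:
    """Percent-encode a title as UTF-8 bytes in %XX form (hypothetical helper)."""
    # Replacing spaces with underscores mirrors the names seen in the URLs above.
    # safe="" is an assumption: it escapes everything except letters, digits
    # and "_.-~", so the apostrophe is escaped too.
    return quote(title.replace(" ", "_"), safe="")


def decode_title(encoded: str) -> str:
    """Turn %XX escapes back into characters, which is what a browser displays."""
    return unquote(encoded)


encoded = encode_title("Drapeau de la Côte d'Ivoire")
print(encoded)                # ô becomes %C3%B4, the apostrophe becomes %27
print(decode_title(encoded))  # back to the readable form

# A partially escaped name and a fully escaped one decode to the same string,
# so comparing decoded names sidesteps the question of which characters a
# particular page happened to escape.
print(decode_title("Drapeau_de_la_Côte_d%27Ivoire") == decode_title(encoded))
```

Decoding before comparison means the query does not need to know which characters were escaped in any given link.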

    Accepted by the asker as the best answer.
