doujiao3346 2013-08-25 19:11
浏览 256
已采纳

如何通过pageid获取维基百科中特定页面的所有链接(id)

I`m trying to build query with Wiki API that will return all internal links from specific article in id format. I have pageId of some article. For example for article "Android (Operational System)" id is 12610483. In my client side i need to work only with id and later obtain all information only by id. My goal is to find all internal links(ids of articles) from give article id.

Unfortunately, the only possible way i found is to obtain links that represented by titles of articles: http://en.wikipedia.org/w/api.php?action=parse&format=json&pageid=12610483&prop=links

Is there any other way to obtain ids of links as well and not only titles?

  • 写回答

2条回答 默认 最新

  • doulou0882 2013-08-26 00:14
    关注

    What you want to do is to use action=query&prop=links to get data from the pagelinks database table, instead of parsing the page text.

    This will still give you only page titles (because a link can lead to a non-existent page, which implies no page id).

    But you can fix that by using prop=links as a generator:

    http://en.wikipedia.org/w/api.php?action=query&format=json&pageids=12610483&generator=links&gpllimit=max

    If the article has many links (like the one you suggested), you will need to use paging (see the gplcontinue element).

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 uniapp uview http 如何实现统一的请求异常信息提示?
  • ¥15 有了解d3和topogram.js库的吗?有偿请教
  • ¥100 任意维数的K均值聚类
  • ¥15 stamps做sbas-insar,时序沉降图怎么画
  • ¥15 买了个传感器,根据商家发的代码和步骤使用但是代码报错了不会改,有没有人可以看看
  • ¥15 关于#Java#的问题,如何解决?
  • ¥15 加热介质是液体,换热器壳侧导热系数和总的导热系数怎么算
  • ¥100 嵌入式系统基于PIC16F882和热敏电阻的数字温度计
  • ¥15 cmd cl 0x000007b
  • ¥20 BAPI_PR_CHANGE how to add account assignment information for service line