从MediaWiki API调用中提取内容（XML，cURL）

URL:

http://en.wikipedia.org/w/api.php?action=parse&prop=text&page=Lost_(TV_series)&format=xml

This outputs something like:

<api><parse><text xml:space="preserve">text...</text></parse></api>

How do I get just the content between <text xml:space="preserve"> and </text>?

I used curl to fetch all the content from this URL. So this gives me:

$html = curl_exec($curl_handle);

What's the next step?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongtang3155 2010-09-13 08:07
关注
Use PHP DOM to parse it. Do it like this:

//you already have input text in $html $html = '<api><parse><text xml:space="preserve">text...</text></parse></api>'; //parsing begins here: $doc = new DOMDocument(); @$doc->loadHTML($html); $nodes = $doc->getElementsByTagName('text'); //display what you need: echo $nodes->item(0)->nodeValue;

This outputs:

text...
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

使用PHP从MediaWiki数据库中提取压缩文本 mysql php
2012-11-26 20:05

回答 2 已采纳 From Text table: old_flags Comma-separated list of flags. Contains the following possibl
如何从其他专题页面调用功能 php
2017-01-25 14:14

回答 2 已采纳 You should refactor your code in such a way that anything that's needed by multiple special pages
子类别未列在MediaWiki的父类别页面中 php
2017-12-19 18:02

回答 1 已采纳 I could resolve the issue following an advice from Ciencia Al Poder. I contribute the answer her
HistoryOfPage:返回非次要链接的扩展名到MediaWiki页面
2021-05-25 09:05

在PHP中，可以使用cURL库或者file_get_contents函数发起HTTP请求来调用API。获取到JSON或XML格式的响应后，我们需要解析这些数据，提取出非次要链接的扩展名。非次要链接通常指的是那些不是页面主要内容但仍然重要的...
如何在MediaWiki中列出所有用户？ php
2015-03-18 11:55

回答 1 已采纳 You will have to query the user table in the databas. Something like this (have a look in the manu
MediaWiki中的不同链接 php
2012-04-02 19:41

回答 2 已采纳 Mediawiki allows you to wrap html tags around links; you can set the default to not open a new ta
MediaWiki批量页面重命名 php
2016-04-11 10:18

回答 2 已采纳 I could finish the task. Here are the steps: Backup your database Execute this to export all pag
MediaWiki安装插件Semantic MediaWIKI + PageForms
2021-10-27 15:32

失去斗志的菜鸟的博客 Semantic MediaWiki - 主页 (zh-hans) - semantic-mediawiki.org 官方安装教程 Installation – Quick Guide - semantic-mediawiki.org 一、安装Composer Composer是PHP项目的依赖管理工具，通过此工具可以方便...
升级Fedora 24 Mediawiki站点时出错 php
2016-11-25 22:38

回答 1 已采纳 The wiki move instructions on the MediaWiki site only work if you are moving from and to the EXACT
在usbwebserver上更新php版本 php
2016-11-04 16:25

回答 1 已采纳 Download the Windows release of PHP 5.6 from the official site, open the archive and overwrite all
mediawiki密码重置不发送电子邮件和创建用户不工作 php
2014-10-09 15:12

回答 3 已采纳 withing LocalSettings.php the include path set incorrectly. set_include_path( implode( PATH_SEPAR
jexus php 重写,Jexus 支持PHP的三种方式
2021-04-23 13:14

蒸汽猫marterio的博客 1、安装PHP-CGI：[azureuser@mono ~]$ sudo yum -y install php-cgi2、配置：1)修改“/etc/php.ini”文件:找到cgi.force_redirect=1一行，把前边的"#"号去掉，把值从1改为0，如：cgi.force_redirect=02)修改jws.conf...
Wiki安装
2020-08-31 19:51

传而习乎的博客如果你继续向下滚动,你会看到所有在php中已经启用的模块。mysql是没有列出,这意味着我们没有在php5支持mysql。原文链接：https://blog.csdn.net/weixin_41978547/article/details/79886744 ubuntu安装PHP扩展...
【Spark NLP】第 14 章：搜索引擎
2022-10-30 14:47

Sonhhxg_柒的博客中，您正在组合存储在不同索引中的数据，也许还有其他类型的数据存储，并一次搜索所有数据。我们使用排名的对数，以便折扣将列表中较早的项目比列表中较晚的项目更强烈地分开。然后，我们将构建一个查询函数，允许...
Go学习路线
2022-05-02 14:37

kgduu的博客 API 服务和工具图形语言 GraphJin - 用于 Postgres 的即时 GraphQL API。无需代码，将 GraphQL 编译为 SQL。 MTProto MTProto - 在纯 Go 上编写的 Telegram API 的完整本实现。天文学 go-fits - FITS（灵活...
zbbix服务器搭建_zabbix服务器搭建
2020-12-23 15:36

耳鸣的大金的博客 zabbix服务器源码安装参看官方文档这里不做过多的翻译，我的系统是centos6.5，安装的时候是base安装，所以要装一些其他依赖包，除此之外，还有一些php插件：extension=bcmath.soextension=gd.soextension=gettext....
Jexus 支持PHP的三种方式
2013-10-14 22:12

weixin_30394669的博客 Jexus不仅支持ASP.NET，而且能够通个自带的PHP-FCGI服务以及PHP-FPM等方式灵活支持PHP而且还可以以.NET(Phalanger)方式支持PHP。 PHP-FCGI服务支持PHP 1、安装PHP-CGI： [azureuser@mono ~]$ sudo yum -y install...
python代码案例详解-Python代码样例列表
2020-11-01 12:05

weixin_37988176的博客从日志文件中提取ip并找到归属地完成输出.py │ 使用Python完成访问同时下载网页内容的方法.py │ 分享冒泡排序与选择排序源码示例.py │ 初学python怎么用while循环笔记分享.py │ 可视化SVM分类器开源实现的...
Buildroot笔记
2019-11-20 11:31

hceng_blog的博客进行配置，编译出一个完整的、可以直接烧写到机器上运行的Linux系统文件(包含bootloader、kernel、rootfs以及rootfs中的各种库和应用程序)。回想构建开源软件包的流程，工作流大致如下：获取：获取源代码 ...
python语言实例-Python代码样例列表
2020-11-01 12:04

weixin_37988176的博客从日志文件中提取ip并找到归属地完成输出.py │ 使用Python完成访问同时下载网页内容的方法.py │ 分享冒泡排序与选择排序源码示例.py │ 初学python怎么用while循环笔记分享.py │ 可视化SVM分类器开源实现的...
没有解决我的问题, 去提问

悬赏问题

¥15 求京东批量付款能替代天诚
¥15 slaris 系统断电后，重新开机后一直自动重启
¥15 51寻迹小车定点寻迹
¥15 谁能帮我看看这拒稿理由啥意思啊阿啊
¥15 关于vue2中methods使用call修改this指向的问题
¥15 idea自动补全键位冲突
¥15 请教一下写代码，代码好难
¥15 iis10中如何阻止别人网站重定向到我的网站
¥15 滑块验证码移动速度不一致问题
¥15 Utunbu中vscode下cern root工作台中写的程序root的头文件无法包含

从MediaWiki API调用中提取内容（XML，cURL）

1条回答 默认 最新

悬赏问题

1条回答默认最新