2015-02-26 12:52
浏览 107


i am building web app and at some point i need to scrape the value of variable called fid from remote page this variable present in script like this:

<script type="text/rocketscript">fid="mytarget"; v_width=640; v_height=360;</script>

i have tried using xpath and i come with the code below:

$fid = $xpath->xpath_query("/body/script[contains(local-name(), 'fid')]", 1)->nodeValue;

but no luck

any help appretaited

  • 写回答
  • 好问题 提建议
  • 追加酬金
  • 关注问题
  • 收藏
  • 邀请回答

1条回答 默认 最新

  • doufang3001 2015-02-26 13:22

    We have to guess most of your input document and code. I doubt that body is the outermost element of that HTML document though.

    A meaningful Xpath expression to search for this script element would be

    //script[contains(., 'fid')]

    This will find script element anywhere in the document if their text content includes fid. By the way: local-name() returns the local part of the name of the context node - which is not what you want.


    So, assuming the following document (please do not post code as links!):

    <html xmlns="" xml:lang="en-gb" lang="en-gb" dir="ltr">
        <meta http-equiv="content-type" content="text/html; charset=utf-8" />
    <body class="contentpane">
        <script type="text/rocketscript">fid="mytarget"; v_width=640; v_height=360;</script>
        <script type="text/rocketscript" data-rocketsrc=""></script>

    You can retrieve the value of fid with

    substring-before(substring-after(//*[local-name() = 'script' and contains(.,'fid')],'='),';')

    which will return


    Without the quotes:

    substring-before(substring-after(//*[local-name() = 'script' and contains(.,'fid')],'"'),'"')

    Caveat: In some environments, double quotes must be represented differently.

    There is a namespace in your document and the path expression above is ignoring that namespace. The proper way to address this would be to register this namespace and use a prefix in the path expression.

    解决 无用
    打赏 举报

相关推荐 更多相似问题