doudaotui4297 2018-11-26 12:37
Views: 42

Ignoring in-text citations/references when extracting text from a docx file with PHP

We have a .docx file from which we need to extract text using PHP. The docx file contains text like:

Literature had found out that HIV related stigma and discrimination exists among different sectors of life like family, job settings, society and health care settings (1).

Here (1) is an in-text citation. It is stored as a field, and the field's citation text is:

>ADDIN CSL_CITATION { 
  "citationItems": [{
    "id": "ITEM-1",
    "itemData": { 
      "URL":"http://www.unaids.org/en",
      "accessed": {
        "date-parts":[["2018","11","4"]]
      },
      "id":"ITEM-1",
      "issued": {
        "date-parts":[["0"]]
      },
      "title":"UNAIDS",
      "type":"webpage"
    },
    "uris":[
      "http://www.mendeley.com/documents/?uuid=fedd4311-1013-3a16-bb82-27566ac11365"
    ]
  }],
  "mendeley": {
    "formattedCitation": "(1)",
    "plainTextFormattedCitation": "(1)",
    "previouslyFormattedCitation":"(1)"
  },
  "properties": {
    "noteIndex":0
  },
  "schema": "https://github.com/citation-style-language/schema/raw/master/csl-citation.json"
}}.

We want to drop this field citation text during extraction, so that the result is plain text without the citation. How can we remove this field text in PHP?
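
One possible approach, offered as a minimal sketch rather than a definitive answer: in WordprocessingML, a complex field such as this one is normally stored as a run of `w:fldChar` markers (`begin`, `separate`, `end`), with the `ADDIN CSL_CITATION {...}` code in `w:instrText` and the visible `(1)` in ordinary `w:t` runs between `separate` and `end`. Assuming the citation in this file is stored that way, the text can be collected from `w:t` elements only, skipping everything between `begin` and `end`. The function names `extractTextWithoutFields` and `extractDocxText` are hypothetical helpers, not part of any library.

```php
<?php
// Sketch under stated assumptions: citations are complex fields
// (w:fldChar begin ... end). Simple fields (w:fldSimple) are not handled.

const W_NS = 'http://schemas.openxmlformats.org/wordprocessingml/2006/main';

/** Returns the body text of a document.xml string, one line per paragraph. */
function extractTextWithoutFields(string $documentXml): string
{
    $dom = new DOMDocument();
    if (!$dom->loadXML($documentXml)) {
        throw new RuntimeException('Could not parse document XML');
    }
    $xpath = new DOMXPath($dom);
    $xpath->registerNamespace('w', W_NS);

    $paragraphs = [];
    $fieldDepth = 0; // greater than 0 while between a field's begin and end

    // The union is returned in document order, so markers and text interleave correctly.
    $nodes = $xpath->query('//w:body//w:p | //w:body//w:fldChar | //w:body//w:t');
    foreach ($nodes as $node) {
        if ($node->localName === 'p') {
            $paragraphs[] = ''; // start a new output line
        } elseif ($node->localName === 'fldChar') {
            $type = $node->getAttributeNS(W_NS, 'fldCharType');
            if ($type === 'begin') {
                $fieldDepth++;
            } elseif ($type === 'end') {
                $fieldDepth = max(0, $fieldDepth - 1);
            }
        } elseif ($fieldDepth === 0) {
            // A w:t outside any field: keep it. w:instrText is never selected.
            if (!$paragraphs) {
                $paragraphs[] = '';
            }
            $paragraphs[count($paragraphs) - 1] .= $node->textContent;
        }
    }
    return implode("\n", $paragraphs);
}

/** Reads word/document.xml from a .docx (needs the zip extension). */
function extractDocxText(string $path): string
{
    $zip = new ZipArchive();
    if ($zip->open($path) !== true) {
        throw new RuntimeException("Could not open $path");
    }
    $xml = $zip->getFromName('word/document.xml');
    $zip->close();
    if ($xml === false) {
        throw new RuntimeException('word/document.xml not found');
    }
    return extractTextWithoutFields($xml);
}
```

Calling `extractDocxText('paper.docx')` (the file name is a placeholder) would then return the text with both the field code and the visible `(1)` dropped. Note that this skips every complex field, not only Mendeley citations, so page numbers or cross-references stored as fields disappear too, and the space that preceded the citation is left in place ("settings ."). To keep the visible `(1)` and drop only the field code, collecting all `w:t` elements and ignoring `w:instrText` should be enough.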
