1.我想下载2000年以后中国专利的PDF,尝试过incopat(用ui path进行重复点击下载,但速度太慢需要将近八年,太快需要验证,暂时没办法解决),国家知识产权总局的一个专利检索系统目前没办法依靠ui path(每下载一个都需要验证码)。两者虽然我都可以批量下载,但是太少了,还是需要很长的时间,请问各位牛人知不知道哪里可以直接下载很多很多的。
2.下面的xml是其他研究者批量下载专利的地方,但是我找不到,点不进去,请问各位牛人能不能帮我解析一下。
This XML file does not appear to have any style information associated with it. The document tree is shown below.
<business:PatentDocumentAndRelated xmlns:business="http://www.sipo.gov.cn/XMLSchema/business" xmlns:base="http://www.sipo.gov.cn/XMLSchema/base" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:tbl="http://oasis-open.org/specs/soextblx" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" country="CN" dateProduced="20151231" datePublication="20160106" docNumber="105217357" file="CN102013000481828CN00001052173570APIDZH20160106CN00V.XML" kind="A" lang="zh" status="C" xsi:schemaLocation="http://www.sipo.gov.cn/XMLSchema/business /DTDS/PatentDocument/Elements/OtherElements.xsd" xsdVersion="V2.2.1">
<business:FullDocImage imgPath="\CN102013000481828CN00001052173570APIDZH20160106CN00V" type="pdf" numberOfFigures="11" country="CN" lang="zh">
<business:ImageFile fileName="CN102013000481828CN00001052173570APDFZH20160106CN00U.PDF" num="1" fileType="PDF">
<business:TitlePages>
<base:PageRange>
<base:FirstPageNumber>1</base:FirstPageNumber>
<base:LastPageNumber>1</base:LastPageNumber>
</base:PageRange>
</business:TitlePages>
</business:ImageFile>
<business:ImageFile fileName="CN102013000481828CN00001052173570APDFZH20160106CN00U.PDF" num="1" fileType="PDF">
<business:ClaimPages>
<base:PageRange>
<base:FirstPageNumber>2</base:FirstPageNumber>
<base:LastPageNumber>2</base:LastPageNumber>
</base:PageRange>
</business:ClaimPages>
</business:ImageFile>
<business:ImageFile fileName="CN102013000481828CN00001052173570APDFZH20160106CN00U.PDF" num="5" fileType="PDF">
<business:DescriptionPages>
<base:PageRange>
<base:FirstPageNumber>3</base:FirstPageNumber>
<base:LastPageNumber>7</base:LastPageNumber>
</base:PageRange>
</business:DescriptionPages>
</business:ImageFile>
<business:ImageFile fileName="CN102013000481828CN00001052173570APDFZH20160106CN00U.PDF" num="4" fileType="PDF">
<business:DrawingPages>
<base:PageRange>
<base:FirstPageNumber>8</base:FirstPageNumber>
<base:LastPageNumber>11</base:LastPageNumber>
</base:PageRange>
</business:DrawingPages>
</business:ImageFile>
</business:FullDocImage>
</business:PatentDocumentAndRelated>