ssummeraw 2013-04-25 05:23 Acceptance rate: 10%
Views: 2920

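How do I read PDF content from a server and save it to a file?

I have a server link and need to read the PDF content from it. I am using the code below, but the result comes out in a different format than I expected.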

public String readPDF() throws Exception
{
     BufferedReader in = null;
     String page = "";
     try {
         HttpClient client = new DefaultHttpClient();
         HttpGet request = new HttpGet();
         request.setURI(new URI("http://14.140.41.194/monali/i.pdf"));
         HttpResponse response = client.execute(request);
         in = new BufferedReader
         (new InputStreamReader(response.getEntity().getContent()));
         StringBuffer sb = new StringBuffer("");
         String line = "";
         String NL = System.getProperty("line.separator");
         while ((line = in.readLine()) != null) {
             sb.append(line + NL);
         }
         in.close();
         page = sb.toString();
     } finally {
         if (in != null) {
             try {
                 in.close();
             } catch (IOException e) {
                 e.printStackTrace();
             }
         }
     }
     return page;
}

Output:

04-21 19:00:49.894: INFO/System.out(624): page .. %PDF-1.4
04-21 19:00:49.894: INFO/System.out(624): %����
04-21 19:00:49.903: INFO/System.out(624): 15 0 obj
04-21 19:00:49.903: INFO/System.out(624): <</Length 2496
04-21 19:00:49.916: INFO/System.out(624): /Subtype /XML
04-21 19:00:49.916: INFO/System.out(624): /Type /Metadata
04-21 19:00:49.916: INFO/System.out(624): >>
04-21 19:00:49.916: INFO/System.out(624): stream
04-21 19:00:49.916: INFO/System.out(624): <?xpacket begin='' id='W5M0MpCehiHzreSzNTczkc9d'?>
04-21 19:00:49.916: INFO/System.out(624): <x:xmpmeta x:xmptk="3.1-701" xmlns:x="adobe:ns:meta/">
04-21 19:00:49.916: INFO/System.out(624):   <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
04-21 19:00:49.916: INFO/System.out(624):     <rdf:Description rdf:about="" xmlns:xmp="http://ns.adobe.com/xap/1.0/">
04-21 19:00:49.916: INFO/System.out(624):       <xmp:CreateDate>2011-05-20T05:46:02Z</xmp:CreateDate>
04-21 19:00:49.916: INFO/System.out(624):       <xmp:CreatorTool>Nitro PDF Professional  (6, 0, 1, 8)</xmp:CreatorTool>
04-21 19:00:49.916: INFO/System.out(624):       <xmp:ModifyDate>2011-05-20T05:46:04Z</xmp:ModifyDate>
04-21 19:00:49.916: INFO/System.out(624):       <xmp:MetadataDate>2011-05-20T05:46:04Z</xmp:MetadataDate>
04-21 19:00:49.916: INFO/System.out(624):     </rdf:Description>
04-21 19:00:49.916: INFO/System.out(624):     <rdf:Description rdf:about="" xmlns:dc="http://purl.org/dc/elements/1.1/">
04-21 19:00:49.916: INFO/System.out(624):       <dc:format>application/pdf</dc:format>
04-21 19:00:49.916: INFO/System.out(624):       <dc:creator>
04-21 19:00:49.916: INFO/System.out(624):         <rdf:Seq>
04-21 19:00:49.916: INFO/System.out(624):           <rdf:li></rdf:li>
04-21 19:00:49.916: INFO/System.out(624):         </rdf:Seq>
04-21 19:00:49.916: INFO/System.out(624):       </dc:creator>
04-21 19:00:49.916: INFO/System.out(624):       <dc:title>
04-21 19:00:49.916: INFO/System.out(624):         <rdf:Alt>
04-21 19:00:49.916: INFO/System.out(624):           <rdf:li xml:lang="x-default"></rdf:li>
04-21 19:00:49.916: INFO/System.out(624):         </rdf:Alt>
04-21 19:00:49.916: INFO/System.out(624):       </dc:title>
04-21 19:00:49.916: INFO/System.out(624):       <dc:description>
04-21 19:00:49.916: INFO/System.out(624):         <rdf:Alt>
04-21 19:00:49.916: INFO/System.out(624):           <rdf:li xml:lang="x-default"/>
04-21 19:00:49.916: INFO/System.out(624):         </rdf:Alt>
04-21 19:00:49.916: INFO/System.out(624):       </dc:description>
04-21 19:00:49.916: INFO/System.out(624):     </rdf:Description>
04-21 19:00:49.916: INFO/System.out(624):     <rdf:Description rdf:about="" xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
04-21 19:00:49.916: INFO/System.out(624):       <pdf:Keywords></pdf:Keywords>
04-21 19:00:49.916: INFO/System.out(624):       <pdf:Producer>Nitro PDF Professional  (6, 0, 1, 8)</pdf:Producer>
04-21 19:00:49.916: INFO/System.out(624):     </rdf:Description>
04-21 19:00:49.916: INFO/System.out(624):     <rdf:Description rdf:about="" xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/">
04-21 19:00:49.916: INFO/System.out(624):       <xmpMM:DocumentID>uuid:3e7ef8a9-a526-45bf-9db1-a31533c20f86</xmpMM:DocumentID>
04-21 19:00:49.916: INFO/System.out(624):     </rdf:Description>
04-21 19:00:49.916: INFO/System.out(624):   </rdf:RDF>
04-21 19:00:49.916: INFO/System.out(624): </x:xmpmeta>
04-21 19:00:49.916: INFO/System.out(624):                                                                                                     
04-21 19:00:49.916: INFO/System.out(624):   

1 answer

  • 码界一棵葱 2016-08-17 08:30

    Why not just create a file and write the response straight into it?
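
    One way this suggestion might look is sketched below. The output in the question is probably garbled because `InputStreamReader` and `readLine()` decode the PDF's binary bytes as text; copying the raw bytes to a file avoids that decoding step. The class name `PdfSaver` and the method `saveStream` are made up for illustration and are not part of the original code.

```java
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

// Hypothetical helper (name chosen for this sketch, not from the question).
class PdfSaver {

    // Copies the stream to the target file byte for byte, with no character
    // decoding, and returns the number of bytes written.
    static long saveStream(InputStream in, File target) throws IOException {
        byte[] buffer = new byte[8192];
        long total = 0;
        // try-with-resources closes both streams even if the copy fails.
        try (InputStream src = in;
             OutputStream out = new FileOutputStream(target)) {
            int read;
            while ((read = src.read(buffer)) != -1) {
                out.write(buffer, 0, read);
                total += read;
            }
        }
        return total;
    }
}
```

    In the question's code, the stream would be the one returned by `response.getEntity().getContent()`, so the call might look like `PdfSaver.saveStream(response.getEntity().getContent(), new File(dir, "i.pdf"))`, where `dir` is assumed to be a directory the app is allowed to write to.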
