duanjia1870 2011-08-31 19:24 Acceptance rate: 0%

Search a PDF and extract the page where the text is found

Does anyone know how to search a multi-page PDF for some text (e.g., an invoice number) and then extract that page to a separate file? I can see how to use FPDI to extract a particular page and then FPDF to modify and save it. The part I can't figure out is how to search the PDF and determine which page the text is on. Ideally this would be done in PHP, but I'm willing to use something else if necessary.

Are there any suggestions?

Thank you.


1 answer

  • dqcd84732 2012-08-29 19:59

    This page helped me find a solution:

    http://www.freak-search.com/en/thread/2817957/find_page_number_containing_a_given_text

    Basically, you use the command-line program "pdftotext" in a bash script (see the link) to find the number of the page that contains the text, and then FPDI to extract that page. It works great.
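
    The search step can be sketched as follows. This is a minimal illustration rather than the script from the linked thread: it is written in Python instead of bash, and the function names `pages_containing` and `find_pages` are hypothetical. It relies on pdftotext separating pages in its output with a form feed character, and it assumes pdftotext (from Poppler or Xpdf) is installed and on the PATH.

```python
import subprocess
from typing import List


def pages_containing(text: str, needle: str) -> List[int]:
    """Return the 1-based numbers of the pages whose text contains needle.

    Assumes text is pdftotext output, in which pages are separated
    by form feed characters.
    """
    pages = text.split("\f")
    return [number for number, page in enumerate(pages, start=1) if needle in page]


def find_pages(pdf_path: str, needle: str) -> List[int]:
    """Run pdftotext on pdf_path and return the pages that contain needle."""
    # "-" as the output file makes pdftotext write the text to stdout.
    result = subprocess.run(
        ["pdftotext", "-enc", "UTF-8", pdf_path, "-"],
        capture_output=True,
        encoding="utf-8",
        check=True,
    )
    return pages_containing(result.stdout, needle)
```

    For example, `find_pages("invoices.pdf", "INV-42")` (both values are placeholders) would return a list of page numbers, and the first one can then be passed to FPDI to import and save that page. Two caveats: a scanned PDF without a text layer produces no text to search, and a plain substring match will miss a number that the PDF splits across lines.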

    Selected by the asker as the best answer.
