从PDF获取数据到php / html / javascript

i want to ask one think about pdfs.

So i want to get out some data from pdf, but only specified data. Is it possible to choose what to get out from pdf?

For example is this image, so you can see which data i want to put out from pdf: pic http://shrani.si/f/1k/AA/Ph2cBYG/informativna-ponudba-gre.png

thanks

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douxian1939 2013-05-07 18:43
关注
This question touched two major processes: OCR and Data Capture (or parsing)

OCR stands for Optical Character Recognition. This process converts images to text. You will have to use this category of software if your PDFs are image-only PDFs (no text layer, such as scan, fax, rasterized, etc.). If your PDF already contains electronic text data, you 'may' be able to skip this step.

Data Capture standard for intelligent data location and extraction, such as finding specific fields among all other text. There are specialized software packages and/or parsing processes for that (see my previous post here).

If all your docs have the same 'area' that contains your text, you can crop the images, then pass smaller zones to OCR, which in turn will simplify your text extraction logic (because there will be less text to deal with).

ilya

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

从PDF获取数据到php / html / javascript php
2013-05-07 16:53

回答 1 已采纳 This question touched two major processes: OCR and Data Capture (or parsing) OCR stands for Optic
PHP / HTML：下载处理页面加载的PDF链接 html javascript php
2016-03-06 11:15

回答 1 已采纳 You can achieve your desired results by different methods. In this way what you are doing right kn
使用php将mysql中的数据打印成pdf文件 mysql php
2018-02-17 18:02

回答 1 已采纳 This line $pdf->SetFont('','',12); sets your font to current font. But you didn't set any
[nixon_robin]_learning_php,_mysql_&_javascript__w.pdf
2019-09-28 10:50

PHP、JavaScript、MySQL学习教材，[nixon_robin]_learning_php,_mysql_&_javascript__w.pdf
Codeigniter文件从没有ajax / jquery / javascript的表单上传 php
2018-12-02 12:54

回答 2 已采纳 Resolved ! 4 important steps needed and some missing in most Q&A in all other posts to make this c
PHP PDF密码保护（无密码无法打开） javascript php
2017-03-14 04:59

回答 1 已采纳 FPDF (+FPDF_Protection) and also TCPDF will work. You will have to pass BOTH user AND owner passwo
通过post方法将日期值发送到ajax的php进程 ajax html javascript php
2019-07-01 15:12

回答 1 已采纳 Here you go. I have modified your code so it includes the value of the input with ID 'expire'. You
PHP+JavaScript+HTML实现上传PDF和浏览PDF课件
2015-03-06 02:55

Eastmount的博客在寒假简单制作PHP网站时，需要实现在线浏览PDF和上传PDF的简单功能，下面就简单介绍下该功能。实现效果如下图所示： 1.当用户登录后，点击“上传课件”超链接可以实现隐藏和显示上传table的功能； 2.当用户选择...
PHP - 从链接参数获取变音符号 php
2016-12-15 09:48

回答 1 已采纳 You may have to set the correct encoding, UTF-8, both in your input file and in FPDF. You also nee
使用PHP验证查找恶意PDF文件？ javascript php
2016-09-21 11:28

回答 3 已采纳 Take a look into this project https://github.com/urule99/jsunpack-n - A Generic JavaScript Unpacke
生成HTML到PDF，并使用Laravel将生成的PDF上传到数据库？ javascript laravel php
2018-06-23 16:52

回答 2 已采纳 You dont need to store it in database if u already have a html template of the data that are going
PHP从入门到精通.pdf-入门教程.CHM
2013-08-07 14:26

《PHP从入门到精通》从初学者角度出发，通过通俗易懂的语言，丰富多彩的实例，详细介绍了使用PHP进行网络开发应该掌握的各方面技术。全书共分27章，包括初识．PHP、PHP环境搭建和开发工具、PHP语言基础、流程控制...
通过PHP下载PDF不会在手机上打开 javascript php
2015-07-07 01:00

回答 1 已采纳 Figured it out. Thanks to everyone for helping me debug (on mobile without dev tools it is a pain)
Learning.PHP.MySQL.&.JavaScript.4th.Edition.2014.12.pdf
2015-01-15 10:04

Learning.PHP.MySQL.&.JavaScript.4th.Edition.2014.12.pdf
browsershot：将html转换为图像，pdf或字符串
2021-02-03 01:57

使用无头Chrome将网页转换为图像或pdf 该软件包可以将网页转换为图像或pdf。转换是由在后台完成的，控制着无头版本的Google Chrome。这是一个简单的例子： use Spatie ... 执行JavaScript后，Browsershot也可以获取h
没有解决我的问题, 去提问

悬赏问题

¥15 如何在scanpy上做差异基因和通路富集？
¥20 关于#硬件工程#的问题，请各位专家解答！
¥15 关于#matlab#的问题：期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707，使系统具有较小的超调量
¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
¥30 截图中的mathematics程序转换成matlab
¥15 动力学代码报错，维度不匹配
¥15 Power query添加列问题
¥50 Kubernetes&Fission&Eleasticsearch
¥15 報錯：Person is not mapped，如何解決？
¥15 c++头文件不能识别CDialog

从PDF获取数据到php / html / javascript

1条回答 默认 最新

悬赏问题

1条回答默认最新