Go: read a block of lines in a zip file [closed]

I need to read a block of n lines from a zip file as quickly as possible.

I'm a beginner in Go. For bash lovers, I want to do the same as this (to get a block of about 500 lines, from line 199500 to line 200000):

time query=$(zcat fake_contacts_200k.zip | sed '199500,200000!d')

real    0m0.106s
user    0m0.119s
sys     0m0.013s

Any idea is welcome.

1 answer

duanjucong3124 2017-10-26 08:25
1. Import archive/zip.

2. Open and read the archive file as shown in the example in the package docs.

   Note that in order to mimic the behaviour of zcat you have to first check the length of the File field of the zip.ReadCloser instance returned by a call to zip.OpenReader, and fail if it is not equal to 1, that is, if there are no files in the archive or there are two or more files in it¹.

   Note that you have to check the error value returned by a call to zip.OpenReader for being equal to zip.ErrFormat, and if it is, you have to:

   1. Close the returned zip.ReadCloser if it is non-nil.

   2. Try to reinterpret the file as being gzip-formatted (step 4).

3. Take the first (and sole) File member and call Open on it.

   You can then read the file's contents from the returned io.ReadCloser.

   After reading, you need to call Close() on that instance and then close the zip file as well. That's all. ∎

4. If step (2) failed because the file did not have the zip format, you'd test whether it's gzip-formatted.

   In order to do this, you do basically the same steps using the compress/gzip package.

   Note that contrary to the zip format, gzip does not provide file archival. It's merely a compressor, so there's no meta information on any files in the gzip stream, just the compressed data. (This fact is underlined by the difference in the names of the packages.)

   If an attempt to open the same file as a gzip archive returns the gzip.ErrHeader error, you bail out; otherwise you read the data, after which you close the reader. That's all. ∎

To process just the specific lines from the decompressed file, you'd need to:

1. Skip the lines before the first one to process.

2. Process the lines up to and including the last one to process.

3. Stop processing.

To interpret the data read from an io.Reader or io.ReadCloser, it's best to use bufio.Scanner; see "Example (Lines)" in its documentation.

P.S.

Please read this essay thoroughly to try to make your next question better than this one.

¹ You might as well read all the files and interpret their contents as a contiguous stream — that would deviate from the behaviour of zcat but that might be better. It really depends on your data.