I'm trying to download a CSV file from S3 using the Go SDK, but it comes out with the wrong encoding and the whole file is parsed as a single slice.
input := &s3.GetObjectInput{
    Bucket: aws.String(bucket),
    Key:    aws.String(key),
    // Note: these only override the response headers; "utf-8" is a
    // character set, not a valid Content-Encoding value (gzip, etc.).
    ResponseContentType:     aws.String("text/csv"),
    ResponseContentEncoding: aws.String("utf-8"),
}

object, err := s3.New(s).GetObject(input) // s is a *session.Session
if err != nil {
    var obj s3.GetObjectOutput
    return &obj, err
}
defer object.Body.Close()

lines, err := csv.NewReader(object.Body).ReadAll()
if err != nil {
    log.Fatal(err)
}
log.Printf("%q", lines[0])
// returns ["\ufeffH1" "H2" "field1" "field2" "field1" "field200602"]
I'm guessing this is an incorrect character encoding, but the problem is that I'm not clear which encoding it is. When I put the file, I specify csv.
I would have expected to see a [][]string:

[
    [],
    []
]

Any advice?
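For reference while reading the output above: the \ufeff prefix is the UTF-8 byte order mark (bytes EF BB BF). Below is a minimal sketch of skipping it before the CSV reader sees it; readCSVWithoutBOM is just a name I made up, not anything from the SDK:

import (
    "bufio"
    "encoding/csv"
    "io"
)

// readCSVWithoutBOM drops a leading UTF-8 BOM, if present, then parses.
func readCSVWithoutBOM(body io.Reader) ([][]string, error) {
    br := bufio.NewReader(body)
    r, _, err := br.ReadRune()
    if err != nil && err != io.EOF {
        return nil, err
    }
    if err == nil && r != '\ufeff' {
        // First rune is real data, not a BOM: push it back.
        if uerr := br.UnreadRune(); uerr != nil {
            return nil, uerr
        }
    }
    return csv.NewReader(br).ReadAll()
}

This would be called as lines, err := readCSVWithoutBOM(object.Body) in place of the ReadAll call above.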
Approach 2
buffer := new(bytes.Buffer)
if _, err := buffer.ReadFrom(object.Body); err != nil { // error was silently ignored before
    log.Fatal(err)
}
str := buffer.String()

lines, err := csv.NewReader(strings.NewReader(str)).ReadAll()
if err != nil {
    log.Fatal(err)
}
log.Printf("length: %v", len(lines))
// still one line
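Since the count stays at one, it may be worth checking which line-ending bytes the object actually contains: as far as I can tell, encoding/csv splits records on '\n' (folding "\r\n" into "\n"), so a file that uses bare '\r' endings parses as a single record. A quick diagnostic over the str from above:

// Diagnostic sketch: count CR and LF bytes in the raw data.
log.Printf("CR: %d, LF: %d", strings.Count(str, "\r"), strings.Count(str, "\n"))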
Approach 3
My new approach is going to be manually removing the byte sequences that are problematic. This is pretty terrible; the godocs on this need work. This gets closer, but now I have to split on newlines and then again on commas.
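A sketch of that manual cleanup, assuming the only problems are the BOM and (possibly) bare \r line endings. Normalizing the bytes first lets encoding/csv do the newline and comma splitting; any \r\n pairs become \n\n, which is harmless because the CSV reader skips blank lines:

// Sketch: normalize the raw bytes from Approach 2's buffer, then parse.
raw := buffer.Bytes()
raw = bytes.TrimPrefix(raw, []byte("\ufeff"))           // drop a UTF-8 BOM
raw = bytes.ReplaceAll(raw, []byte("\r"), []byte("\n")) // bare CRs become LFs

lines, err := csv.NewReader(bytes.NewReader(raw)).ReadAll()
if err != nil {
    log.Fatal(err)
}
log.Printf("length: %v", len(lines))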
Edit
When I print out the bytes, it looks like this:
"\ufeffH1,H2,field1,field2
I have tried using the following encodings:

- utf-8
- iso-8859-1
- iso-8859-1:utf-8
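As far as I can tell, ResponseContentEncoding only rewrites the Content-Encoding header on the response; it never transcodes the bytes, so no value there will fix a charset problem. If the object really were ISO-8859-1, the decoding would have to happen client-side, for example with golang.org/x/text (a sketch only; the \ufeff above suggests the file is actually UTF-8 with a BOM, in which case this is unnecessary):

// Sketch: decode a Latin-1 object client-side before CSV parsing.
// import "golang.org/x/text/encoding/charmap"
decoded := charmap.ISO8859_1.NewDecoder().Reader(object.Body)
lines, err := csv.NewReader(decoded).ReadAll()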