douren7921
2017-04-24 19:06
Viewed 147 times
Accepted

How to persist a data stream to S3? aws-sdk-go example not working?

I am trying to persist a given stream of data to an S3-compatible storage. The size is not known before the stream ends and can vary from 5MB to ~500GB.

I tried different possibilities but did not find a better solution than implementing the sharding myself. My best guess is to allocate a fixed-size buffer, fill it with my stream, and write each full buffer to S3 (see the sketch below). Is there a better solution? Maybe a way where this is transparent to me, without holding the whole stream in memory?
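To make that concrete, here is roughly what I have in mind. uploadPart is just a placeholder for whatever call would actually PUT one shard; it is not a real SDK function:

    package main
    
    import (
        "io"
        "log"
        "os"
    )
    
    // uploadPart stands in for whatever call would PUT one shard to S3;
    // it is a placeholder, not a real SDK function.
    func uploadPart(partNum int, data []byte) error {
        log.Printf("would upload part %d (%d bytes)", partNum, len(data))
        return nil
    }
    
    func main() {
        buf := make([]byte, 20<<20) // fixed-size 20MB buffer
        for partNum := 1; ; partNum++ {
            n, err := io.ReadFull(os.Stdin, buf)
            if n > 0 {
                if uerr := uploadPart(partNum, buf[:n]); uerr != nil {
                    log.Fatal(uerr)
                }
            }
            if err == io.EOF || err == io.ErrUnexpectedEOF {
                return // stream ended
            }
            if err != nil {
                log.Fatal(err)
            }
        }
    }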

The aws-sdk-go README has an example program that takes data from stdin and writes it to S3: https://github.com/aws/aws-sdk-go#using-the-go-sdk

When I try to pipe data in with a pipe (|), I get the following error:

    failed to upload object, SerializationError: failed to compute request body size
    caused by: seek /dev/stdin: illegal seek

Am I doing something wrong, or is the example not working as I expect it to?

I also tried minio-go, with PutObject() and client.PutObjectStreaming(). Both work, but consume as much memory as the data to be stored.

  1. Is there a better solution?
  2. Is there a small example program that can pipe arbitrary data into S3?


1 Answer

  • dongsu2807 2017-04-24 23:55
    Accepted answer

    You can use the SDK's Uploader to handle uploads of unknown size, but you'll need to make os.Stdin "unseekable" by wrapping it in a plain io.Reader. The reason: although the Uploader requires only an io.Reader as the input body, under the hood it checks whether the body also implements io.Seeker, and if it does, it calls Seek on it. Since os.Stdin is an *os.File, which implements the Seeker interface, you would by default get the same error you got from PutObjectWithContext.
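    To illustrate that check, here is a simplified sketch (not the SDK's actual code) of how a type assertion distinguishes the bare os.Stdin from the wrapped one:

    package main
    
    import (
        "fmt"
        "io"
        "os"
    )
    
    // reader exposes only Read, hiding any Seek on the wrapped value.
    type reader struct{ r io.Reader }
    
    func (r *reader) Read(p []byte) (int, error) { return r.r.Read(p) }
    
    func main() {
        var direct io.Reader = os.Stdin
        var wrapped io.Reader = &reader{os.Stdin}
    
        // The SDK does a check much like this one on the request body:
        _, ok := direct.(io.Seeker)
        fmt.Println("os.Stdin is a Seeker:", ok) // true: Seek gets called and fails on a pipe
        _, ok = wrapped.(io.Seeker)
        fmt.Println("wrapped stdin is a Seeker:", ok) // false: treated as a plain stream
    }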

    The Uploader also uploads the data in parts; you can configure both the part size and how many parts are uploaded concurrently.

    Here's a modified version of the linked example, filled in as a small self-contained program.

    package main
    
    import (
        "context"
        "flag"
        "fmt"
        "io"
        "os"
        "time"
    
        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/s3/s3manager"
    )
    
    // reader wraps an io.Reader so that only Read is exposed; any
    // Seek method on the underlying value (e.g. *os.File) is hidden.
    type reader struct {
        r io.Reader
    }
    
    func (r *reader) Read(p []byte) (int, error) {
        return r.r.Read(p)
    }
    
    func main() {
        // Flag names follow the linked example; adjust as needed.
        var bucket, key string
        var timeout time.Duration
        flag.StringVar(&bucket, "b", "", "Bucket name.")
        flag.StringVar(&key, "k", "", "Object key name.")
        flag.DurationVar(&timeout, "d", 0, "Upload timeout.")
        flag.Parse()
    
        sess := session.Must(session.NewSession())
        uploader := s3manager.NewUploader(sess, func(u *s3manager.Uploader) {
            u.PartSize = 20 << 20 // 20MB parts
            u.Concurrency = 5     // upload up to 5 parts in parallel
        })
    
        ctx := context.Background()
        var cancel context.CancelFunc
        if timeout > 0 {
            ctx, cancel = context.WithTimeout(ctx, timeout)
            defer cancel()
        }
    
        _, err := uploader.UploadWithContext(ctx, &s3manager.UploadInput{
            Bucket: aws.String(bucket),
            Key:    aws.String(key),
            Body:   &reader{os.Stdin},
        })
        if err != nil {
            fmt.Fprintf(os.Stderr, "failed to upload object: %v\n", err)
            os.Exit(1)
        }
    }
    

    As to whether this is a better solution than minio-go, I don't know; you'll have to test that yourself.
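    To try it from a shell (the binary name s3upload and the flags are just the ones from the sketch above, not anything prescribed by the SDK): build the program, then run, e.g., cat bigfile | ./s3upload -b mybucket -k bigfile -d 10m. The data is streamed through in PartSize-sized parts rather than buffered whole in memory.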

