dtx9931 2017-10-31 06:50
84 views

How can I share the HTTP and AWS sessions across fetch-and-store-to-S3 operations?

I need to fetch the contents of several URLs and store them in AWS S3. I've written a function that does this and it works. But I am looking to make it faster and more efficient by re-using the HTTP client connection and the AWS session. Furthermore, I'd like the fetches to run concurrently, say 5 at a time.

import (
    "bytes"
    "io/ioutil"
    "log"
    "net/http"
    "net/url"
    "time"

    "github.com/aws/aws-sdk-go/aws"
    "github.com/aws/aws-sdk-go/aws/session"
    "github.com/aws/aws-sdk-go/service/s3"
)

func fetchPut(fromURL string, toS3 string) error {
    // Fetch the source URL and time the download.
    start := time.Now()
    resp, err := http.Get(fromURL)
    if err != nil {
        return err
    }
    defer resp.Body.Close()

    // A fresh session and service client are created on every call;
    // this is the part I would like to re-use.
    sess := session.Must(session.NewSession())
    s3svc := s3.New(sess)

    s3URL, err := url.Parse(toS3)
    if err != nil {
        return err
    }

    byteArray, err := ioutil.ReadAll(resp.Body)
    if err != nil {
        return err
    }
    fetchElapsed := time.Since(start).Seconds()
    log.Printf("fetch took %.2fs", fetchElapsed)

    // Time the S3 put; bucket and key come from the s3:// URL.
    start = time.Now()
    input := &s3.PutObjectInput{
        Body:   bytes.NewReader(byteArray),
        Bucket: aws.String(s3URL.Host),
        Key:    aws.String(s3URL.Path),
    }
    _, err = s3svc.PutObject(input)
    putElapsed := time.Since(start).Seconds()
    log.Printf("put took %.2fs", putElapsed)

    return err
}

What I don't understand is how I can re-use the session (both HTTP and AWS). Can I keep it in a global variable, or do I have to create some sort of context?

Are there any good examples of this sort of use case to study?


1 answer

  • dongyou1926 2017-10-31 13:32

    Your problem seems to be pretty general.

    As a principle, you need to separate the things that don't change (the session and AWS service object, and the fixed part of the destination, like the bucket name) from the things that do change (the source URL and the varying part of the destination, like the key name). Set up the unchanging configuration once, then run the URL fetch + S3 store concurrently, passing that configuration in as an additional argument.

    That boils down to moving the s3svc creation out of the fetchPut function and passing it in as an argument, then running fetchPut in goroutines, possibly with a sync.WaitGroup if you want to wait for all of them to finish; see the sketch below.
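    Here is a minimal sketch of that refactor, assuming fetchPut is changed to take the shared service client as an argument. The job list, bucket names, and the 5-slot semaphore limit are placeholders; the SDK's session and service clients are safe for concurrent use, so all goroutines can share them.

    package main

    import (
        "bytes"
        "io/ioutil"
        "log"
        "net/http"
        "net/url"
        "sync"

        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/s3"
    )

    // fetchPut now receives the shared service client instead of creating one.
    func fetchPut(s3svc *s3.S3, fromURL, toS3 string) error {
        resp, err := http.Get(fromURL) // the default client pools connections
        if err != nil {
            return err
        }
        defer resp.Body.Close()

        body, err := ioutil.ReadAll(resp.Body)
        if err != nil {
            return err
        }

        s3URL, err := url.Parse(toS3)
        if err != nil {
            return err
        }
        _, err = s3svc.PutObject(&s3.PutObjectInput{
            Body:   bytes.NewReader(body),
            Bucket: aws.String(s3URL.Host),
            Key:    aws.String(s3URL.Path),
        })
        return err
    }

    func main() {
        // Create the session and service client once; goroutines share them.
        sess := session.Must(session.NewSession())
        s3svc := s3.New(sess)

        jobs := map[string]string{ // fromURL -> toS3 (placeholder values)
            "https://example.com/a": "s3://my-bucket/a",
            "https://example.com/b": "s3://my-bucket/b",
        }

        var wg sync.WaitGroup
        sem := make(chan struct{}, 5) // allow at most 5 fetches at a time

        for from, to := range jobs {
            wg.Add(1)
            go func(from, to string) {
                defer wg.Done()
                sem <- struct{}{}        // acquire a slot
                defer func() { <-sem }() // release it
                if err := fetchPut(s3svc, from, to); err != nil {
                    log.Printf("%s: %v", from, err)
                }
            }(from, to)
        }
        wg.Wait()
    }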

    Another variation would be to run two pools of workers: producers (fetching URLs) and consumers (putting to S3), with a channel through which one feeds the other; a sketch follows. That would probably give the most speedup.
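    A rough sketch of that pipeline is below. The job and fetched types, the pool sizes, and the inputs are purely illustrative; the important part is the shutdown order at the end.

    package main

    import (
        "bytes"
        "io/ioutil"
        "log"
        "net/http"
        "net/url"
        "sync"

        "github.com/aws/aws-sdk-go/aws"
        "github.com/aws/aws-sdk-go/aws/session"
        "github.com/aws/aws-sdk-go/service/s3"
    )

    // job and fetched are illustrative types for the two pipeline stages.
    type job struct{ fromURL, toS3 string }
    type fetched struct {
        toS3 string
        body []byte
    }

    func main() {
        sess := session.Must(session.NewSession())
        s3svc := s3.New(sess)

        jobs := make(chan job)
        results := make(chan fetched)

        // Producer pool: fetch URLs; the default http client pools connections.
        var producers sync.WaitGroup
        for i := 0; i < 5; i++ {
            producers.Add(1)
            go func() {
                defer producers.Done()
                for j := range jobs {
                    resp, err := http.Get(j.fromURL)
                    if err != nil {
                        log.Printf("fetch %s: %v", j.fromURL, err)
                        continue
                    }
                    body, err := ioutil.ReadAll(resp.Body)
                    resp.Body.Close()
                    if err != nil {
                        log.Printf("read %s: %v", j.fromURL, err)
                        continue
                    }
                    results <- fetched{toS3: j.toS3, body: body}
                }
            }()
        }

        // Consumer pool: put fetched bodies to S3 with the shared client.
        var consumers sync.WaitGroup
        for i := 0; i < 5; i++ {
            consumers.Add(1)
            go func() {
                defer consumers.Done()
                for f := range results {
                    u, err := url.Parse(f.toS3)
                    if err != nil {
                        log.Printf("parse %s: %v", f.toS3, err)
                        continue
                    }
                    if _, err := s3svc.PutObject(&s3.PutObjectInput{
                        Body:   bytes.NewReader(f.body),
                        Bucket: aws.String(u.Host),
                        Key:    aws.String(u.Path),
                    }); err != nil {
                        log.Printf("put %s: %v", f.toS3, err)
                    }
                }
            }()
        }

        // Feed the pipeline, then shut the pools down in order.
        jobs <- job{"https://example.com/a", "s3://my-bucket/a"} // placeholder
        close(jobs)
        producers.Wait()
        close(results) // safe only after all producers have stopped sending
        consumers.Wait()
    }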

    In general, I agree with your idea of making it concurrent - it's a good mind-stretching exercise and doesn't have to be considered premature optimization. I also can't resist advertising Rob Pike's excellent talk "Concurrency Is Not Parallelism". Rob's example of a load balancer is more complicated than your case, but it still gives a good overview of how to process requests concurrently.

    Btw, "session" used for http fetch is kind of transparent; as the commenters already mentioned, http client from standard library will be reused and you don't have to worry about that.

