如何理解Yahoo Streaming Benchmark运行结果seen.txt和updated.txt中值的含义

Yahoo! Streaming Benchmark简介

Yahoo! Streaming Benchmark是Yahoo的一个团队在2015年对当前热门的流式计算平台：Sparking Streaming, Storm和Flink开发的一个基准测试系统。

该系统是当时第一个将这三个流式计算平台在模拟真实应用场景下的基准测试，对后面的基准测试系统的发展有重要的意义。

该系统详细的介绍见：https://yahooeng.tumblr.com/post/135321837876/benchmarking-streaming-computation-engines-at。

Github地址：https://github.com/yahoo/streaming-benchmarks。

问题描述

在使用该基准测试系统进行运行之后，会产生两个结果文件：seen.txt,updated.txt。其中记录的相关的测试结果信息，但对这两个文件中数据的含义存在困惑。相似的问题在Github中仍存在，但无人解答：https://github.com/yahoo/streaming-benchmarks/issues/22。

因此，如何理解这个基准测试的信息，并且如何使用这些数据绘制出如下统计图（该图是该系统开发人员进行给出的）：

感谢你的回答！！！

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
varuy322 2018-12-12 02:25
关注
计算latency主要使用updated.txt 其存的是10s窗口生成的最后一条数据被处理的时间（last_record_timestamp）与10s窗口第一条数据产生的时间（所说的window产生时间window_time）之差。最终计算latency时还要减去窗口时间（10s）,表示数据产生窗口的最后一条record从kafka发出到被处理的时间。希望对你有所帮助！

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Yahoo Streaming Benchmark for Heron
2019-11-04 19:55

Ethan_pika的博客 Github地址：https://github.com/yahoo/streaming-benchmarks At Yahoo we have adoptedApache Stormas our stream processing platform of choice. But that was in 2012 and the landscape has changed si...
streaming_benchmark
2021-05-26 14:41

Streaming_benchmark 流基准测试旨在测量流处理系统（如flink和spark）的性能。模拟了三个用例（用户访问会话分析，实时广告评估和购物记录分析）。原始数据将生成并存储在Kafka中。流映射到流表中，并且对这些...
【Flink】:No operators defined in streaming topology. Cannot execute.
2022-10-25 14:17

一杯咖啡半杯糖的博客 Flink:No operators defined in streaming topology. Cannot execute.
Flink No operators defined in streaming topology. Cannot execute.
2022-07-27 18:27

Airy11的博客 Flink Exception in thread "main" java.lang.IllegalStateException: No operators defined in streaming topology. Cannot execute.
【SparkStreaming】java.lang.NoClassDefFoundError: org/apache/spark/streaming/StreamingContext
2020-06-10 12:40

象在舞的博客说一件很神奇的事情，今天在使用SparkStreaming进行Scala编程的时候，发生了如下问题： Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/spark/streaming/StreamingContext at ...
Flink Caused by:org.apache.flink.streaming.connectors.kafka.internal.Handover$ClosedException
2020-12-17 09:43

大鹏_展翅的博客 org.apache.flink.streaming.connectors.kafka.internal.Handover$ClosedException at org.apache.flink.streaming.connectors.kafka.internal.Handover.close(Handover.java:182) at org.apache.flink.streaming...
org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.createLocalEnvironment
2019-11-08 17:12

Michealkz的博客运行flink scala 程序报错： Exception in thread "main" java.lang.NoSuchFieldError: MODE at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.createLocalEnvironment...
unity webgl获取页面Token信息，及加载StreamingAssets下.txt
2022-01-14 13:05

Monkey_Xuan的博客加载StreamingAssets下urlID.txt获取服务器地址，携带token和服务器交互，获取跟多信息，并进行处理；上述urlID.txt内地址需要自己修改成需要地址；如何加载web端StreamingAssets下urlID.txt，调用Unity_Get请求...
【flink1.14.4】No operators defined in streaming topology. Cannot execute.
2022-04-08 16:23

程序媛小紫的博客 org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: No operators defined in streaming topology. Cannot execute. The program finished with the following ...
【Flink】:ClassNotFoundException: org.apache.flink.streaming.api.functions.source.SourceFunction
2022-10-26 16:12

一杯咖啡半杯糖的博客 FLINK:ClassNotFoundException: org.apache.flink.streaming.api.functions.source.SourceFunction
没有解决我的问题, 去提问

如何理解Yahoo Streaming Benchmark运行结果seen.txt和updated.txt中值的含义

Yahoo! Streaming Benchmark简介

问题描述

2条回答 默认 最新

2条回答默认最新