dooo61733 2017-10-04 23:42
浏览 20

公制模式分析以进行故障排除

When I troubleshoot some site issues, I need to check many metrics like CPU, memory, application metrics and so on. generally, I want to know the following items automatically (without checking all the metrics one by one by human) :

  1. How many metrics have spikes during that time.
  2. if metric X has the same pattern with metric Y
  3. if metric X has some periodicity characters.

for item 1 and 2, I think I can get it by calculating some change rate. for item 3, I have no idea so far.

my questions here are:

  1. do we have some library already which can be used here, language (Go, Java, Python is ok).
  2. do you have any suggestion for requirement 3.

=====

More background here:

I have a Prometheus(a monitor system) setup already, but my issue is I want to analyze these metrics automatically. For example: User input: Here are 1000 time-serial data and I have an issue on time 1 to time 2, I see metrics X has spiked during that time. Program output: item 1/2/3 above.

I just have some issue during implement the program.

  • 写回答

1条回答 默认 最新

  • doufan1899 2017-10-05 08:42
    关注

    I think you need some monitoring & analytic services like:

    DataDog: https://www.datadoghq.com/

    Librato: https://www.librato.com/

    etc...

    Or a self hosted infrastructure to run Graphite

    (https://github.com/hopsoft/docker-graphite-statsd) or similar tools.

    评论

报告相同问题?

悬赏问题

  • ¥15 mmocr的训练错误,结果全为0
  • ¥15 python的qt5界面
  • ¥15 无线电能传输系统MATLAB仿真问题
  • ¥50 如何用脚本实现输入法的热键设置
  • ¥20 我想使用一些网络协议或者部分协议也行,主要想实现类似于traceroute的一定步长内的路由拓扑功能
  • ¥30 深度学习,前后端连接
  • ¥15 孟德尔随机化结果不一致
  • ¥15 apm2.8飞控罗盘bad health,加速度计校准失败
  • ¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
  • ¥15 谁有desed数据集呀