时间序列异常检测（python）

多变量的时间序列异常检测怎么做（python），找了几个模型代码，没看懂

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

22条回答默认最新

专家-郭老师 Java领域新星创作者 2024-08-05 08:56

关注

获得0.45元问题酬金

🕖上午好，题主！目前：🟢[在线]
📢参考通义千问、由郭老师整理提供！
📣今日诗号：古来云海茫茫，八表神游，顶峰何在？行尽九洲四海，人间自有，谪仙风采。

多变量时间序列异常检测 (Python)

以下是用 Python 进行多变量时间序列异常检测的解决方案和代码示例，包括多种方法：

1. 基于统计的方法

a. 多元高斯分布

import numpy as np
from scipy.stats import multivariate_normal

# 训练数据
X_train = np.array([[1, 2, 3], [2, 3, 4], [3, 4, 5]])

# 计算均值和协方差矩阵
mean = np.mean(X_train, axis=0)
cov = np.cov(X_train.T)

# 定义异常阈值 (例如，3倍标准差)
threshold = 3 * np.sqrt(np.diag(cov))

# 测试数据
X_test = np.array([[10, 2, 3], [2, 3, 4], [3, 4, 10]])

# 计算马氏距离
mahalanobis_distance = np.array([multivariate_normal.mahalanobis(x, mean, cov) for x in X_test])

# 识别异常
anomalies = np.where(mahalanobis_distance > threshold)[0]

print("异常索引:", anomalies)

b. 滑动窗口 + z-score

import pandas as pd

# 创建示例数据
df = pd.DataFrame({'value1': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 100],
                    'value2': [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 101]})

# 设置滑动窗口大小
window_size = 3

# 计算滚动平均值和标准差
rolling_mean = df.rolling(window=window_size).mean()
rolling_std = df.rolling(window=window_size).std()

# 计算 z-score
z_scores = (df - rolling_mean) / rolling_std

# 设置异常阈值
threshold = 3

# 识别异常
anomalies = np.where(np.abs(z_scores) > threshold)

print("异常索引：", anomalies)

2. 基于机器学习的方法

a. Isolation Forest

from sklearn.ensemble import IsolationForest
from sklearn.preprocessing import StandardScaler

# 训练数据
X_train = np.array([[1, 2, 3], [2, 3, 4], [3, 4, 5]])

# 标准化数据
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)

# 训练 Isolation Forest 模型
model = IsolationForest()
model.fit(X_train)

# 测试数据
X_test = np.array([[10, 2, 3], [2, 3, 4], [3, 4, 10]])
X_test = scaler.transform(X_test)

# 预测异常
anomaly_scores = model.score_samples(X_test)
anomalies = np.where(anomaly_scores < model.offset_)[0]

print("异常索引：", anomalies)

b. One-Class SVM

from sklearn.svm import OneClassSVM

# 训练数据
X_train = np.array([[1, 2, 3], [2, 3, 4], [3, 4, 5]])

# 训练 One-Class SVM 模型
model = OneClassSVM(nu=0.1)
model.fit(X_train)

# 测试数据
X_test = np.array([[10, 2, 3], [2, 3, 4], [3, 4, 10]])

# 预测异常
predictions = model.predict(X_test)
anomalies = np.where(predictions == -1)[0]

print("异常索引：", anomalies)

3. 基于深度学习的方法

a. LSTM Autoencoder

from keras.layers import Input, LSTM, RepeatVector, Dense
from keras.models import Model

# 定义模型参数
timesteps = 10
n_features = 3

# 定义编码器
inputs = Input(shape=(timesteps, n_features))
encoded = LSTM(10, activation='relu')(inputs)

# 定义解码器
decoded = RepeatVector(timesteps)(encoded)
decoded = LSTM(n_features, activation='linear', return_sequences=True)(decoded)

# 创建模型
autoencoder = Model(inputs, decoded)
autoencoder.compile(optimizer='adam', loss='mse')

# 训练数据 (需要根据时间步长进行调整)
X_train = np.random.rand(100, timesteps, n_features)

# 训练模型
autoencoder.fit(X_train, X_train, epochs=10)

# 测试数据
X_test = np.random.rand(10, timesteps, n_features)
X_test[2, :, :] = 10  # 添加异常数据

# 预测重构误差
reconstructions = autoencoder.predict(X_test)
reconstruction_errors = np.mean(np.power(X_test - reconstructions, 2), axis=1)

# 设置异常阈值
threshold = np.percentile(reconstruction_errors, 95)

# 识别异常
anomalies = np.where(reconstruction_errors > threshold)[0]

print("异常索引：", anomalies)

以上只是一些示例代码，需要根据具体的数据和问题进行调整。在实际应用中，还需要考虑数据预处理、特征工程、模型选择和评估等问题。

报告相同问题？

关注问题

时间序列数据，python预处理 python
2021-02-19 11:53

回答 5 已采纳代码编辑完成，基本实现三个需求。 # coding='utf-8' import os import datetime import shutil import openpyxl from op
python对时间序列栅格数据进行mann-kendall突变检测 python 开发语言
2023-04-17 20:31

回答 3 已采纳自己解决了。。。
如何用编程语言求输出数字序列？(语言-python) python
2023-03-18 16:26

回答 4 已采纳该回答引用GPTᴼᴾᴱᴺᴬᴵ可以使用Python中的for循环和range函数来实现输出小于n的所有自然数，每个数字之间用空格分隔的效果。下面是一个简单的示例代码： n = int(input("请
【时间序列异常检测】时序异常检测综述整理(2020-2021)
2022-08-11 22:34

AI蜗牛车的博客赵越博士的异常检测库Python Outlier Detection (PyOD) [1]写的很好，还提供了关于异常检测的学习资料[2]，我阅读了几篇综述，个人比较推荐以下三篇：● 2020 | Anomaly detection in univariate time-seri...
Python3 关于collatz序列编程疑问 python
2022-07-17 15:15

回答 1 已采纳第12/13行中调用了2次函数，如果只想调用一次，在第12行前插入一句temp=collatz(number)，再在12/13行中将collatz(number)替换为temp。 temp=co
R语言时间序列 滚动窗口预测 r语言算法
2023-02-03 08:52

回答 5 已采纳下面是一个可能的 R 代码实现： # Load required libraries library(glmnet) # Load data data <- read.csv("file.cs
三路归并得到递增序列（python语言） python 数据结构
2022-05-30 18:53

回答 2 已采纳归并排序的过程是： 1.将一个序列从中间位置分成两个序列。 2.再将这两个子序列按照第一步继续二分下去。 3.直到所有的子序列的长度都为1，即不可再二分为止。 4.最后再将所有的子序列两两归并成一个有
Python-金融时间序列技术分析Python库
2019-08-11 06:06

Python作为一种强大的编程语言，拥有众多库用于处理和分析金融时间序列数据。本文将深入探讨标题提及的"Python-金融时间序列技术分析Python库"，以及与之相关的机器学习应用。一、金融时间序列库介绍 1. Pandas ...
R语言时间序列预测出现问题 r语言
2022-06-03 10:19

回答 1 已采纳重启然后重新运行一遍看看；又或者把放入模型的数据拿出来看看有没有问题。
Python3：collatz序列编程疑问 python
2018-05-21 13:15

回答 4 已采纳你调用了2次 ``` def collatz(number): if number % 2==0: print(number//2) retur
怎么用python修改fasta序列的ID名字 python
2022-04-19 08:29

回答 1 已采纳 import re def run(): fasta_str = """WP_018731760.1 NAD(P)-binding domain-containing protein [Sal
使用Python进行时间序列数据的异常检测和预测计算机毕设
2024-09-13 00:51

sj52abcd的博客 时间序列数据异常检测和预测系统的开发：通过运用 Python 的 Flask 框架和 MySQL 数据库，我们能够构建一个高效的时间序列数据异常检测和预测系统，支持用户对数据进行可视化处理和分析，以及预测未来的数据趋势。
python序列判断 python
2022-10-05 23:10

回答 3 已采纳能不能把缩进整清楚一点
使用Python进行时间序列数据的异常检测
2024-09-13 12:39

sj52abcd的博客这些方法主要包括：基于统计的方法（如基于密度的方法、基于聚类的异常检测方法等）、基于深度学习的方法（如基于神经网络的异常检测方法等）以及基于异常分析的方法（如基于异常统计的方法等）。2. 基于异常点检测...
时间序列python代码
2018-07-25 23:02

本项目涉及的主题是“时间序列Python代码”，通过它我们可以深入理解如何使用Python编程语言来处理和预测时间序列数据。 时间序列分析是研究数据随时间变化趋势的一种统计方法，它的核心是识别数据中的趋势、季节性...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 8月13日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 8月5日

悬赏问题

¥15 android 集成sentry上报时报错。
¥50 win10链接MySQL
¥35 跳过我的世界插件ip验证
¥15 抖音看过的视频，缓存在哪个文件
¥15 自定义损失函数报输入参数的数目不足
¥15 如果我想学习C大家有是的的资料吗
¥15 根据文件名称对文件进行排序
¥15 deploylinux的ubuntu系统无法成功安装使用MySQL❓
¥15 有人会用py或者r画这种图吗
¥15 MOD04_3K图像预处理

时间序列异常检测（python）

22条回答 默认 最新

多变量时间序列异常检测 (Python)

问题事件

悬赏问题

22条回答默认最新