存储所有数据更改的每个细节（如Stackoverflow）[关闭]

I have system written using Codeigniter and as a database using MySQL. System have user, usergroups with different privileges and etc. Have lots of mysql tables which have many to many relationships.

Some of the tables I have:

items
contracts
customers
products
product_features
orders
order_features
order_products
etc...

Currently I am logging every change on data for these tables which made by users. Users can change these datas due to their privilege. Storing change of logs only simple form like

A user changed product features with id of A8767
B user added new customer with id 56
C user edited content of orderlist
A user added new product (id: A8767) to order (id: or67)
...

I want keep all changes which made with every detail, like edit history of question Stackoverflow. I can think about log_table design to keep all data changes from various tables. Is there any way, tutorial, engine , plugin to do that ? Only i can think make duplicate of every table and keep storing changes on them, but i dont think its good way.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
donglang1894 2012-03-24 22:58
关注
I've been thinking about that for a while now and can only think of two ways to do this. Both can work fully transparent when crafted into an abstract data layer / model.

By the way there is an implementation for "versionable" table data in the ORM mapper doctrine. See this example in their docs. Maybe that fits your needs, but it doesn't fit mine. It seems to delete all history data when the original record is deleted, making it not really revision safe.

Option A: have a copy of each table to hold revision data

Lets say you have a simple contact table:

CREATE TABLE contact ( id INT NOT NULL auto_increment, name VARCHAR(255), firstname VARCHAR(255), lastname VARCHAR(255), PRIMARY KEY (id) )

You would create a copy of that table and add revision data:

CREATE TABLE contact_revisions ( id INT NOT NULL, name VARCHAR(255), firstname VARCHAR(255), lastname VARCHAR(255), revision_id INT auto_increment, type ENUM('INSERT', 'UPDATE', 'DELETE') NOT NULL, change_time DEFAULT current_timestamp, PRIMARY KEY(revision_id) )

Keep track of INSERT and UPDATE using AFTER triggers. On each new data revision in the original, insert a copy of the new data in the revision table and set the modification type properly.

To log a DELETE revisionally safe you must also insert a new row in the history table! For this you should use a BEFORE DELETE trigger and store the latest values before they are deleted. Otherwise you will have to remove every NOT NULL constraint in the history table as well.

Some important notes regarding this implementation

For the history table you must drop every UNIQUE KEY (here: the PRIMARY KEY) from the revision table because you will have the same key multiple times for each data revision.

When you ALTER the schema and data in the original table via an update (e.g. software update) you must ensure the same data or schema corrections are applied to the history table and its data, too. Otherwise you will run into trouble when reverting to an older revision of a record set.

In a real world implementation you would want to know which user modified the data. To have that revisionally safe a user record should be never deleted from the users table. You should just set the account disabled with a flag.

Usually, a single user action involves more than one table. In a real world implementation, you would also have to keep track which changes in multiple tables belong to a single user transaction and also in which order. In a real use case you would want to revert all changes of a single transaction together, in a reverse order. That would require an additional revision table which keeps track on the users and transactions and holds a loose relationship to all those individual revisions in the history tables.

Benefits:

completely in database, independent from application code. (well, not when tracking user transactions is important. that would require some logic outside the scope of the single query)

all data is in their original format, no implicit type conversions.

good performance on search in the revisions

easy rollback. Just do a simple INSERT .. ON DUPLICATE KEY UPDATE .. statement on the original table, using the data from the revision you want to roll back.

Merits:

Hard to implement manually.

Hard (but not impossible) to automate when it comes to database migrations / application updates.

As already stated above, doctrines versionable does something similiar.

Option B: have a central change log table

preface: bad practice, shown for illustration of the alternative only.

This approach does heavily rely on application logic, which should be hidden in a data layer / model.

You have a central history table that keeps track on

Who did

when

modify, insert or delete

what data

in which field

of which table

Like in the other approach, you may also want to track which individual data changes belong to a single user action / transaction and in which order.

Benefits:

no need to keep in sync with the original table when adding fields to a table or creating a new table. it scales transparently.

Merits:

bad practice using a simple value = key store in database

bad search performance, because of implicit type conversions

may slowdown overall performance of the application/database, when the central history table becomes a bottleneck because of write locks (this only applies for specific engines with table locks, i.e. MyISAM)

It's much harder to implement rollbacks

possible data conversion errors / precision loss because of implicit type conversion

doesn't keep track of changes when you directly access the database somewhere in your code instead of using your model / data layer and forget that in this case you must write to the revision log manually. May be a big issue when working in a team with other programmers.

Conclusion:

Option B can be very handy for small apps as a simple "drop in" when its just for logging changes.

If you want to go back in time and be able to easily compare the differences between historic revison 123 to revision 125 and/or revert to the old data, then Option A is the hard way to go.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

存储所有数据更改的每个细节（如Stackoverflow）[关闭] mysql php
2012-03-24 14:58

回答 2 已采纳 I've been thinking about that for a while now and can only think of two ways to do this. Both can
用stackoverflow提供的API下载用户数据
2016-01-20 12:24

回答 4 已采纳 page那个是分页。数据多的时候需要分页。
更改字符串中每个单词的字体颜色（JS或PHP） css javascript jquery php
2015-09-24 13:44

回答 3 已采纳 You can wrap the each word in separate span or any other element and then can be styled differentl
时间序列大数据平台建设经验谈
2018-02-07 10:37

Laurence　的博客在大数据的生态系统里，时间序列数据(Time Series Data，简称TSD)是很常见也是所占比例最大的一类数据，几乎出现在科学和工程的各个领域，一些常见的时间序列数据有：描述服务器运行状况的Metrics数据、各种IoT系统...
java中报 StackOverflowError java
2022-01-25 11:09

回答 8 已采纳你可以看看参考https://blog.csdn.net/danpu0978/article/details/107276621https://blog.csdn.net/weixin_3619403
java.lang.StackOverflowError null（栈溢出异常） java
2022-07-17 01:22

回答 1 已采纳这个堆栈溢出看异常信息应该是页面有关,请从页面相关内容进行排查.应该是和thymeleaf标签使用有问题.如有帮助,欢迎采纳!
stackoverflow网页显示字体重影，其他网页都是正常显示。 python 有问必答问答团队
2022-04-02 07:54

回答 2 已采纳亲亲，你这个是开启了3d眼镜模式，你手边如果有3d眼镜的话带上看看就知道是立体的，点击页面下方的button关闭和开启此功能哦。
2020 BAT大厂数据开发面试经验：“高频面经”之大数据研发篇
2020-02-16 14:44

大数据之眸的博客注：数据研发侧重组件框架原理和编程实践经验，在面试中也会问到数据结构与算法、机器学习算法等。以下试题为作者日常整理的通用高频面经，包含题目，答案与参考文章，欢迎纠正与补充。 _ _ _ _ 目录 1....
我需要一个正则表达式来匹配所有单个单词和每两个单词 php
2015-03-16 15:32

回答 5 已采纳 I don't think you can achieve that with just regular expressions. I would say, use explode and con
hibernate 根据id获取对象时报错StackOverflowError hibernate java
2022-12-15 19:26

回答 1 已采纳这个应该是你28行那里方法嵌套调用了自己，产生的栈溢出
Dataframe数据如何按行调整位置？ python r语言数据挖掘
2019-07-14 08:39

回答 1 已采纳问题解决了，用了下面的方法：每行排序再变回dataframe。 https://stackoverflow.com/questions/25817930/fastest-way-to-sort-
【大数据处理技术】期末复习整理
2020-07-19 21:24

鸽子不二的博客所用教材：《大数据技术原理与应用——概念、存储、处理、分析与应用（第2版）》，由厦门大学计算机科学系林子雨编著。教材官网：http://dblab.xmu.edu.cn/post/bigdata/ 慕课：...
请求大佬支援!!!报错java.lang.StackOverflowError eclipse java
2019-11-25 14:27

回答 3 已采纳我本地运行了一下，输入比较小的数的时候能够很快结束的。是不是你输入的数据太大了，导致递归层数过多，而堆栈溢出了。试试比较小的数看看，代码没有问题。
数据分析大数据面试题大杂烩02
2021-03-09 16:30

爱学习的菜鸟罢了的博客当写入的数据达到设定的阈值时,系统将会启动一个线程将缓冲区的数据写到磁盘,这个过程叫做spill(spill写入之前,会先进行二次排序,首先根据数据所属的partition进行排序,然后每个partition中的数据再按key来排序 ....
当我说要做大数据工程师时他们都笑我，直到三个月后……
2017-10-25 14:52

软件供应链安全的博客作者：Fickr孫啟誠原文：三个月大数据研发学习计划实战解析关注微信公众号：「GitChat 技术杂谈」一本正经的讲技术【不要错过文末彩蛋】申明：本文旨在为普通程序员（Java程序员最佳）提供一个入门级别的...
当我说转行大数据工程师时，众人笑我太疯癫，直到四个月后......
2022-10-06 15:22

大数据研习社的博客本人目前是一名大数据高级工程师，项目数据容量100P+，日处理数据量200T+，集群规模1000+节点，个人是Java前后端开发，因公司项目开发需要，边学习边做项目，四个月成功完成公司项目并成功转型大数据工程师，后经过...
大数据技术原理与应用：期末考点总结
2021-02-18 22:37

虾米奥的博客个人期末复习材料，根据林子雨的大数据技术教材与其它资料整理。第一章 大数据概述 1.大数据的4v特征数据量大 volume 价值密度低 value 数据类型繁多 variety 处理速度快 velocity 2.大数据3种思维方式的转变在...
Haddop+spark大数据分析（二）之Hadoop 集群的搭建
2022-08-16 13:34

xiaoweiwei99的博客 HOME=JAVA_HOME的地址 export JAVA_HOME=${JAVA_HOME} 如下图配置 core-site.xml 创建 HDFS 数据存储目录，我的存储路径是放在$HADOOP_HOME目录下的 /hdfs_data/ mkdir /usr/local/hadoop/hadoop-3.3.0/hdfs_data ...
大数据面试题
2022-07-14 10:15

AllenGd的博客 Java栈是与每一个线程关联的，JVM在创建每一个线程的时候，会分配一定的栈空间给线程。存储局部变量、引用、方法、返回值等。 StackOverflowError：如果在线程执行的过程中，栈空间不够用，那么JVM就会抛出此异常，...
没有解决我的问题, 去提问

悬赏问题

¥15 微信公众平台自制会员卡可以通过收款码收款码收款进行自动积分吗
¥15 随身WiFi网络灯亮但是没有网络，如何解决？
¥15 gdf格式的脑电数据如何处理matlab
¥20 重新写的代码替换了之后运行hbuliderx就这样了
¥100 监控抖音用户作品更新可以微信公众号提醒
¥15 UE5 如何可以不渲染HDRIBackdrop背景
¥70 2048小游戏毕设项目
¥20 mysql架构，按照姓名分表
¥15 MATLAB实现区间[a,b]上的Gauss-Legendre积分
¥15 delphi webbrowser组件网页下拉菜单自动选择问题

存储所有数据更改的每个细节（如Stackoverflow）[关闭]

2条回答 默认 最新

悬赏问题

2条回答默认最新