douzhang7603 2016-01-03 05:20

mysql数据库有很多很多关系混乱

I have 4 tables

hubs | countries | categories | news

here hubs and countries have many to many relation

country_hub

id   
hub_id    
country_id

and then this pivot table country_hub has many to many relation with categories so I did like

category_country_hub

id   
country_hub_id   
category_id

and again this table has many to many relation with news table

category_country_hub_news

category_country_hub_id      
news_id

this is giving me a complicate relation to query

so I am thinking of modifying the relation like

country_hub

country_id   
hub_id

category_country_hub

country_id   
hub_id   
category_id

category_country_hub_news

hub_id   
country_id   
category_id   
news_id

which is one to many relation with hubs/countries/ categories

is there any better way to handle these kind of relation please help or any tutorials links

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

doudi8829 2016-01-03 09:03

关注

Ok, clear now. Expanded comments below, here is the summary:

tl;dr:
1) your revised approach makes more sense to me.
2) your naming conventions could use some polish, will improve readability for humans (specifically the 'tokens' in table names matching order of columns in table, fwiw the database itself won't care).
3) book: I will recommend "SQL for Smarties" (Celko), which goes into some of the modeling issues you're dealing with. http://www.amazon.com/Joe-Celkos-Smarties-Fourth-Edition/dp/0123820227

Let's dig into the table definitions... I can't reason well from a text summary, my brain works better if I can see examples.

Let me know if the examples are (more or less) suitable.

raw data tables

Seems ok to call these fact tables.

|-----------------|---------------|---------------|--------------------|
| select * from   | select * from | select * from | select * from      |
| COUNTRIES       | HUBS          | CATEGORIES    | NEWS               |
|-----------------|---------------|---------------|--------------------|
|  id :   name    |   id : name   |  id : name    |    id :   title    |
| --- : --------- |  --- : -----  | --- : ------- |  ---- : -----------|
| 101 : China     |  201 : X      | 301 : Red     |   401 : 'aa aaaa a'|
| 102 : Nepal     |  202 : Y      | 302 : Blue    |   402 : 'bbbb b bb'|
| 103 : Australia |  203 : Z      | 303 : Green   |   403 : 'cc ccc cc'|
| 104 : NewZealand|  ...etc...    | 304 : Orange  |   404 : 'ddddd d'  |
|   ...etc...     |               | ...etc...     |   405 : 'ee eeee'  | 
|-----------------|---------------|---------------|--------------------|

original relation tables

Observation: These are not really dimension tables, I don't see an obvious hierarchy here.

Let's carry this out a little further.

  |-----------------------|---------------------------|--------------------------|
  | select * from         | select * from             | select * from            |
  | COUNTRY_HUB           | CATEGORY_COUNTRY_HUB      | CATEGORY_COUNTRY_HUB_NEWS|
  |-----------------------|---------------------------|--------------------------|
  |     :        : country|    : country  :  category |        cat_cnt  : news   |
  |  id : hub_id : _id    | id :  _hub_id :  _id      |  id  : _hub_id  : _id    |
  |---- : ------ : -------|----: -------- : ----------| ---- : -------- : ------ |
  |  11 :    101 :  201   | 21 :     11   :   301     |  31  :    21    :   401  |
  |  12 :    101 :  202   | 22 :     11   :   303     |  32  :    21    :   403  |
  |  13 :    101 :  203   | 23 :     12   :   302     |  33  :    21    :   404  |
  |  14 :    102 :  200   | 24 :     12   :   304     |  34  :    22    :   405  |
  | ...etc...             | ...etc...                 |  ...etc...               |
  |-----------------------|---------------------------|--------------------------|

Yes, this is starting to look complicated. :-)

observation: If you were going to stay with the approach, I think it could be a little easier if you follow a naming convention embedding the Raw Data tables last:

 Original tbl names         |    Notes
----------------------------|------------------------------------------------------
COUNTRY_HUB                 | Two raw-data id#'s (hub_id & country_id)s
----------------------------|------------------------------------------------------
CATEGORY_COUNTRY_HUB        | One raw data id#, last column (category_id), but CATEGORY_... first
                            | token in the table name.
                            | I will suggest COUNTRY_HUB_CATEGORY would be easier to read
                            | for human readers, since both right-most column and right-most token
                            | in the table name tie back to the same concept (the CATEGORY raw data table).
----------------------------|------------------------------------------------------
CATEGORY_COUNTRY_HUB_NEWS   | One raw data id#, last column (news_id), also _NEWS  is last token
                            | in the table name, easier for human readers to parse & follow.
----------------------------|------------------------------------------------------

modified relationship tables

This looks better.

  |-----------------------|-------------------------------|-------------------------------------------|
  | select * from         | select * from                 | select * from                             |
  | COUNTRY_HUB           | CATEGORY_COUNTRY_HUB          | CATEGORY_COUNTRY_HUB_NEWS                 |
  |-----------------------|-------------------------------|-------------------------------------------|
  |     : country: hub    |    : country : hub  : category|      : hub   : country : category : news  |
  |  id : _id    : _id    | id : _id     : _id  :  _id    |  id  : _id   : _id     : _id      : _id   |
  |---- : ------ : -------|----: --------: ---- : --------| ---- : ----- : ------- : -------- : ----- |
  |  11 :    201 :  101   | 21 : 201     :  101 :  301    |  31  :  101  :    201  :  301     : 401   |
  |  12 :    202 :  101   | 22 : 201     :  101 :  302    |  32  :  101  :    201  :  301     : 401   |
  |  13 :    203 :  102   | 23 : 201     :  101 :  303    |  33  :  102  :    201  :  301     : 401   |
  |  14 :    204 :  102   | 24 : 201     :  102 :  301    |  34  :  102  :    201  :  301     : 402   |
  | ...etc...             | ...etc...                     |  ...etc...                                |
  |-----------------------|-------------------------------|-------------------------------------------|

About Naming Conventions The table-name "tokens" still don't follow the column order. As a favor to yourself and future maintainers, consider changing that:

COUNTRY_HUB is fine.
CATEGORY_COUNTRY_HUB still seems flipped, use COUNTRY_HUB_CATEGORY
CATEGORY_COUNTRY_HUB_NEWS doesn't follow from previous, I would use COUNTRY_HUB_CATEGORY_NEWS and adjust the columns accordingly (though I
don't know enough about your data relationships to comment on what is
the best order).

The thing that you have implicit in your naming is a rough "category"

overly simplistic:
   Each COUNTRY has 0..many HUBS.
   Each HUB has 0..many CATEGORIES.
   Each CATEGORY  has 0..many NEWS items.

I'll suggest you work on making your table-name "tokens" match the "column order". You seem to have (in order of few to many):

COUNTRY  : COUNTRIES (relatively few)
HUB      : HUBS (# of HUBS greater than # of COUNTRIES)
CATEGORY : Assigned CATEGORIES (# of COUNTRY+HUB+CATEGORY combinations exceeds # of previous)
NEWS     : Assigned NEWS items (# of COUNTRY+HUB+CATEGORY+ combinations exceeds # of previous)

Let's do a little data modeling and describe the relationships...

COUNTRY <*----*> HUB
   Each COUNTRY has 0..many HUBS.  
   A given HUB may be associated w/multiple COUNTRIES.


HUB ----*> CATEGORY
or..?
COUNTRY + HUB <*----*> CATEGORY
   Your tables suggest CATEGORIES do not simply associate directly with a given HUB.
   Consider HUB.id=101 name='X' 
      X.China.categories = ( Blue, Yellow );
      X.Nepal.categories = ( Orange, Green );
      X.Australlia.categories = ( ); e.g. none.

   Instead of all countries associated with that HUB sharing the same "HUB CATEGORIES",
   it sounds like the CATEGORIES are like "tags" and that the various countries involved
   with a given HUB can have their collection of 0..many CATEGORIES.
   It seems weird, but I don't know your data.
   In the interests of simplifying I would try to make CATEGORIES be HUB-specific, not
   HUB+COUNTRY specific... but that may be unavoidable for you.

COUNTY + HUB + CATEGORY <*----*> NEWS
   This suggests that a given NEWS item can be associated with 2+ (COUNTRY+HUB+CATEGORY) triples.
   If that is what you need, then it can't be avoided.

You're going to have a challenge keeping all of the relationships up to date.

You will want to study up on foreign key constraints and cascading deletes.

I did greatly enjoy this book: SQL for Smarties (Celko), which goes into some of the modeling issues you're dealing with.

Splitting them out the way you are has the advantage of avoiding some anomalies (one of the examples Celko uses involved class scheduling at a school: teachers, classes, rooms, students and the relationships between them). I will recommend the book, I think it reads well.

报告相同问题？

关注问题

mysql数据库多表查询的问题 mysql 数据库
2016-12-29 01:24

回答 9 已采纳 select *,'项目1 ' as ITEM from table1 union all select * ,'项目2 ' as ITEM from table1 union all sel
关于mysql关系数据库问题 mysql
2022-04-26 16:37

回答 4 已采纳是的，很多搜索引擎也就是通过这样把数据记录成json保存在内存中
在sql中多大的数据才算是大数据？ java mysql 数据库
2022-03-31 17:24

回答 5 已采纳其实没有实际的标准明确定义多少数据量算大数据，不过阿里开发手册中建议，表数据超过500万条时，建议考虑分表，以防影响查询效率，不过我们公司也有单表超过几千万条的数据，效率确实不高，所以理论上百万级别以
MySQL数据库面试题（2020最新版）
2020-03-10 17:20

ThinkWon的博客 数据库三大范式是什么mysql有关权限的表都有哪几个MySQL的binlog有有几种录入格式？分别有什么区别？数据类型mysql有哪些数据类型引擎MySQL存储引擎MyISAM与InnoDB区别MyISAM索引与InnoDB索引的区别？InnoDB引擎的4...
请问MySQL数据库拒绝访问是什么情况？ mysql 数据库
2022-04-10 09:25

回答 3 已采纳根据描述很可能是用户授权问题，看一下localhost对应的账号和密码，如果都正确，就要查看是否有localhost访问权限了，一般默认是有的，如果有人改了授权，指定了某个ip能访问，那就无法访问了，
接触大数据有一年了，现在很迷茫大数据
2022-09-21 20:59

回答 1 已采纳你好，涉猎广，这本身是没错的，这能够让你的知识面更广，从而让你在工作和交谈中得心应手。但“长板效应”又告诉我们人不可贪多，应当有所取舍。这就需要你在工作中抓住自己的兴趣和喜好，强化自己的特长，提升自己
mysql的多对多关系映射查询问题 java mysql
2017-07-16 07:47

回答 3 已采纳我刚刚学习java2个月吧，萌新说错了不要怪罪说说思路，因为我们要找出即是女人又是变态的，而只有表三中我们可以看到这个关系，所以首先我们应该找通过表3找出既是女人又是变态的记录，就是表三里面 s
从曾经的一家独大到现在的群雄逐鹿，大数据时代的数据库圈为啥如此之乱？
2022-10-10 16:13

梦回从前的博客天下合久必分，分久必合；江山代有人才出，各领风骚数百年这些古语显然也适用于数据库的市场。相对于数据量迅速膨胀这个表象之外，数据类型复杂度的...文章到这里就结束了，最后路漫漫其修远兮，大数据之路还很漫长。
mysql表记录很多查询慢是否影响插入 mysql 数据库
2017-05-03 08:00

回答 3 已采纳在查询一个表时，服务器首先会对这个对象加锁，为了保证数据的统一性，所以在查询的过程中，如果前台插入语句，是需要等待的。具体解释可以看下sql执行过程： http://www.cnblogs.
请问Mysql对比其他关系型数据库有什么优势 mysql 有问必答
2021-03-09 17:13

回答 6 已采纳 Mysql相比于其他关系型数据库来说有几个优点 1.其体积小,总体拥有成本低,开放源码(免费) 2.性能卓越,新版的mysql在以下方面带来了更好的性能：读/写工作负载、IO 密集型工作负载、以
MySQL数据库该怎么下载 mysql sql 数据库
2023-03-20 22:32

回答 2 已采纳右向外连接是将返回右表的所有行，左向外连接的结果集包括LEFTOUTER子句中指定的左表的所有行。右向外连接，如果右表的某行在左表中没有匹配行，则将为左表返回空值；如果左表的某行在右表中没有匹配行，则
MySQL数据库面试题（2022最新版）
2022-05-26 20:35

java领域的博客 mysql有关权限的表都有哪几个 MySQL的binlog有有几种录入格式？分别有什么区别？数据类型 mysql有哪些数据类型引擎 MySQL存储引擎MyISAM与InnoDB区别 MyISAM索引与InnoDB索引的区别？ InnoDB引擎的4大特性 ...
MySQL数据库的子查询 mysql sql 数据库
2023-03-21 08:32

回答 2 已采纳当一个查询是另一个查询的条件时,称之为子查询。子查询可以使用几个简单命令构造功能强大的复合命令。子查询最常用于SELECT-SQL命令的WHERE子句中。
python远程连接mysql数据库_python连接远程mysql数据库
2020-12-18 10:17

weixin_39710462的博客 Python操作MySQL基本环境搭建及增删改查实现写作原因：这篇文章将带领读者使用Python操作MySQL数据库。在PHP和Python之间博主更倾向于Python，而后台开发免不了要操作数据库，所以就有了这篇文章。安装Python鉴于上...
mysql数据库备份
2021-12-20 20:44

m0_64896523的博客 mysql数据库备份 mysqldump命令备份单个数据库 mysqldump -u root -p --databases dgf > /opt/dgf.sql -u 指定用户 -p 指定密码 > /opt/dgf.sql指定备份目录一定要.sql结尾多个数据库备份加...
没有解决我的问题, 去提问

悬赏问题

¥50 如何用脚本实现输入法的热键设置
¥20 我想使用一些网络协议或者部分协议也行，主要想实现类似于traceroute的一定步长内的路由拓扑功能
¥30 深度学习，前后端连接
¥15 孟德尔随机化结果不一致
¥15 apm2.8飞控罗盘bad health，加速度计校准失败
¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
¥15 谁有desed数据集呀
¥20 手写数字识别运行c仿真时，程序报错错误代码sim211-100
¥15 关于#hadoop#的问题
¥15 (标签-Python|关键词-socket)

码龄粉丝数原力等级 --

mysql数据库有很多很多关系混乱

1条回答默认最新

码龄粉丝数原力等级 --

raw data tables

original relation tables

modified relationship tables

悬赏问题

mysql数据库有很多很多关系混乱

1条回答 默认 最新

raw data tables

original relation tables

modified relationship tables

悬赏问题

1条回答默认最新