EAV Query Solution (Vertical Table Query)

Latest recommended article published on 2022-08-04 11:45:35

envykok

Category: SQL Classic Problems. Tags: query, insert, table, attributes, join, constraints

Article link: https://blog.csdn.net/envykok/article/details/5646265


Refer to:

http://stackoverflow.com/questions/663040/query-several-eav-attributes-in-separate-columns

http://structureddata.org/2009/03/19/the-impact-of-good-table-and-query-design/

Refer to: http://www.sqlservercentral.com/articles/Database+Design/62386/

Before going deeper into examples of this approach, let us look at the pros and cons of the EAV (entity-attribute-value) approach to data modelling. The lists below are not exhaustive and you could add more to them; I have listed only the most obvious ones.

1. Advantages

- Very flexible: you can add an attribute that you did not know in advance without redesigning your database structure.
- Can be a good way to collect the attributes of unknown data. For example, it is difficult to know the attributes of some complex research data in advance, and this methodology helps to assemble those attributes so that they can feed into proper data modelling.
- Database-level complexity is reduced when inserting data. Only one procedure is needed to insert into the EAV table, or at most two if you partition the table to improve performance.
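
As a rough illustration of the first and third points, here is a minimal sketch in Python using the standard-library `sqlite3` module and an in-memory database. The table layout mirrors the EAV table used in the example later in this post; the `set_attribute` helper and the `material` attribute are hypothetical names made up for the illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE EAV (
        subscriber_id   INTEGER NOT NULL,
        attribute_id    TEXT    NOT NULL,
        attribute_value TEXT    NOT NULL,
        PRIMARY KEY (subscriber_id, attribute_id)
    )
""")

def set_attribute(conn, subscriber_id, attribute_id, attribute_value):
    """Hypothetical single insert routine: every attribute goes through it."""
    conn.execute(
        "INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) "
        "VALUES (?, ?, ?)",
        (subscriber_id, attribute_id, str(attribute_value)),
    )

set_attribute(conn, 1, "color", "red")
set_attribute(conn, 1, "size", "xl")
# 'material' (assumed attribute) was never declared: no ALTER TABLE is needed.
set_attribute(conn, 1, "material", "cotton")

attributes = [row[0] for row in conn.execute(
    "SELECT attribute_id FROM EAV WHERE subscriber_id = 1 ORDER BY attribute_id"
)]
print(attributes)
```

The new attribute is just another row, which is exactly why the schema never has to change.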

2. Disadvantages

- Difficult to query and transform the data into meaningful information. To display the data in a meaningful tabular format, your query may involve many CASE statements, sub-queries, self joins and so on. This hurts query performance, especially as the number of rows grows.
- Unless the table is partitioned, it can grow fast and querying it takes longer.
- It is not possible to enforce business rule constraints and default values for an attribute, because attributes are modelled as data.
- Difficult to enforce the integrity of the data, again because attributes are modelled as data.
- Could force the value column to a text data type, because the values of the name-value pairs can range from bit to text. This results in an inefficient storage design.
- An effective index design strategy is difficult. Indexing the value column can be very hard, because it can result in a wide index that slows down inserts and updates.
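
To show what the first disadvantage looks like in practice, the sketch below pivots EAV rows back into one row per subscriber with one CASE expression per attribute. It is written in Python with the standard-library `sqlite3` module; the table layout and sample rows are borrowed from the example later in this post, and the SQLite column types are an assumption for the sake of a runnable example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE EAV (
        subscriber_id   INTEGER NOT NULL,
        attribute_id    TEXT    NOT NULL,
        attribute_value TEXT    NOT NULL,
        PRIMARY KEY (subscriber_id, attribute_id)
    )
""")
conn.executemany("INSERT INTO EAV VALUES (?, ?, ?)", [
    (1, "color", "red"), (1, "size", "xl"), (1, "garment", "shirt"),
    (2, "color", "red"), (2, "size", "xl"), (2, "garment", "pants"),
    (3, "garment", "pants"),
])

# One CASE expression per attribute; MAX() collapses each group to the
# single non-NULL value (or NULL when the attribute is missing).
PIVOT_SQL = """
    SELECT subscriber_id,
           MAX(CASE WHEN attribute_id = 'color'   THEN attribute_value END) AS color,
           MAX(CASE WHEN attribute_id = 'size'    THEN attribute_value END) AS size,
           MAX(CASE WHEN attribute_id = 'garment' THEN attribute_value END) AS garment
    FROM EAV
    GROUP BY subscriber_id
    ORDER BY subscriber_id
"""
rows = conn.execute(PIVOT_SQL).fetchall()
for row in rows:
    print(row)
```

Every additional attribute needs another CASE line in the query, so the query text grows with the attribute list even though the table itself never changes.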

So, looking at the pros and cons of using EAV for data modelling, I would conclude that there may be occasions where you are better off using this approach. But for a data model where most of the attributes are known and which is tailored to do a specific thing, which is what most databases are for, it is better not even to consider it.
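
The contrast can be sketched as follows, again in Python with the standard-library `sqlite3` module. The `subscriber_garment` table, its allowed sizes and the `'unknown'` default are all hypothetical, chosen only to show that a conventional table can carry a default and a CHECK constraint while an EAV table accepts whatever value it is given.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Conventional design (hypothetical): attributes are columns, so rules attach to them.
conn.execute("""
    CREATE TABLE subscriber_garment (
        subscriber_id INTEGER PRIMARY KEY,
        color   TEXT NOT NULL DEFAULT 'unknown',
        size    TEXT CHECK (size IN ('s', 'm', 'l', 'xl')),
        garment TEXT NOT NULL
    )
""")
conn.execute(
    "INSERT INTO subscriber_garment (subscriber_id, garment) VALUES (3, 'pants')"
)
default_color = conn.execute(
    "SELECT color FROM subscriber_garment WHERE subscriber_id = 3"
).fetchone()[0]

try:
    conn.execute(
        "INSERT INTO subscriber_garment (subscriber_id, size, garment) "
        "VALUES (7, 'huge', 'shirt')"
    )
    rejected = False
except sqlite3.IntegrityError:
    rejected = True  # the CHECK constraint refused the invalid size

# EAV design: the same invalid size is just another string value.
conn.execute("""
    CREATE TABLE EAV (
        subscriber_id   INTEGER NOT NULL,
        attribute_id    TEXT    NOT NULL,
        attribute_value TEXT    NOT NULL,
        PRIMARY KEY (subscriber_id, attribute_id)
    )
""")
conn.execute("INSERT INTO EAV VALUES (7, 'size', 'huge')")
eav_rows = conn.execute("SELECT COUNT(*) FROM EAV").fetchone()[0]

print(default_color, rejected, eav_rows)
```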

Example:

Table 1: Attribute table

CREATE TABLE EAV (
    subscriber_id   INT      NOT NULL DEFAULT 0,
    attribute_id    CHAR(62) NOT NULL DEFAULT '',
    attribute_value CHAR(62) NOT NULL DEFAULT '',
    PRIMARY KEY (subscriber_id, attribute_id)  -- one value per attribute per subscriber
);
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (4,'color','blue');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (4,'garment','shirt');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (5,'color','blue');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (5,'size','xl');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (5,'garment','shirt');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (6,'color','red');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (6,'garment','shirt');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (1,'color','red');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (1,'size','xl');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (1,'garment','shirt');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (2,'color','red');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (2,'size','xl');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (2,'garment','pants');
INSERT INTO EAV (subscriber_id, attribute_id, attribute_value) VALUES (3,'garment','pants');

Table 2: Main table

CREATE TABLE #TEMP_Table
(
    subscriber_id INT
);
INSERT INTO #TEMP_Table VALUES (1);
INSERT INTO #TEMP_Table VALUES (2);
INSERT INTO #TEMP_Table VALUES (3);
INSERT INTO #TEMP_Table VALUES (4);
INSERT INTO #TEMP_Table VALUES (5);
INSERT INTO #TEMP_Table VALUES (6);

Question: Find every 'subscriber_id' in the main table (#TEMP_Table) whose color is 'red' and whose size is 'xl'.

Solution 1: Create a helper table (query cost: 13%)

CREATE TABLE CRITERIA (
    attribute_id    CHAR(62) NOT NULL DEFAULT '',
    attribute_value CHAR(62) NOT NULL DEFAULT ''
);

-- One row per condition that a subscriber must satisfy
INSERT INTO CRITERIA (attribute_id, attribute_value) VALUES ('color', 'red');
INSERT INTO CRITERIA (attribute_id, attribute_value) VALUES ('size', 'xl');

SELECT subscriber_id
FROM #TEMP_Table
WHERE subscriber_id IN (
    SELECT E.subscriber_id
    FROM EAV AS E
    RIGHT JOIN CRITERIA AS CR
        ON (E.attribute_id = CR.attribute_id AND E.attribute_value = CR.attribute_value)
    GROUP BY E.subscriber_id
    -- Keep only subscribers that matched every row in CRITERIA
    HAVING COUNT(E.subscriber_id) = (SELECT COUNT(attribute_id) FROM CRITERIA)
);
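
For readers who want to try the query without a SQL Server instance, the following is a sketch that reruns Solution 1 in Python with the standard-library `sqlite3` module. It is an adaptation, not the original T-SQL: the column types become `INTEGER`/`TEXT`, `#TEMP_Table` becomes a SQLite `TEMP` table named `TEMP_Table`, and the `RIGHT JOIN` is swapped for an `INNER JOIN`, which returns the same subscribers here because the `HAVING` clause discards any unmatched criteria rows anyway.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE EAV (
        subscriber_id   INTEGER NOT NULL,
        attribute_id    TEXT    NOT NULL,
        attribute_value TEXT    NOT NULL,
        PRIMARY KEY (subscriber_id, attribute_id)
    );
    CREATE TEMP TABLE TEMP_Table (subscriber_id INTEGER);
    CREATE TABLE CRITERIA (
        attribute_id    TEXT NOT NULL,
        attribute_value TEXT NOT NULL
    );
""")
conn.executemany("INSERT INTO EAV VALUES (?, ?, ?)", [
    (4, "color", "blue"), (4, "garment", "shirt"),
    (5, "color", "blue"), (5, "size", "xl"), (5, "garment", "shirt"),
    (6, "color", "red"), (6, "garment", "shirt"),
    (1, "color", "red"), (1, "size", "xl"), (1, "garment", "shirt"),
    (2, "color", "red"), (2, "size", "xl"), (2, "garment", "pants"),
    (3, "garment", "pants"),
])
conn.executemany("INSERT INTO TEMP_Table VALUES (?)", [(i,) for i in range(1, 7)])
conn.executemany("INSERT INTO CRITERIA VALUES (?, ?)",
                 [("color", "red"), ("size", "xl")])

# A subscriber qualifies when it matches as many rows as CRITERIA contains.
QUERY = """
    SELECT subscriber_id
    FROM TEMP_Table
    WHERE subscriber_id IN (
        SELECT E.subscriber_id
        FROM EAV AS E
        INNER JOIN CRITERIA AS CR
            ON E.attribute_id = CR.attribute_id
           AND E.attribute_value = CR.attribute_value
        GROUP BY E.subscriber_id
        HAVING COUNT(*) = (SELECT COUNT(*) FROM CRITERIA)
    )
    ORDER BY subscriber_id
"""
matches = [row[0] for row in conn.execute(QUERY)]
print(matches)  # [1, 2]
```

Subscribers 5 and 6 each satisfy only one of the two criteria, so the count comparison filters them out. Note that the approach relies on CRITERIA holding no duplicate rows and on the EAV primary key allowing one value per attribute per subscriber.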