SQL Server 2005: Recursive Hierarchies to XML

最新推荐文章于 2023-04-24 13:46:01 发布

huayang912

最新推荐文章于 2023-04-24 13:46:01 发布

阅读量447

点赞数

分类专栏： SQL SERVER 文章标签： sql server insert tsql hierarchy table performance

SQL SERVER 专栏收录该内容

9 篇文章 0 订阅

订阅专栏

Suppose we have a recursive hierarchy stored in a relational database and we want to write it to XML. This might be for a variety of reasons – e.g. as a pre-cached input to a UI control, to export to another system using a pre-defined format, etc.

In SQL Server 2000, in order to get it straight to XML using FOR XML EXPLICIT, we would have to know the depth of the lowest node beforehand (without doing some very ugly dynamic SQL), so this does not help us.

It would be useful to access the data in the same order that it will appear in the XML. I.e.

Node1
- Node2
  - Node3
  - Node4
- Node5

Getting at the data in this order will allow us to iterate through the nodes in sequential order. This avoids using the DOM and is significantly quicker and more efficient as it avoids loading the whole structure into memory.

We could achieve this in SQL Server 2000 using a recursive table-valued UDF. In SQL Server 2005, we also have the option of using a recursive Common Table Expression (CTE) to achieve the same functional result. Let’s compare the two ways of doing it.

A CTE is a temporary named resultset referenced by a subsequent “outer query”. They can provide similar functionality to views and derived tables, but their real value is in recursive queries. Recursive CTE’s contain an “anchor member” and a “recursive member”, which are connected by a UNION ALL operator. They can be encapsulated by UDFs for reusability.

Data Preparation

Let’s create a table and insert hierarchical values.

CREATE TABLE Employees

(

empid int NOT NULL,

mgrid int NULL,

empname varchar(25) NOT NULL,

CONSTRAINT PK_Employees PRIMARY KEY(empid),

CONSTRAINT FK_Employees_mgrid_empid

FOREIGN KEY(mgrid)

REFERENCES Employees(empid)

)

CREATE INDEX idx_nci_mgrid ON Employees(mgrid)

SET NOCOUNT ON

INSERT INTO Employees VALUES(1 , NULL, ‘Nancy’)

INSERT INTO Employees VALUES(2 , 1 , ‘Andrew’)

INSERT INTO Employees VALUES(3 , 1 , ‘Janet’)

INSERT INTO Employees VALUES(4 , 1 , ‘Margaret’)

INSERT INTO Employees VALUES(5 , 2 , ‘Steven’)

INSERT INTO Employees VALUES(6 , 2 , ‘Michael’)

INSERT INTO Employees VALUES(7 , 3 , ‘Robert’)

INSERT INTO Employees VALUES(8 , 3 , ‘Laura’)

INSERT INTO Employees VALUES(9 , 3 , ‘Ann’)

INSERT INTO Employees VALUES(10, 4 , ‘Ina’)

INSERT INTO Employees VALUES(11, 7 , ‘David’)

INSERT INTO Employees VALUES(12, 7 , ‘Ron’)

INSERT INTO Employees VALUES(13, 7 , ‘Dan’)

INSERT INTO Employees VALUES(14, 11 , ‘James’)

Recursive Table-Valued UDF Example

CREATE FUNCTION dbo.fn_EmpRec

(

@empid int,

@depth int

)

RETURNS @Emps TABLE

(

empid int,

empname varchar(25),

mgrid int,

depth int

)

BEGIN

– insert current employee into working table

INSERT INTO @Emps

SELECT empid,

empname,

mgrid,

@depth

FROM Employees

WHERE empid = @empid

– holding variable to keep track of current child employee

DECLARE @curempid int

– get the first child

SELECT @curempid = MIN(empid)

FROM Employees

WHERE mgrid = @empid

– iterate each child and make the recursive call

WHILE @curempid IS NOT NULL

BEGIN

INSERT INTO @Emps

SELECT *

FROM dbo.fn_EmpRec(@curempid, @depth + 1)

SELECT @curempid = MIN(empid)

FROM Employees

WHERE empid > @curempid AND

mgrid = @empid

END

RETURN

END

And if we run the following query,

SELECT

REPLICATE(‘| ', depth)

+ ‘(' + (CAST(empid AS VARCHAR(10))) + ‘) '

+ empname AS empname

FROM dbo.fn_EmpRec(1, 0)

The resultset is:

Great! - (until we try it on 25,000 rows or above that is)

Common Table Expression Example

Now let’s do it the CTE way.

WITH EmpCTE(empid, empname, mgrid, depth, sortcol)

(

– anchor member

SELECT empid, empname, mgrid, 0,

CAST(empid AS VARBINARY(900))

FROM employees

WHERE empid = 1

UNION ALL

– recursive member

SELECT E.empid, E.empname, E.mgrid, M.depth+1,

CAST(sortcol + CAST(E.empid AS BINARY(4)) AS VARBINARY(900))

FROM Employees AS E

JOIN EmpCTE AS M

ON E.mgrid = M.empid

)

– outer query

SELECT

REPLICATE(‘| ‘, depth)

+ ‘(’ + (CAST(empid AS VARCHAR(10))) + ‘) ‘

+ empname AS empname

FROM EmpCTE

ORDER BY sortcol

This returns the same resultset as the recursive UDF.

Once you get used to CTEs, this is significantly easier to write than the UDF. The key thing to notice here is the varbinary sortcol column on which the query is sorted. It is the full hierarchical chain of empids leading to the current node appended together in order.

By checking the value of GETDATE() before and after running the UDF/CTE, we can see a significant performance gain with the CTE. But I actually tried it on a hierarchy with over 50,000 members. The UDF took over half an hour. THE CTE SAW A PERFORMANCE GAIN OF ABOUT 90% !

For further details, refer to Itzik Ben-Gan’s TSQL Enhancements in SQL Server 2005 white paper.

当有多个顶级节点时，采用游标：

ALTER PROCEDURE [dbo].[IssueTracker_ProjectModules_GetAllModulesByProjectId] @ProjectId INT
AS
SET NOCOUNT ON
SET TRANSACTION ISOLATION LEVEL READ COMMITTED

    DECLARE @TopModuleID INT,
        @TSQL NVARCHAR(4000)
    SET @TSQL = ''
    DECLARE ModuleCoursor CURSOR
        FOR SELECT ModuleID
            FROM    IssueTracker_ProjectModules
            WHERE   ISNULL(ParentModuleID,0) = 0
    OPEN ModuleCoursor
    FETCH NEXT FROM ModuleCoursor INTO @TopModuleID
    WHILE @@FETCH_STATUS = 0
        BEGIN

            SELECT @TSQL = @TSQL
                    + 'SELECT ModuleID, REPLICATE(''         '', depth) + ''└'' + ModuleName AS ModuleName
      FROM    dbo.fn_ModuleRec(''' + CONVERT(VARCHAR, @TopModuleID)
                    + ''', 0) UNION ALL '

            FETCH NEXT FROM ModuleCoursor INTO @TopModuleID
        END

-- 去掉 UNION ALL

    CLOSE ModuleCoursor
    DEALLOCATE ModuleCoursor
    IF LEN(@TSQL) > 8
        BEGIN
            SELECT @TSQL = SUBSTRING(@TSQL, 0, LEN(@TSQL) - 8)
        END
    PRINT @TSQL

EXEC SP_EXECUTESQL @TSQL

结果：

ModuleID ModuleName
11 └基础数据
13          └客户类型
16          └销售渠道
18 └系统管理
20          └角色管理
23                   └测试管理
21          └权限管理
22 └销售管理

结果虽然好看点，但是它绑定到dropdownlist 时，前面的空格自动没了，所以没有树形效果，郁闷，如果哪位有解决办法，请告诉我，谢谢！

huayang912

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
SQL Server 2005: Recursive Hierarchies to XML

Suppose we have a recursive hierarchy stored in a relational database and we want to write it to XML. This might be for a variety of reasons – e.g. as a pre-cached input to a UI control, to export to
复制链接

扫一扫

专栏目录