Informatica中实现count(distinct)

最新推荐文章于 2022-04-01 11:33:18 发布

thy822

最新推荐文章于 2022-04-01 11:33:18 发布

阅读量2.1k

点赞数

分类专栏： Informatica

Informatica 专栏收录该内容

15 篇文章 1 订阅

订阅专栏

Thanks for your response. Here is the sample data information:

A id1 $200
A id1 $300
A id2 $150
B id3 $100
B id4 $20

I want the following in the output:
Name distinct-Count Totalamt
A 2 $650

B 2 $120

One way to do this is to use 2 aggregators. The First aggregator group and sum amt by Name and ID.
On the second aggregator group by Name and count id and sum amt.

So if I remove the duplicate row in the Sorter I will get incorrect distinct-count and totalamt for "A".

There are two different ways.

1) Suggested by Manas - to use two aggregates.
2)Flaging the record as 1 and 0 before aggregate. For this, you should have sorted data on name and id.
If you have sorted data coming from source, try 2nd option as given below.
In expression transformation before aggregator.

IN_ID
v_ID_CNT = IIF(ISNULL(v_ID) Or IN_ID!= V_ID,1, 0)
O_ID_CNT=V_ID_CNT
V_ID=IN_ID

O_ID_CNT will go in to your aggregator.

You need to create one more port in aggregator for SUM(O_ID_CNT) which will return the distinct count of IDs.
(No change in GROUP BY columns)