Data Analytics for Business BISM7233

SSIS Task: “Company_data.csv” contains information about each company, but some of the state codes are missing from this table. Use “state_code.csv” to fill in the blank state code cells before creating the company dimension table. Sector and industry information is provided in “industry.csv” and “sector.csv”. “Bankruptcy_1.csv” and “Bankruptcy_2.csv” contain the details of the bankruptcies and form the fact table in the dimension model. Connect the fact table to the company dimension table you created in the previous step to build the dimension model for this case study. In particular, the integrated data should allow an analyst to generate reports on the factors that help explain company bankruptcies.
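
The state code fill described above can be sketched outside SSIS in plain Python. This is only an illustration of the lookup logic, not the assignment's SSIS package. The column names `state` and `state_code`, the join on state name, and the sample rows are all assumptions, since the actual file layouts are not shown here.

```python
def fill_state_codes(companies, state_lookup):
    """Return copies of the company rows with blank state codes filled in."""
    filled = []
    for row in companies:
        row = dict(row)  # copy so the input rows are not mutated
        if not (row.get("state_code") or "").strip():
            # Fall back to the lookup table; stay blank if the state is unknown.
            row["state_code"] = state_lookup.get(row.get("state", ""), "")
        filled.append(row)
    return filled


# Hypothetical sample data standing in for state_code.csv and the company file.
state_lookup = {"California": "CA", "Texas": "TX"}
companies = [
    {"company_id": "1", "state": "California", "state_code": ""},
    {"company_id": "2", "state": "Texas", "state_code": "TX"},
    {"company_id": "3", "state": "Atlantis", "state_code": ""},
]

filled = fill_state_codes(companies, state_lookup)
```

In SSIS, a Lookup transformation against the state code file, followed by a Derived Column that keeps the existing code when it is present, is one common way to get the same effect.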


  1. Data transformations via SSIS

Please set up a data transformation process for the relevant source data tables (company.csv, state_code.csv, industry.csv, sector.csv, bankruptcy_1.csv, bankruptcy_2.csv) and output the resulting tables as CSV files (you need to submit the output CSV files for this section). Here is a brief description of the data:

  • Financial data related to the companies is summarized in files “bankruptcy_1.csv” and “bankruptcy_2.csv”.
  • Additional information about companies is available in “company.csv”.
  • State codes are given in “state_code.csv”.
  • The industry to which the companies are classified is stored in “industry.csv”.
  • A high-level industry sector classification is stored in “sector.csv”. A sector classifies different industries into a few industrial sectors. The sector code is derived from the first two digits of the industry code.
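
The sector rule in the last bullet (sector code = first two digits of the industry code) can be sketched as follows. The function name `sector_code` and the sample codes are hypothetical. The codes are assumed to be read as text, because reading them as integers would drop any leading zeros.

```python
def sector_code(industry_code):
    """Return the first two digits of an industry code as the sector code."""
    code = str(industry_code).strip()
    if len(code) < 2 or not code[:2].isdigit():
        raise ValueError(f"cannot derive sector code from {industry_code!r}")
    return code[:2]


# Hypothetical industry codes, kept as strings to preserve leading zeros.
industry_codes = ["3341", "5221", "0111"]
derived = [sector_code(c) for c in industry_codes]
```

In an SSIS data flow, the same derivation would typically be an expression in a Derived Column transformation, with the result used as the join key to the sector table.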

Please create a data transformation process that reads the data from the source CSV files and creates:

  1. Company Dimension table (contains all available attributes regarding the companies)
  2. Time Dimension 
  3. Fact table (contains all available measures with company dimension connecting to the fact table)
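
The relationship between the time dimension and the fact table can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the required SSIS solution: it assumes the two bankruptcy files share the same columns and can simply be stacked, that the time grain is a `year` column, and that `company_id` and `total_assets` exist. None of these names are confirmed by the brief.

```python
def build_time_dimension(rows):
    """Create one time-dimension row per distinct year, with a surrogate key."""
    years = sorted({row["year"] for row in rows})
    return [{"time_key": i, "year": y} for i, y in enumerate(years, start=1)]


def build_fact_table(rows, time_dim):
    """Replace the year on each row with the matching time-dimension key."""
    year_to_key = {t["year"]: t["time_key"] for t in time_dim}
    facts = []
    for row in rows:
        fact = {k: v for k, v in row.items() if k != "year"}
        fact["time_key"] = year_to_key[row["year"]]
        facts.append(fact)
    return facts


# Hypothetical rows standing in for bankruptcy_1.csv and bankruptcy_2.csv.
bankruptcy_1 = [
    {"company_id": "1", "year": 2001, "total_assets": 10.0},
    {"company_id": "2", "year": 2002, "total_assets": 20.0},
]
bankruptcy_2 = [
    {"company_id": "3", "year": 2001, "total_assets": 30.0},
]

combined = bankruptcy_1 + bankruptcy_2  # assumed row-wise stacking
time_dim = build_time_dimension(combined)
facts = build_fact_table(combined, time_dim)
```

Each fact row keeps `company_id` as the link to the company dimension and gains `time_key` as the link to the time dimension. If the two bankruptcy files instead hold different columns for the same rows, they would need to be joined rather than stacked.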

Please enable a data viewer on the path leading to each CSV export and provide a screenshot of the data you are going to export for a) the company dimension table, b) the time dimension, and c) the fact table. Each screenshot must show the “green ticks” and the “number of rows displayed”. A sample screenshot is shown below:

Note: When connecting to the CSV source files in SSIS, you might need to click “suggest data types” and increase the number of rows sampled so that the data types are detected correctly. Feel free to refer back to the tutorial exercise, and make sure you know how to check the data types and how to change them manually if needed.
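
Why the number of sampled rows matters can be shown with a toy example. This is not SSIS's actual detection algorithm, only an illustration of the general problem. The function `infer_type` and the sample column are hypothetical.

```python
def infer_type(values, sample_rows):
    """Guess a column type from the first `sample_rows` values only."""
    sample = values[:sample_rows]
    try:
        for value in sample:
            int(value)  # raises ValueError on non-numeric text
        return "integer"
    except ValueError:
        return "string"


# Hypothetical column whose non-numeric value appears late in the file.
column = ["100", "200", "300", "N/A"]

small_sample = infer_type(column, 3)  # only sees the numeric rows
large_sample = infer_type(column, 4)  # also sees "N/A"
```

With the small sample the column looks numeric, so a later row such as "N/A" would fail to convert. Sampling more rows lets the detection see such values before the type is fixed.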
