QBUS6600 Data Analytics for Business


BUSINESS SCHOOL


QBUS6600

Data Analytics for Business Capstone

Semester 2, 2024

Assignment 1 (individual assignment)

1. Key information

Required submissions:

• Written report (in PDF; due Monday, September 2, by the end of the day).

• Confidentiality Deed Poll online form (deadline for submission: August 19).

Submission instructions for the report will be posted on Canvas in Week 5.

Weight: 30% of your final grade.

Length: Your written report should have a maximum of 12 pages (single spaced, 11pt). Cover page, references, and appendix (if any) will not count towards the page limit. Please keep in mind that making good use of your audience’s time is an essential business skill: every sentence, table or figure should serve a purpose.

2. Problem description

Please start by reading through the Project Outline document for your industry project, which you can find on the 'Learn about our industry projects' page in the Week 1 module on Canvas. Focus on the Problem Description section of the Project Outline, especially the first and the third bullet points (EDA and Strategy), which are the most relevant bullet points for Assignment 1. Both your analysis and your recommendations should be in line with the requirements/suggestions provided in the Project Outline.

As a business analyst, you will conduct Exploratory Data Analysis (EDA) of the data corresponding to your industry project. You should aim to find or reveal all relevant properties, characteristics, patterns, and statistics hidden in the data, supporting your findings with insightful plots and relevant statistical output.

Use the results from your EDA to outline a preliminary strategy or provide preliminary recommendations to the management team corresponding to your selected industry project. You will have a chance to refine these recommendations in Assignment 2. Please refrain from extensive modelling and model selection – you will do them in Assignment 2. However, feel free to fit simple models (e.g., linear regression or logistic regression) for the purposes of EDA and understanding the relationships among the variables in the dataset.
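As an illustration of the kind of simple model meant here, the following minimal Python sketch fits a one-variable least-squares line using only the standard library. The function name, the variable names `ad_spend` and `weekly_sales`, and the data values are all made up for illustration; they are not taken from any industry project dataset.

```python
from statistics import mean


def fit_simple_linear_regression(x, y):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    # Sum of cross-deviations and sum of squared deviations of x.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope


# Hypothetical data, for illustration only.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
weekly_sales = [2.1, 3.9, 6.2, 7.8, 10.1]

intercept, slope = fit_simple_linear_regression(ad_spend, weekly_sales)
print(f"intercept = {intercept:.3f}, slope = {slope:.3f}")
```

A fit like this is meant only to describe the direction and rough strength of a relationship during EDA; it is not a substitute for the model selection reserved for Assignment 2.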

3. Written report

The purpose of the report is to describe, explain, and justify your findings to the management team corresponding to your selected industry project. You may assume that team members have training in business analytics; however, they are not experts in statistics or machine learning. The team’s time is important: please be concise and objective.

Suggested outline for the main parts of the report (further details below):

1. Problem formulation.

2. Data processing.

3. Exploratory Data Analysis (EDA).

4. Conclusions and preliminary recommendations.

You should consider breaking down the longer parts into smaller sections.

4. Marking Scheme

Business context and problem formulation.       5%
Data processing.                               30%
Exploratory Data Analysis (EDA).               45%
Conclusions and preliminary recommendations.   10%
Writing and presentation of the report.        10%
Total                                         100%

5. Rubric (basic requirements)

Business context and problem formulation. Your report gives a detailed description of the problem that is being investigated, providing the context and background for the analysis.

Data processing. You describe the data processing steps clearly and in sufficient detail, justifying and explaining your choices and decisions. You handle missing values and other data issues appropriately. You describe and explain your data transformations and/or your feature engineering process (if any). Your choices and decisions are justified by data analysis, domain knowledge, logic, and trial and error (if necessary).
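Before deciding how to handle missing values, it usually helps to quantify them per variable. The sketch below is one minimal way to do that with the standard library. The records, column names, and the set of tokens treated as missing are all hypothetical assumptions; the tokens in particular should be adapted to how missingness is actually encoded in your dataset.

```python
def missing_value_summary(rows, missing_tokens=("", "NA", None)):
    """Return {column: (missing_count, missing_share)} for a list of dict records."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts.setdefault(column, 0)
            # A value counts as missing if it equals one of the assumed tokens.
            if value in missing_tokens:
                counts[column] += 1
    n_rows = len(rows)
    return {column: (k, k / n_rows) for column, k in counts.items()}


# Hypothetical records, for illustration only.
records = [
    {"customer_id": 1, "age": 34, "region": "NSW"},
    {"customer_id": 2, "age": None, "region": "VIC"},
    {"customer_id": 3, "age": 51, "region": ""},
    {"customer_id": 4, "age": None, "region": "NA"},
]

summary = missing_value_summary(records)
for column, (count, share) in summary.items():
    print(f"{column}: {count} missing ({share:.0%})")
```

If you work in pandas, `DataFrame.isna().sum()` gives comparable per-column counts for values that pandas already treats as missing.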

Exploratory data analysis (EDA). Your report provides a comprehensive description of your EDA process, presenting selected results. Your analysis is sufficiently rich, and your visualizations are insightful. You study key variables and relationships among them using appropriate plots and descriptive statistics. You note any features of the data that may be relevant for model building in Assignment 2. You note the presence of outliers and any other anomalies that can affect the analysis. You explain the relevance of the EDA results to the underlying business problem and your subsequent recommendations. You clearly describe and justify the methods in your analysis. The choice of methods is logically related to the substantive problem, underlying theoretical knowledge, and data analysis. You interpret the statistical outputs that you provide. You report crucial assumptions and whether they are potentially violated.
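One common way to flag candidate outliers during EDA is the interquartile range (IQR) rule. The sketch below shows it under stated assumptions: the function name, the conventional multiplier `k = 1.5`, and the data are illustrative choices, not requirements of this assignment. Flagged values should be investigated, not removed automatically.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical order values, for illustration only.
order_values = [10, 12, 11, 13, 12, 11, 14, 95]
flagged = iqr_outliers(order_values)
print(flagged)
```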

Conclusions and recommendations. The reasoning from the analysis and results to your conclusions and recommendations is logical and convincing. Your conclusions and recommendations are written in plain language appropriate for a non-technical audience.

Writing. Your writing is concise, clear, precise, and free of grammatical and spelling errors. You use appropriate technical terminology. Your paragraphs and sentences follow a clear logic and are well connected. If you use an abbreviation or label, you define it first.

Report layout. Your report is well organised and professionally presented, as if it had been prepared for a client later in your career. There are clear divisions between sections and paragraphs.

Tables. Your tables are appropriately formatted and have a clear layout. The tables have informative row and column labels. The tables are relatively easy to understand on their own. The tables do not contain information which is irrelevant to the discussion in your report. The tables are placed near the relevant discussion in your report. There is no text around your tables, and your tables are not images.

Figures (plots). Your figures are easy to understand and have informative titles, captions, labels, and legends. The figures are well formatted and laid out. The figures are placed near the relevant discussion in your report. Your figures have appropriate definition and quality. There is no text around your figures, and your figures are not screenshots.

Numbers. All numerical results are reported to suitable precision (typically no more than three decimal places, in some cases fewer).
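When preparing tables from Python output, rounding can be applied at the formatting step instead of editing numbers by hand. The sketch below uses standard format specifiers; the variable names and values are hypothetical.

```python
# Hypothetical values, for illustration only.
estimate = 0.1234567
revenue = 1234567.891

# Three decimal places for a coefficient-like quantity.
estimate_text = f"{estimate:.3f}"
# No decimals, with thousands separators, for a large monetary amount.
revenue_text = f"{revenue:,.0f}"

print(estimate_text)
print(revenue_text)
```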

Referencing. You follow the University of Sydney referencing rules and guidelines.

Python code. The text of your report should be entirely free of Python code.

Note: you are strongly encouraged to use Python for all the steps of your data analysis. While there is no Python code submission for Assignment 1, you should keep your code well-organized, so that you can easily extend/modify/reuse this code for the purposes of Assignment 2 (which will have a Python code submission requirement).

6. Deductions

Marks may also be deducted from each item in the marking scheme in the following cases:

• The report is disorganised and/or has a poor layout.

• There is an excess of abbreviations or labels that the reader may be unfamiliar with.

• The report has an excessive number of grammatical or spelling mistakes.

• The tables are difficult to read, for example, due to poor layout or labelling.

• The figures are difficult to read, for example, due to poor layout or labelling.

• Numbers are not appropriately rounded.

7. Late submission of the report

Late submissions are subject to a deduction of 5% of the maximum mark for each calendar day after the due date. After ten calendar days late, a mark of zero will be awarded.
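The arithmetic of this rule can be sketched as follows. This is an illustration only, not the official calculation. It assumes that "after ten calendar days late" means more than ten days, that days late are counted as whole calendar days, and that a mark cannot fall below zero. The function name and the default maximum mark of 100 are also assumptions.

```python
def mark_after_late_penalty(raw_mark, days_late, max_mark=100.0):
    """Apply a deduction of 5% of max_mark per calendar day late (assumed reading)."""
    if days_late <= 0:
        return float(raw_mark)
    if days_late > 10:
        # Assumed interpretation: more than ten days late gives zero.
        return 0.0
    deduction = max_mark * 5 * days_late / 100
    # Assumed floor: the mark cannot become negative.
    return max(raw_mark - deduction, 0.0)


# Example: a report marked 80/100 and submitted two days late.
print(mark_after_late_penalty(80, 2))
```

Under these assumptions, a report marked 80 out of 100 and submitted two calendar days late would receive 80 - 2 × 5 = 70.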

8. Late submission of the Confidentiality Deed Poll online form

It is a requirement of our QBUS6600 unit that all students complete the Confidentiality Deed Poll online form before gaining access to the datasets for the industry projects. The datasets are highly confidential, and you have a responsibility to keep them secure and only use them for your QBUS6600 coursework. Submission of the Confidentiality Deed Poll online form after the August 19 deadline is subject to a penalty of 20% for Assignment 1. Furthermore, assignments without a submission of the online form will not be marked.
 
