Coursera SQL for Data Science - Notes

这篇博客详细记录了Coursera上的SQL for Data Science课程的学习笔记,涵盖了从选择和检索数据到过滤、排序、计算,再到子查询和联接,以及数据修改和分析等多个方面。通过随堂练习和代码测评,展示了如何使用SQL进行数据操作,包括从Chinook数据库中选取数据,并解决实际问题。
摘要由CSDN通过智能技术生成

SQL for Data Science - Notes

待补充ing

注:同一类编程题型只摘选1~2道做笔记;所有概念题型都不摘选哦,详见课堂讲义

Module 1:Select and Retrieve Data with SQL

随堂练习

  1. Rtrieve all the data from the tracks table
Select *
From Tracks;

DATA: all the data
TABLE: tracks table

  1. Return the playlist id, and name from the playlists table
Select Playlistid,
Name
From Playlists;

DATA: playlist id and name
TABLE: playlists table

  1. Select all the columns from the Playlist Track table and limit the results to 10 records
Select *
From Playlist_track 
Limit 10;

DATA: all the columns
TABLE: playlist track list
FILTER: limit 10 results

代码测评

很简单,不做记录了哈

Module 2:Filtering, Sorting, and Calculating Data with SQL

随堂练习

  1. Find the distinct values for the extended step from salary_range_by_job_classification
SELECT 
distinct Extended_step
From salary_range_by_job_classification

DATA: extended step
TABLE: salary_range_by_job_classification
FILTER: distinct

  1. Excluding $0.00, what is the minimum bi-weekly high rate of pay (please include the dollar sign and decimal point in your answer)
Select 
min(Biweekly_high_Rate)
From salary_range_by_job_classification
Where Biweekly_high_Rate <> '$0.00'

DATA: bi-weekly high rate of pay
CAL: minimum
TABLE: salary_range_by_job_classification
FILTER: Excluding $0.00,用到了<>符号,表示排除

  1. What is the pay type for all the job codes that start with ‘03’
Select
job_code,
pay_type
From salary_range_by_job_classification
Where job_code Like '03%'

DATA: pay type,job codes
TABLE: salary_range_by_job_classification
FILTER: start with ‘03’

  1. Run a query to find the Effective Date (eff_date) or Salary End Date (sal_end_date) for grade Q90H0
Select
grade,
eff_date,
sal_end_date
From salary_range_by_job_classification
Where grade = 'Q90H0'

DATA: Effective Date (eff_date),Salary End Date (sal_end_date) ,grade
p.s 凡是出现了的都要SELECT
TABLE: salary_range_by_job_classification
FILTER: for grade Q90H0

  1. What Step are Job Codes 0110-0400
SELECT 
Step
,Job_Code
From salary_range_by_job_classification
Where Job_Code Between '0110' AND '0400'

DATA: Step,Job Codes
p.s 凡是出现了的都要SELECT
TABLE: salary_range_by_job_classification
FILTER: Job Codes 0110-0400

  1. Sort the Biweekly low rate in ascending order
SELECT 
Biweekly_Low_Rate 
From salary_range_by_job_classification
Order by Biweekly_Low_Rate ASC

DATA: Biweekly low rate
TABLE: salary_range_by_job_classification
ORDER: in ascending order

  1. What is the Biweekly High Rate minus the Biweekly Low Rate for job Code 0170
SELECT 
(Biweekly_High_Rate - Biweekly_Low_Rate) AS Dif 
,Job_Code
From salary_range_by_job_classification
Where Job_Code='0170'

DATA: Biweekly High Rate,Biweekly Low Rate,Job Code
CAL: Biweekly High Rate minus the Biweekly Low Rate
TABLE: salary_range_by_job_classification
FILTER: job Code 0170

  1. What is the Extended Step for Pay Types M, H, and D
SELECT 
Extended_Step
,Pay_Type
From salary_range_by_job_classification
Where Pay_Type IN ('M', 'H', 'D')

DATA: Extended Step,Pay Types
TABLE: salary_range_by_job_classification
FILTER: Pay Types M, H, and D

  1. What is the step for Union Code 990 and a Set ID of SFMTA or COMMN
SELECT 
Step
,Union_Code
,SetID
From salary_range_by_job_classification
Where Union_Code = '990' AND (SetID = 'SFMTA' OR SetID = 'COMMN')

DATA: step,Union Code,Set I

R is one of the most popular, powerful data analytics languages and environments in use by data scientists. Actionable business data is often stored in Relational Database Management Systems (RDBMS), and one of the most widely used RDBMS is Microsoft SQL Server. Much more than a database server, it’s a rich ecostructure with advanced analytic capabilities. Microsoft SQL Server R Services combines these environments, allowing direct interaction between the data on the RDBMS and the R language, all while preserving the security and safety the RDBMS contains. In this book, you’ll learn how Microsoft has combined these two environments, how a data scientist can use this new capability, and practical, hands-on examples of using SQL Server R Services to create real-world solutions. How this book is organized This book breaks down into three primary sections: an introduction to the SQL Server R Services and SQL Server in general, a description and explanation of how a data scientist works in this new environment (useful, given that many data scientists work in “silos,” and this new way of working brings them in to the business development process), and practical, hands-on examples of working through real-world solutions. The reader can either review the examples, or work through them with the chapters. Who this book is for The intended audience for this book is technical—specifically, the data scientist—and is assumed to be familiar with the R language and environment. We do, however, introduce data science and the R language briefly, with many resources for the reader to go learn those disciplines, as well, which puts this book within the reach of database administrators, developers, and other data professionals. Although we do not cover the totality of SQL Server in this book, references are provided and some concepts are explained in case you are not familiar with SQL Server, as is often the case with data scientists.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值