INFO20003 2021S2 WEEK1
L01 Introduction
Data and Information
Data
- known facts stored and recorded
- can include: text, numbers, dates, plus images, sound,video, and other complex objects
Information
- Data presented in context (can be summarised data)
- Data that has been processed increasing the users knowledge
Data vs. Information
- Data is known and available; Information is processed and more useful
Metadata (Not examinable)
- data about data (can include: structure, rules, constraints)
- Consistency and Meaning
- 个人认为是对data的一种解释(2021.8.3)
Database and Database Management System (DBMS)
Database
- a large, integrated, structured collection of data
- EX: a university
- Entities… such as courses, students, professors
- Relationships… such as enrollment, teaching
Database Management System (DBMS)
- a software system designed to store, manage, and facilitate access to databases
File Processing Systems vs. DBMS
- Program-data dependence vs. Data independence
(在FPS中,文件结构或者数据结构的改变会影响到整个program,而在DBMS中,文件和program是各自独立的,由中央管理) - Duplication of data vs. Minimal data redundancy
(FPS中文件需要多次复制,在DBMS中被统一管理) - Limited data sharing vs. Improved data sharing
(FPS中data tied to application, DBMS中data is shared and not a necessity for an application / external users can be allowed access / multiple views of data, arbitrary views of data) - Excessive program maintenance vs. Reduced program maintenance
- DBMS only (Improved data consistency)
L02 Database Development Lifecycle
Cycle
- database planning
Planning how to do the project, how does the enterprise work(outside scope of the course(银行,大学由什么组成?) - Specfying Definition
specifying scope and boundaries(Users, Application areas) and How does the system interfere with other organisational systems(定义系统,面向的使用者,应用等,以及与其他系统的交互) - Requirements Definition and Analysis
Collection and analysis of requirements for the new system(在题目中找到要求) - Conceptual Design
Construction of a model of the data used in the database –independent of all physical considerations(建立ER Model,independent of a specific DB/DBMS) - Logical Design
Construction of a (relational) model of the data based on the conceptual design, independent of a specific database and other physical considerations. - Physical Design
A description of the implementation of the logical design – for a specificDBMS and Describes (Basic relations (data types), File organisation and indexes) (选择datatype) - Application Design
Design of the interface and application programs that use and process the database (设置与应用的交互) - Implementation
The physical realisation of the database and Implementation of the design - Data Conversion and Loading
Transfer existing data into the database - Conversion from old systems- Non trivial task (数据转化) - Testing
Running the database to find errors in the design / setup (both at a physical level and at a logical level), other issues also (performance, robustness, recoverability and adaptability) - Operational Maintenance
The process of monitoring and maintaining the database system following its commissioning, Handling new requirements and Handling changes to requirements
Other Physical Design Decisions
Look Up (Currency Code 的例子 冥币)
- Trade off between speed and space (and possibly integrity of data)
- Data field integrity (ensure fields only contain correct data)
- Handling missing data (concept of NULL data)
De-Normalise or Not
- Normalise
A formal method used to validate and improve upon the logical design thus far (which attributes should be grouped together), before proceeding with the physical design. - De-Normalise
At physical design time need to decide how to implement the design –including removing some of the normalisation steps…Benefits is improved database performance but might wasted storage space