50.043 Database Systems and Big Data Course Handout¶
This page will be updated regularly. Sync up often.¶
Course Description¶
Database systems manage data, which is at the heart of modern computing applications. This course covers the fundamentals of traditional databases, such as Oracle and MySQL, and the core ideas of recent big data systems.
Students will learn the important data management problems that these systems are designed to solve. They will explore the internal design and implementation of relational databases. They will also study the internals of a state-of-the-art big data platform, namely Apache Spark, and use it on the Amazon cloud (Amazon Web Services). By the end of the course, students will be able to assess the advantages and limitations of different database systems.
Resources¶
The main resources are lecture slides, tutorial sessions, and online documentation. There are no official textbooks, but the following are useful for reference and for a deeper understanding of some topics.
- Abraham Silberschatz, Henry Korth, S. Sudarshan. Database System Concepts. 6th edition. (DSC)
- Raghu Ramakrishnan, Johannes Gehrke. Database Management Systems. 3rd edition. (DBM)
- Hector Garcia-Molina, Jeffrey D. Ullman, Jennifer Widom. Database Systems: The Complete Book. 2nd edition. (DS)
Instructors¶
- Kenny Lu (kenny_lu@sutd.edu.sg) Office Hour:
- Yanxia Qin (yanxia_qin@sutd.edu.sg) Office Hour: Friday 2:00-3:00 pm (please send email to arrange)
TAs¶
- Nguyen Quoc Dung, Tristan (dungquoc_nguyen@mymail.sutd.edu.sg)
- Finn Yap Jun Hun (finn_yap@mymail.sutd.edu.sg)
Communication¶
If you have course-, assignment-, or project-related questions, please post them on the dedicated MS Teams channel.
Grading¶
Your final grade is computed as follows:
- Homework: 10%. There will be 2 homework assignments, 5 points each.
- Project: 40%. Group project, up to 3 members per group. Unless the instructors are notified otherwise, all group members receive the same grade for the project.
- Class participation: 3%. Attend at least 80% of all lessons to get full marks. Ask/answer questions during classes, spot mistakes, etc.
- Mid-term quiz: 5%. Held during class before Week 6. The format is similar to the final exam.
- Final exam: 40%. Written exam in Week 14 covering all topics.
- Course survey: 2%.
Things you need to prepare¶
- If you are using Windows 10 or Windows 11, please install Ubuntu on the Windows Subsystem for Linux (WSL).
- If you are using Linux, no extra setup is needed.
- If you are using Mac, please install Homebrew.
- Make sure Java >8 and ant are installed.
  - Ubuntu: `sudo apt install ant ant-contrib`
  - Mac: `brew install ant ant-contrib`
- When you receive the AWS Educate invitation email, please work on the AWS Academy setup.
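Once the tools are installed, a quick check along the following lines can confirm that they are visible on your `PATH`. This is only a minimal sketch: `check_tool` is a hypothetical helper name chosen for illustration and is not part of the course materials. It relies only on the standard `java -version` and `ant -version` commands.

```shell
# Hypothetical helper (not from the course materials):
# report whether a command is available on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
    return 1
  fi
}

# Print the installed versions only when the tool is present.
# The handout asks for Java >8, so compare the reported version yourself.
check_tool java && java -version
check_tool ant && ant -version
```

If either tool is reported as missing, rerun the install command for your platform and open a new terminal before checking again.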
Project¶
Please refer to the project page.
Submission Policy and Plagiarism¶
- You will do the assignment/project on your own (or within your own team) and will not copy and paste solutions from anyone else.
- You will not post any solutions related to this course to a private/public repository that is accessible by the public/others.
- Students are allowed to have a private repository for their assignment which no one can access.
- For projects, students can only invite their partners as collaborators to a private repository.
- Failing to follow the Code of Honour will result in failing the course and/or being submitted to the University Disciplinary Committee. The consequences apply to both the person who shares their work and the person who copies the work.
- You must explicitly disclose how AI tools are used, and be responsible for your submissions.
Schedule (26 January 2026 - 30 April 2026)¶
| Week (MM/DD) | Lecture | Cohort | Reference | Remarks |
|---|---|---|---|---|
| 1 (1/26) | Intro, ER Model | ER Model | DBM: Chapter 1-2, DSC: Chapter 7 | |
| 2 (2/2) | Relational Model, Relational Algebra | Relational Model, Relational Algebra | DBM: Chapter 3-4, DSC: Chapter 2 & 6 | |
| 3 (2/9) | SQL, NoSQL | SQL | DBM: Chapter 5, DSC: Chapter 2-4 | Project Team Submission (2/13 23:59) |
| 4 (2/16) | Functional Dependency, Normal Forms | Functional Dependency, Normal Forms | DBM: Chapter 19, DSC: Chapter 8 | Lec2 cancelled due to CNY |
| 5 (2/23) | Storage, Index | Storage, Index | DBM: Chapter 8-10, DSC: Chapter 10-11 | Assignment 1 Submission (3/2 23:59) |
| 6 (3/2) | Query Operations | Query Operations | DBM: Chapter 12-14, DSC: Chapter 12 | Project Lab 1 Submission (3/6 23:59) |
| 7 (3/9) | Recess Week | Self-study Flintrock and Spark cluster setup (video tutorial) | | |
| 8 (3/16) | Query Optimization | Query Optimization | DBM: Chapter 15, DSC: Chapter 13 | Project Lab 2 Submission (3/20 23:59) |
| 9 (3/23) | Transaction Recovery and Concurrency | Transactions | DBM: Chapter 16-18, DSC: Chapter 14-16 | |
| 10 (3/30) | HDFS, MapReduce | HDFS, MapReduce | | Project Lab 3 Submission (4/3 23:59). Friday's cohort cancelled due to Good Friday |
| 11 (4/6) | Spark | Spark | | |
| 12 (4/13) | YARN | Spark 2 | | Assignment 2 Submission (4/17 23:59) |
| 13 (4/20) | Revision week | | | Project Lab 4 Submission (4/24 23:59) |
| 14 (4/27) | Exam week | | | |
Make Up and Alternative Assessment¶
A make-up for the final exam will be administered when there is an official Leave of Absence from OSA. There will be only one make-up session, and students who miss it will not be offered another.