Team Project
Learning Goals
Throughout the course, we have touched upon various phases of the data preparation and workflow management pipeline. In the team project, we bring together all that you have learnt.
Together with your team members, you will
- set up a reproducible workflow on GitHub,
- apply the end-to-end Git workflow for versioning files, and manage your project using GitHub Issues, and
- build and automate a data preparation and analysis pipeline from scratch.
Working on your team project is not only a great refresher on the course content; it also gives you a better sense of the added value of Git and GitHub when collaborating with one another, and of automation with make when a project has many source code files.
For inspiration, explore past team projects like Airbnb Price Calculator, COVID-19 and Length of Stay, and Movie Genre IMDb Analysis. Use these as a starting point, keeping in mind that grading criteria may have changed and these examples are not flawless.
This project may be different from other projects that you have worked on during your studies. In particular, the purpose of this project is not to write an academic research paper, but to focus on the infrastructure of working on an academic research paper. For more insights, check the grading guidelines.
Getting started
- Data
- Kickstart your team project with these data sets.
- Workplan and coaching
- What to do, when? Check out the workplan!
- Grading
- Find out how you'll be graded - and optimize your efforts accordingly!
- More resources
- Discover example projects and code snippets to use for your team projects.
Organization
Coaching sessions
During the course, you will have the opportunity to meet up with the course instructor for coaching sessions. These sessions are meant for you to receive feedback on your ideas and code. Frequently, this also entails problem-solving & debugging.
Participation: All teams attend the full session. Teams typically collaborate on their projects while the course instructor provides support (in-person by walking around or via Zoom breakout rooms).
Session Format:
- First Half: Each team gets 5–10 minutes to provide a progress update and seek assistance from the coach.
- Second Half: Time is allocated on a needs basis to address specific issues or questions raised by teams.
Deliverables: Most coaching sessions will help the team work on some deliverables, which are always due before the next coaching session. Please refer to Canvas for exact due dates each week.
Team composition
- 4-5 students per team
- Allocation in the first course week
- Enroll your team in the template repository on GitHub Classroom (https://classroom.github.com/a/K4FxS25n)
Deadline & submission
- The submitted repository on GitHub Classroom is the team project that will be graded.
- Deadline: Monday, 24 March 2025 (23:59)
Where do I submit my report?
Gotcha! There is no report! The project should document itself (e.g., comments in code, makefile), plus you’ll have an amazing README that ties everything together and motivates your project. Make this one shine! :)
Note that we will check out the state of the repository at the deadline date and time, so any changes you make to the repository afterwards are not considered for grading.