This course provides a unique opportunity for students in the MS in Data Science program to apply their knowledge of the foundations, theory and methods of data science to address data driven problems in industry, research, government and the non-profit sector. The course activities focus on a semester-long project sponsored by an affiliate company or a Columbia faculty member. The project synthesizes the statistical, computational, engineering and social challenges involved in solving complex real-world problems. The course has a well developed Ethics component supported by Dr. Savannah Thais.
Select a team captain (with or without help from mentor/instructor/supervisor)
Record your names here in this format-
- Team captain Tina Cao, tc3334
- Member Lu Liu, ll3721
- Member Wanxin Luo, wl2930
- Member Xinwei Qiao, xq2236
- Member Yao Xie, yx2845
The CourseInfo folder has the templates for your reports, progress log, meeting minutes with your mentors. These are the deliverables you need to save as .pdf files and upload in this repository. Additionally the folder also contains sample meeting presentations and tips, report grading rubrics, student-mentor email templates and syllabus for your reference.
- Regularly work on developing your code, provide repository access to your industry mentor/instructor
- Update your project task status weekly in our progress log and github project board.
- Record your progress in the reports.
- Employ a mechanism to select weekly presenter at the mentor meetings
- Note down the meeting minutes on a weekly basis
- Code
- Reports- Midterm Progress Report, Final Report, Ethics Report
- Progress Log
- Meeting Minutes
The code can be placed in a folder named code, and the remaining files can be placed as .pdf files in the root directory.
Instructions:
1. Code Folders: Code (containing Schema, Graph Database, Final Chatbot, Automatic Testing)
2. Reports Folder: Reports (containing Midterm Progress Report, Final Report, Ethics Presentation Slides and Video)
3. Final Presentation Folder: Presentation (final presentation for KPMG)
4. Poster Folder: Poster
5. Weekly Meeting Presentation Folder: Weekly Meeting Presentation
6. Meeting Minutes: Weekly on Wednesday for 60 minutes
You can use this link https://final-kpmg-chatbot.streamlit.app/ to test our chatbot.
If you want to run our code, please remember to use your own GPT API.
(Note: Our Chatbot might give an openai.AuthenticationError if our API is leaked or we finished our API quota.)