Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create memo.md #1

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions memo.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
Our team’s progress is delayed due to the previous issue of accessing confidential data on Langone server. However, we still managed to complete some preliminary works. Based on the EHR datasets, we finished some elementary data cleaning, including dropping invalid candidates, filling in NAs. Also the team has spent great efforts in basic feature engineering. We normalized features into a scale of [0, 1], and built some handcrafted features from original ones based on logical relationship such as aggregation and binning. Finally, as the previous work suggests, we tested several simple models such as Lasso regression, and the current result is comparable to that of the research paper proposed by our collaborators.

In the next few weeks, the team will focus on feature engineering and potentially beat the best model in the research paper. Specifically, we will generate meaningful indicators with medical knowledge in mind. For example, weight*height may be helpful to identify the interaction between weights and heights. In addition, the team will try other models such as random forests in order to have a thorough comparison between models.