Modified from our original combined issue
- Feedback from Milestone 2 was to stop specifying the `latest` tag in the docker-compose file. This is addressed in more detail in this Issue
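One way to keep this from regressing is a small check that flags unpinned images. The following is only a sketch under assumptions: `find_unpinned_images` is a hypothetical helper, the image names are invented, and it scans the compose file as plain text rather than parsing the YAML.

```python
import re

# Matches lines such as "    image: example/app:1.2.3" (quotes optional).
IMAGE_LINE = re.compile(r"^\s*image:\s*['\"]?(?P<ref>[^'\"\s#]+)")


def find_unpinned_images(compose_text):
    """Return image references that use ':latest' or carry no tag at all."""
    unpinned = []
    for line in compose_text.splitlines():
        match = IMAGE_LINE.match(line)
        if not match:
            continue
        ref = match.group("ref")
        # Digest-pinned images (name@sha256:...) count as pinned.
        if "@" in ref:
            continue
        # Look for the tag only after the last '/', so a registry port
        # such as localhost:5000/name is not mistaken for a tag.
        name = ref.rsplit("/", 1)[-1]
        tag = name.split(":", 1)[1] if ":" in name else None
        if tag is None or tag == "latest":
            unpinned.append(ref)
    return unpinned


# Hypothetical compose content, for illustration only.
EXAMPLE = """\
services:
  app:
    image: example/app:latest
  db:
    image: example/db:1.2.3
"""

print(find_unpinned_images(EXAMPLE))
```

An image with no tag is flagged as well, since Docker resolves a missing tag to `latest`.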
In general, all feedback from the Milestones and most feedback from peer review was addressed with fixes. For the few peer review suggestions we did not follow, we explained our rationale instead. Each of these explanations has its own GitHub Issue, linked from the corresponding bullet point below.
Note: Some fixes may have been addressed together in one big PR
- Ensure EDA is conducted exclusively on the training dataset to prevent data leakage.
  - Addressed by:
- Split the quarto render commands into separate lines for clarity:

      quarto render heart_disease_predictor_report.qmd --to html
      quarto render heart_disease_predictor_report.qmd --to pdf
- Ensure consistent script naming (e.g., some scripts use `_heart_disease_predictor`, others do not).
- Move helper functions into separate modules for better organization.
  - Addressed by:
- Modularize functions to improve code clarity and reusability, and abstract code from the `main()` function into separate functions to improve structure and readability.
- Unit Testing: Adding unit tests for each validation function would ensure the correctness of the logic when applied to different datasets. It can help catch edge cases and confirm that each function works as expected.
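As a rough illustration of several items above (EDA on the training split only, helpers moved out of `main()`, and unit tests for those helpers), here is a minimal Python sketch. It is not the project's actual code: `split_rows` and `summarize_column` are hypothetical names, the rows are made up, and the standard library stands in for whatever data tooling the real scripts use.

```python
import random
import statistics


def split_rows(rows, test_fraction=0.3, seed=123):
    """Shuffle a copy of rows and return (train, test) lists."""
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def summarize_column(train_rows, column):
    """Compute simple EDA statistics; callers pass the training rows only."""
    values = [row[column] for row in train_rows]
    return {"n": len(values), "mean": statistics.mean(values)}


# Hypothetical data used by the tests below.
ROWS = [{"id": i, "age": 40 + i} for i in range(10)]


def test_split_is_disjoint_and_complete():
    train, test = split_rows(ROWS, test_fraction=0.3)
    train_ids = {r["id"] for r in train}
    test_ids = {r["id"] for r in test}
    assert len(test) == 3
    assert train_ids.isdisjoint(test_ids)
    assert train_ids | test_ids == set(range(10))


def test_invalid_fraction_raises():
    try:
        split_rows(ROWS, test_fraction=1.5)
    except ValueError:
        return
    raise AssertionError("expected ValueError for test_fraction=1.5")


def test_summary_uses_training_rows_only():
    train, _ = split_rows(ROWS)
    assert summarize_column(train, "age")["n"] == len(train)
```

The test functions use plain `assert` statements and the `test_` prefix, so a runner such as pytest could collect them, assuming the file name matches its discovery pattern. Keeping the split and the summary in separate functions is what makes each one testable without running `main()`.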
The six items below are all addressed in this PR
- Add proper email to code of conduct
- Remove the `.DS_Store` file
- Discuss importance/limitation of findings in Abstract/summary
- Clearly identify question in introduction
- Add all references to the dataset in Introduction
- More clearly define target/response variable in Introduction
The three items below are all addressed in this PR
- Important methodology descriptions missing (e.g., did not explain in narrative what metric was being used for model parameter optimization)
- Some important results not displayed
- Findings from project need to be linked back to application domain and questions
- Move all environment files into a separate folder to tidy up the repository. #72
- Remove redundant lock files
- Organize files in the `report` folder into subfolders for better structure.