Using real CERN partcile accelerator data, we implemented a machine learning model able to detect the collision decay signature of a Higgs particle, in order to recreate the process that led to the discovery of the Higgs Boson.
In this section we explain how our project folder is organised and where to find the needed files.
Please import the train.csv and the test.csv in this folder.
All the CERN particle accelerator data as well as our predictions are found in the /data folder. It consists of:
- submission_ridge_reg.csv: file containing our predictions when the run.py code is executed
- train.csv : train set data (imported by the user)
- test.csv: test set data (imported by the user)
All our project implementation can be found within the /scripts folder.
Python Executables .py
- run: gives us our best accuracy score on Aicrowd
- implementations: contains all the required functions
- costs: loss and accuracy functions
- cross_validation: all cross validation methods that we used in our process
- data_preprocessing : function that help us preprocess the data
Jupyter Notebooks
- project_1: runs all different cross-validations for all methods, to obtain an accuracy for each method
- datapreprocessing_plot: creates all graphs used for the preprocessing part of the report
In this section we explain how to run the code that provided our best submission on the Aicrowd EPFL Higgs Challenge.
- Download our .zip project folder and exctract it
- Open your terminal and from the script folder execute the following command
python3 run.py
- After execution, the obtained predictions are available in the sample-submission.csv file
- Upload this file on the Aicrowd Higgs Challenge