Skip to content

Commit

Permalink
GSSOC'23 : Volcanic Eruption Prediction (#622)
Browse files Browse the repository at this point in the history
* Model building done

* output added

* Readme created

* Update Readme.md

* Update README.md

* Rename Untitled.ipynb to algorithm_dataset.ipynb

* Update README.md

---------

Co-authored-by: Khushee Kapoor <[email protected]>
  • Loading branch information
shashank1623 and khusheekapoor authored Aug 9, 2023
1 parent 511dc5c commit 8a4cd14
Show file tree
Hide file tree
Showing 4 changed files with 15,629 additions and 2 deletions.
4 changes: 2 additions & 2 deletions Machine Learning/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -270,11 +270,11 @@
- [Airline 128k Reviews (Logistic Regression Model)](https://github.com/sahaycodes/MindWave/blob/airline/Machine%20Learning/Airline%20128K%20Reviews/Airline-128K%20Reviews%20(1).ipynb)

- [Amazon Product length prdiction](/Amazon_product_length_prediction/nlp-with_regression.ipynb.ipynb)
- [Volcanic-Eruption-Prediction(RandomForestRegressor)](/Volcanic-Eruption-Prediction/algorithm_dataset.ipynb)

- [Credit Card Fraud Detection - SMOTE ](https://github.com/brahmamyv/MindWave/tree/ccf/Machine%20Learning/Credit%20Card%20Fraud%20Detection-SMOTE)



## Implementation Ideas

- Naive Bayes Classification
Expand All @@ -294,4 +294,4 @@
- Cross Validation Techniques
- K fold CV technique
- Passive Agressive Classifier

- RandomForestRegressor
29 changes: 29 additions & 0 deletions Machine Learning/Volcanic-Eruption-Prediction/Readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
## Description
What if scientists could anticipate volcanic eruptions as they predict the weather? While determining rain or shine days in advance is more difficult, weather reports become more accurate on shorter time scales. A similar approach with volcanoes could make a big impact. Just one unforeseen eruption can result in tens of thousands of lives lost. If scientists could reliably predict when a volcano will next erupt, evacuations could be more timely and the damage mitigated.

Currently, scientists often identify “time to eruption” by surveying volcanic tremors from seismic signals. In some volcanoes, this intensifies as volcanoes awaken and prepare to erupt. Unfortunately, patterns of seismicity are difficult to interpret. In very active volcanoes, current approaches predict eruptions some minutes in advance, but they usually fail at longer-term predictions.

Enter Italy's Istituto Nazionale di Geofisica e Vulcanologia (INGV), with its focus on geophysics and volcanology. The INGV's main objective is to contribute to the understanding of the Earth's system while mitigating the associated risks. Tasked with the 24-hour monitoring of seismicity and active volcano activity across the country, the INGV seeks to find the earliest detectable precursors that provide information about the timing of future volcanic eruptions.

In this competition, using your data science skills, you’ll predict when a volcano's next eruption will occur. You'll analyze a large geophysical dataset collected by sensors deployed on active volcanoes. If successful, your algorithms will identify signatures in seismic waveforms that characterize the development of an eruption.

With enough notice, areas around a volcano can be safely evacuated prior to their destruction. Seismic activity is a good indicator of an impending eruption, but earlier precursors must be identified to improve longer-term predictability. The impact of your participation could be felt worldwide with tens of thousands of lives saved by more predictable volcanic ruptures and earlier evacuations.

## Data

Dataset Description
Detecting volcanic eruptions before they happen is an important problem that has historically proven to be a very difficult. This competition provides you with readings from several seismic sensors around a volcano and challenges you to estimate how long it will be until the next eruption. The data represent a classic signal processing setup that has resisted traditional methods.

Identifying the exact sensors may be possible but would not be in the spirit of the competition nor further the scientific objectives. Please respect the importance of the problem and the time invested by the researchers at INGV in making this problem available by not seeking more metadata or information that would be unavailable in a real prediction context.

Files
train.csv Metadata for the train files.

segment_id: ID code for the data segment. Matches the name of the associated data file.

time_to_eruption: The target value, the time until the next eruption.

[train|test]/*.csv: the data files. Each file contains ten minutes of logs from ten different sensors arrayed around a volcano. The readings have been normalized within each segment, in part to ensure that the readings fall within the range of int16 values. If you are using the Pandas library you may find that you still need to load the data as float32 due to the presence of some nulls.


dataset link :- https://www.kaggle.com/competitions/predict-volcanic-eruptions-ingv-oe/data
Loading

0 comments on commit 8a4cd14

Please sign in to comment.