
geo-smart/multi-gpr-soil


multi-gpr-soil

Multi-target Gaussian Process Regression for Three-Dimensional Mapping: a geology test case in Seattle, WA

The purpose of this project is to develop a multi-target Gaussian Process Regression (GPR) model that predicts engineering soil classification and soil layering in three dimensions. We will use a dataset of past subsurface investigations from Seattle, WA, to train and test the model. The project offers opportunities to explore Gaussian processes, multi-target regression, machine learning model training and evaluation, two- and three-dimensional geospatial data management and visualization, Dask parallelization, and more. Enthusiastic team members of all experience and skill levels are welcome to join this project!

Collaborators

| Name | Personal goals | Can help with | Role |
| --- | --- | --- | --- |
| Morgan S. | I want to work towards three-dimensional ML predictions | I can help with understanding our dataset, methods, programming in Python | Project Lead |
| ... | ... | ... | ... |
| ... | ... | ... | ... |

The problem

Subsurface geological data is essential for a variety of engineering and environmental applications, including site characterization, geotechnical engineering, and groundwater modeling. In dense urban areas such as Seattle, past projects have produced many subsurface investigations that could be leveraged to inform conceptual infrastructure design and urban planning, augment modern subsurface investigations, and more. Traditional methods for site soil classification and layering in three dimensions involve manual interpretation of borehole logs and other available surface and subsurface data. Machine learning methods, such as Gaussian Process Regression, offer a way to automate this process and provide predictions at a higher resolution than traditional methods. In this project, we will develop a multi-target Gaussian Process Regression model to predict engineering soil classification and soil layering in three dimensions.

Data and Methods

Data

The Washington State Department of Natural Resources (DNR) maintains a database of geologic data for the state, including geology maps, lidar surveys, subsurface investigations, and more. This data is accessible through an interactive web application called the Geologic Information Portal and is available for download. For this project, we will focus on the borehole logs within the subsurface investigations dataset. The borehole logs record the soil layers and their engineering soil classification at various depths, and we will use the USCS soil classification data to train and test our model. Depending on time and computing constraints, we may limit model implementation to a small region around the University of Washington.
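As an illustration of how layered borehole logs could be turned into model inputs, the sketch below samples each layer at regular depths and one-hot encodes the USCS symbol so that every class becomes one regression target. The layer tuples, the `layers_to_points` and `one_hot` helpers, and the 1.0 sampling spacing are all hypothetical; the actual DNR export format and the project's preprocessing may differ.

```python
import numpy as np

# Hypothetical borehole layers: (x, y, top_depth, bottom_depth, USCS symbol).
# These values are made up for illustration and are not from the DNR dataset.
layers = [
    (0.0, 0.0, 0.0, 2.0, "SM"),
    (0.0, 0.0, 2.0, 5.0, "CL"),
    (10.0, 5.0, 0.0, 3.0, "SP"),
    (10.0, 5.0, 3.0, 4.0, "SM"),
]


def layers_to_points(layers, spacing=1.0):
    """Sample each layer at regular depths, offset half a spacing below its top."""
    coords, labels = [], []
    for x, y, top, bottom, uscs in layers:
        for z in np.arange(top + spacing / 2.0, bottom, spacing):
            coords.append((x, y, z))
            labels.append(uscs)
    return np.array(coords), labels


def one_hot(labels):
    """Encode class labels as one column per class (one regression target each)."""
    classes = sorted(set(labels))
    index = {c: i for i, c in enumerate(classes)}
    Y = np.zeros((len(labels), len(classes)))
    for row, label in enumerate(labels):
        Y[row, index[label]] = 1.0
    return Y, classes


X, labels = layers_to_points(layers)  # X holds (x, y, depth) per sample
Y, classes = one_hot(labels)          # Y holds one column per USCS class
```

Each row of `X` is a three-dimensional location and each row of `Y` marks the class observed there, which is one possible way to frame soil classification as a multi-target regression problem.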

Existing methods

Traditional methods for predicting soil classification and layering in three dimensions involve manual interpretation of borehole logs and geophysical surveys. Machine learning methods, such as Gaussian Process Regression, offer a way to automate this process and provide predictions at a higher resolution than traditional methods. Gaussian processes are non-parametric models that can be used for both regression and classification tasks, and they are particularly useful for modeling spatial data. In this project, we will use a multi-target Gaussian Process Regression model to predict soil classification and layering in three dimensions.
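To make the idea concrete, here is a minimal sketch of multi-target GPR written directly in NumPy. It assumes a zero-mean prior, a squared-exponential kernel shared by all targets, fixed hyperparameters, and synthetic two-layer data; none of these choices come from the project, and a real implementation would more likely use a dedicated GP library and fit the hyperparameters. The function names `rbf_kernel` and `gpr_predict_mean` are illustrative only.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale**2)


def gpr_predict_mean(X_train, Y_train, X_new, length_scale=1.0, noise=1e-2):
    """Posterior mean of a zero-mean GP; each column of Y_train is one target."""
    K = rbf_kernel(X_train, X_train, length_scale) + noise * np.eye(len(X_train))
    K_star = rbf_kernel(X_new, X_train, length_scale)
    # Solve K @ alpha = Y_train through a Cholesky factorization for stability.
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, Y_train))
    return K_star @ alpha


# Synthetic data (an assumption for illustration): class 0 above depth 2,
# class 1 below, observed on a regular grid of (x, y, depth) points.
xs, ys, zs = np.meshgrid(np.arange(5.0), np.arange(5.0), np.arange(0.5, 4.0, 1.0))
X_train = np.column_stack([xs.ravel(), ys.ravel(), zs.ravel()])
class_ids = (X_train[:, 2] > 2.0).astype(int)
Y_train = np.eye(2)[class_ids]  # one-hot targets, one column per class

# Predict at one shallow and one deep location, then pick the strongest target.
X_new = np.array([[2.5, 1.5, 0.5], [2.5, 1.5, 3.5]])
mean = gpr_predict_mean(X_train, Y_train, X_new)
predicted_class = mean.argmax(axis=1)
```

Treating the targets as independent with a shared kernel keeps the sketch short, since one linear solve serves every target; modeling correlations between targets would need a richer kernel.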

Proposed methods/tools

We will train a multi-target Gaussian Process Regression model for the city of Seattle and publish it so that others can use it.

We will create a model card for the machine learning model used in this project in accordance with the Hugging Face model card standard.

Optional exploratory and background information for reference

As time and interest allow, you may explore the following resources in preparation for (and/or during) the hackweek:

Project goals and tasks

Project goals

TBD!!

Tasks

TBD!!

Project Results

TBD!!

Files and folders in this project repository

  • contributors/
    Each team member can create their own folder under contributors, within which they can work on their own scripts, notebooks, and other files. Having a dedicated folder for each person helps to prevent conflicts when merging with the main branch. This is a good place for team members to start off exploring data and methods for the project.
  • notebooks/
    Notebooks that are considered delivered results for the project should go in here.
  • scripts/
    Code that is shared by the team should go in here (e.g. functions or subroutines). These will be files other than Jupyter Notebooks such as Python scripts (.py).
  • .gitignore
    This file lists the files that git will ignore across the project (e.g. temporary files or large data files).
  • environment.yml
    Conda environment description needed to run this project.
  • README.md
    Description of the project (see suggested headings below)
  • model-card.md
    Description (following a metadata standard) of any machine learning models used in the project
