Machine Learning Analysis of Borehole Data for Geotechnical Insights

Research output: Contribution to journalArticlepeer-review

Abstract

This paper explores the use of machine learning (ML) to analyze borehole data aiming to enhance geotechnical insights, using the Gaza Strip as a case study. The data set consists of 632 boreholes, with features including spatial coordinates, ground level, and soil type per depth. A random forest (RF) classification model was applied to predict soil types, achieving an accuracy of approximately 75%. Notably, the model retained this accuracy even when the data set size was reduced to 30%, suggesting predictable subsurface conditions over large areas. A comparative analysis of common misclassifications revealed that errors mostly occurred between similar soil types, indicating the model’s ability to capture meaningful geological patterns. Unsupervised learning using k-means clustering revealed no clear-cut boundaries between clusters, indicating localized geological anomalies despite large-scale predictability. These findings align with the demonstrated stability of the Gaza Tunnel Network (GTN), a vast network of tunnels which was constructed without comprehensive site investigations. This study demonstrates the potential of ML to improve geotechnical assessments and suggests that fewer boreholes may be needed for large-scale projects, offering cost-saving opportunities. For future research, it is recommended to integrate advanced ML tools, including large language models (LLMs) for analyzing qualitative data from borehole logs, and interpretability methods to enhance model explainability, thus enhancing geological understanding and increasing predictive power.

Original languageEnglish
Pages (from-to)1175-1188
Number of pages14
JournalGeotechnics
Volume4
Issue number4
DOIs
StatePublished - Dec 2024

Keywords

  • boreholes
  • coastal geology
  • machine learning
  • site investigation
  • tunnel stability

Fingerprint

Dive into the research topics of 'Machine Learning Analysis of Borehole Data for Geotechnical Insights'. Together they form a unique fingerprint.

Cite this