Soil Classification System from Cone Penetration Test Data Applying Distance-Based Machine Learning Algorithms

L.O. Carvalho, D.B. Ribeiro

Article

Soils and Rocks, São Paulo, 42(2): 167-178, May-August, 2019 | PDF


Abstract

Most work from the literature dedicated to soil classification systems from cone penetration test (CPT) data are based on simple two-dimensional charts. One alternative approach is using machine learning (ML) to produce new soil classification systems or to reproduce existing ones. The available studies within this research field can be considered limited, once most of them do not include more than two inputs within their analysis and are applicable only to specific regions. In this context, the aim of this work is to use distance-based ML techniques to replicate two chart-based methods from the literature. Up to five input feature combinations are tested, with the objective of discussing geotechnical aspects of soil classification systems. Results are compared using the statistical test of Friedman with the post-hoc statistics of Nemenyi and the signed-rank statistical test of Wilcoxon. The used dataset can be considered diversified because it contains 111 CPT soundings from several countries. Results show that the used ML techniques maintain reasonable accuracy when inputs are substituted and when incomplete data is used, which can lead to cost reduction in real engineering projects. It is important to notice that these observations would not be possible by using the replicated soil classification systems alone.


Submitted on March 12, 2018; Final Acceptance on July 8, 2019; Discussion open until December 31, 2019. DOI: 10.28927/SR.422167