A gigabyte interpreted seismic dataset for automatic fault recognition



An, Y.1, Guo, J.2, Ye, Q.3, Childs, C., Walsh, J.J. & Dong, R.1
1 - The Insight Centre for Data Analytics, School of Computer Science, University College Dublin, Dublin, Ireland.
2 - Current address: C&C Reservoirs, Brunel House, Reading, United Kingdom.
3 - Key Laboratory of Tectonics and Petroleum Resources, Ministry of Education, China University of Geosciences, Wuhan 430074, China.


Abstract - The lack of large-scale open-source expert-labelled seismic datasets is one of the barriers to applying today’s AI techniques to automatic fault recognition tasks. The dataset present in this article consists of a large number of processed seismic images and their corresponding fault annotations. The processed seismic images, which are originally from a seismic survey called Thebe Gas Field in the Exmouth Plateau of the Carnarvan Basin on the NW shelf of Australia, are represented in Python Numpy format, which can be easily adopted by various AI models and will facilitate cooperation with researchers in the field of computer science. The corresponding fault annotations were firstly manually labelled by expert interpreters of faults from seismic data in order to investigate the structural style and associated evo- lution of the basin. Then the fault interpretation and seismic survey are processed and collected using Petrel software and Python programs separately. This dataset can help to train, validate, and evaluate the performance of different automatic fault recognition workflow.

Data in Brief, 37, 107219, doi: 10.1016/j.dib.2021.107219