Automatic Differentiation-Based Full Waveform Inversion

👩‍💻 Introduction
⚡️ Installation
👾 Examples
📝 Features
⚖️ LICENSE
🗓️ To-Do List
🔰 Contact

👩‍💻 Introduction

ADFWI is an open-source framework for high-resolution subsurface parameter estimation by minimizing discrepancies between observed and simulated waveform. Utilizing automatic differentiation (AD), ADFWI simplifies the derivation and implementation of Full Waveform Inversion (FWI), enhancing the design and evaluation of methodologies. It supports wave propagation in various media, including isotropic acoustic, isotropic elastic, and both vertical transverse isotropy (VTI) and tilted transverse isotropy (TTI) models.

In addition, ADFWI provides a comprehensive collection of Objective functions, regularization techniques, optimization algorithms, and deep neural networks. This rich set of tools facilitates researchers in conducting experiments and comparisons, enabling them to explore innovative approaches and refine their methodologies effectively.

⚡️ Installation

To install ADFWI, please follow these steps:

Ensure Prerequisites
make sure you have the following software installed on your system:
- Python 3.8+: Download Python from the official website: Python Downloads.
- pip (Python package installer).
Create a Virtual Environment (Optional but Recommended) It is recommended to create a virtual environment to manage your project dependencies. You can use either venv or conda. For example, using conda:
```
conda create --name adfwi-env python=3.8
conda activate adfwi-env
```
Install Required Packages

Method 1: Clone the github Repository This method provides the latest version, which may be more suitable for your research:
```
git clone https://github.com/liufeng2317/ADFWI.git
cd ADFWI
```
Then, install the necessary packages:
```
pip install -r requirements.txt
```
Method 2: Install via pip Alternatively, you can directly install ADFWI from PyPI:
```
  pip install ADFWI-Torch
```

Verify the Installation To ensure that ADFWI is installed correctly, run any examples located in the examples folder.
Troubleshooting If you encounter any issues during installation, please check the Issues section of the GitHub repository for potential solutions or to report a new issue.

👾 Examples

1. Gradient Comparation between AD & Central Difference

A comparative analysis of gradient calculations obtained through AD and the central difference method.

Test Name	Status	Path	Figure
Acoustic (vp)	✅	Codes
Elastic (vp/vs/rho)	✅	Codes	Gradient Comparation

2. Iso-acoustic Model Tests

Test Name	Status	Example's Path	Example's Figure
Marmousi2 (low resulotion)	✅	Example-Marmousi2 (low)	Inversion Process
Marmousi2 (high resulotion)	✅	Example-Marmousi2 (high)	Inversion Process
Marmousi2 (vp and rho)	✅	Example-vp & rho	Inversion Process
FootHill (low resulotion)	✅	Example-FootHill (low)	Inversion Process
FootHill (high resulotion)	✅	Example-FootHill (high)	Inversion Process
SEAM-I	✅	Example-SEAM-I	Inversion Process
Overthrust-offshore	✅	Example-Overthrust-offshore	Inversion Process
Anomaly	✅	Example-Anomaly	Inversion Process

3. Iso-elastic & VTI-elastic Model Tests

Test Name	Status	Path	Figure
Iso-elastic Anomaly	✅	Example-Anomaly	Inversion Process
Iso-elastic Marmousi2-1	✅	Shot & Rec on Surface	Inversion Process
Iso-elastic Marmousi2-2	✅	Shot & Rec Underwater	Inversion Process
VTI-elastic Anomaly-1	✅	Inv Epsilon	Inversion Process
VTI-elastic Anomaly-2	✅	Inv Epsilon & Delta	Inversion Process

4. Misfits Tests

We assess the convexity of different objective functions by simulating seismic records using shifted wavelets. The following table summarizes the results and provides examples for further exploration.

Convexity

Example-Ricker Shift

Example-Ricker Shift & vary Amplitude

Example-Ricker Shift & domain Frequency

Example-Ricker Shift & Gaussian noise

It is important to note that we present the performance of various objective functions under poorer initial models. When using better initial model conditions, each objective function demonstrates improved performance. Relevant results can be found in Better Initial Model.

Test Name	Status	Path	Figure
L1-norm	✅	Example-L1	Inversion Process
L2-norm	✅	Example-L2	Inversion Process
T-distribution (StudentT)	✅	Example-StudentT	Inversion Process
Envelope	✅	Example-Envelope	Inversion Process
Global Correlation (GC)	✅	Example-GC	Inversion Process
Soft Dynamic Time Warping (soft-DTW)	✅	Example-SoftDTW	Inversion Process
Wasserstein Distance with Sinkhorn	✅	Example-Wasserstein	Inversion Process
Hybrid Misfit: Envelope & Global Correlation (WECI)	✅	Example-WECI	Inversion Process

The following misfit functions are still in beta version and are undergoing further development and validation. Their performance and reliability will be evaluated in future studies.

Misfit Name	Status	Path	Figure
Travel Time	🛠️	Example-TravelTime	Inversion Process
Normalized Integration Method (NIM)	🛠️	Example-NIM	Inversion Process 🖼️ Image under development

5. Optimizer Tests

The results presented below specifically characterize the impact of using the L2-norm objective function in conjunction with the Marmousi2 model. It is important to note that the effects of different optimization algorithms may vary significantly when applied to other objective functions or models. Consequently, the findings should be interpreted within this specific context, and further investigations are recommended to explore the performance of these algorithms across a broader range of scenarios.

Test Name	Status	Path	Figure
Stochastic Gradient Descent (SGD)	✅	Example-SGD	Inversion Process
Average Stochastic Gradient Descent (ASGD)	✅	Example-ASGD	Inversion Process
Root Mean Square Propagation (RMSProp)	✅	Example-RMSProp	Inversion Process
Adaptive Gradient Algorithm (Adagrad)	✅	Example-Adagrad	Inversion Process
Adaptive Moment Estimation (Adam)	✅	Example-Adam	Inversion Process
Adam with Weight Decay (AdamW)	✅	Example-AdamW	Inversion Process
Nesterov-accelerated Adam (NAdam)	✅	Example-NAdam	Inversion Process
Rectified Adam (RAdam)	✅	Example-RAdam	Inversion Process

Test Name	Status	Path	Figure
L-BFGS	✅	Example-LBFGS	Inversion Process

6. Regularization Methods

Test Name	Status	Path	Figure
no-regularization	✅	no-regular	Inversion Process
Tikhonov-1st Order	✅	Example-Tikhonov1	Inversion Process
Tikhonov-2nd Order	✅	Example-Tikhonov2	Inversion Process
Total Variation-1st Order	✅	Example-TV1	Inversion Process
Total Variation-2nd Order	✅	Example-TV2	Inversion Process

7. Multi-Scale Strategy in FWI

Multi-scale strategies play a critical role in FWI as they help to mitigate non-linearity issues and enhance convergence, especially for complex models. Multi-scale strategies are currently in development to further improve robustness and efficiency.

Test Name	Status	Path	Figure
Iso-elastic Marmousi2	✅	Multi-freq (2Hz,3Hz,5Hz)	Inversion Process

Multi-Scale Name	Status	Path	Figure
Multi-Offsets	🛠️	on-going	Inversion Process 🖼️ Image under development
Multi-scale in Time	🛠️	on-going	Inversion Process 🖼️ Image under development

8. Deep Image Prior (Earth Model Reparameterization)

Test Name	Status	Path	Figure
no-regularization	✅	no-regular	Inversion Process
2-Layer CNN	✅	Example-2LayerCNN	Inversion Process
3-Layer CNN	✅	Example-3LayerCNN	Inversion Process
3-Layer Unet	✅	Example-3LayerUNet	Inversion Process
4-Layer Unet	✅	Example-4LayerUNet	Inversion Process

9. Uncertainty Estimation Using Deep Neural Networks (DNNs)

We employ DNNs derived from the Deep Image Prior (DIP) test described earlier, to perform uncertainty estimation. The variable p represents the dropout ratio applied during both training and inference to evaluate uncertainty.

Test Name	Status	Path	Figure
2LayerCNN-uncertainty	✅	Codes

📝 Features

Multi-Wave Equation:
- Iso-Acoustic
- Iso-Elastic
- VTI-Elastic
- TTI-Elastic
Various Objective Functions
Various Optimization Methods
Multiscale FWI inversion
- multi-frequency
Deep Neural Network Integration
- DNNs Reparameterization: DNNs reparameterize the Earth model, introducing learnable regularization to improve the inversion process.
- Dropout: Applied to assess inversion uncertainty by randomly dropping units during training, providing a measure of model robustness.
- Multiphysics Joint Inversion (on-going): Neural networks are used to fuse data from different physical fields, enabling joint inversion for a more comprehensive and accurate Earth model.
Resource Management
- Mini-batch: multi-shot data can be large due to different source positions and time steps. Splitting it into mini-batches prevents loading the entire dataset into memory at once.
- Checkpointing: a key memory-saving technique, particularly for backpropagation in FWI. Instead of storing all intermediate results, only a few checkpoints are saved. During backpropagation, missing steps are recomputed, reducing memory usage at the cost of extra computation.
- boundary saving (on-going): methods are being developed to efficiently reduce memory usage by saving only the wavefield boundaries during forward propagation instead of the entire wavefield, allowing for their later use in backpropagation.
Acceleration Methods
- GPU Acceleration: Utilizes the parallel processing power of GPUs to significantly speed up computations, especially for large-scale simulations like FWI.
- JIT(Just-in-Time): Speeds up code execution by compiling Python code into optimized machine code at runtime, improving performance without modifying the original codebase.
- Reconstruction Using Lower-Level Language (C++) (on-going): Involves rewriting performance-critical components in C++ to leverage lower-level optimizations, resulting in faster execution times and improved overall efficiency.
Robustness and Portability
- Each of the method has proposed a code for testing.

⚖️ LICENSE

The Automatic Differentiation-Based Full Waveform Inversion (ADFWI) framework is licensed under the MIT License. This license allows you to:

Use: You can use the software for personal, academic, or commercial purposes.
Modify: You can modify the software to suit your needs.
Distribute: You can distribute the original or modified software to others.
Private Use: You can use the software privately without any restrictions.

🗓️ To-Do List

Memory Optimization through Boundary and Wavefield Reconstruction
Objective: Implement a strategy to save boundaries and reconstruct wavefields to reduce memory usage.
Explanation: This approach focuses on saving only the wavefield boundaries during forward propagation and reconstructing the wavefields as needed. By doing so, it aims to minimize memory consumption while ensuring the accuracy of wave propagation simulations, particularly in large-scale models.

Related Work:
- [1] P. Yang, J. Gao, and B. Wang, RTM using Effective Boundary Saving: A Staggered Grid GPU Implementation, Comput. Geosci., vol. 68, pp. 64–72, Jul. 2014. doi: 10.1016/j.cageo.2014.04.004.
- [2] Wang, S., Jiang, Y., Song, P., Tan, J., Liu, Z., & He, B., 2023. Memory Optimization in RNN-Based Full Waveform Inversion Using Boundary Saving Wavefield Reconstruction, IEEE Trans. Geosci. Remote Sens., 61, 1–12. doi: 10.1109/TGRS.2023.3317529.
C++ / C-Based Forward Propagator
Objective: Develop a forward wave propagation algorithm using C++ or C.
Explanation: Implementing the forward propagator in lower-level languages like C++ or C will significantly enhance computational performance, particularly for large-scale simulations. The aim is to leverage the improved memory management, faster execution, and more efficient parallel computing capabilities of these languages over Python-based implementations.
Resource Optimization for Memory Efficiency
Objective: Reduce memory consumption for improved resource utilization.
Explanation: The current computational framework may encounter memory bottlenecks, especially when processing large datasets. Optimizing memory usage by identifying redundant storage, streamlining data structures, and using efficient algorithms will help in scaling up the computations while maintaining or even enhancing performance. This task is critical for expanding the capacity of the system to handle larger and more complex datasets.
Custom Input Data Management System
Objective: Develop a tailored system for managing input data effectively.
Explanation: A customized data management framework is needed to better organize, preprocess, and handle input data efficiently. This may involve designing workflows for data formatting, conversion, and pre-processing steps, ensuring the consistency and integrity of input data. Such a system will provide flexibility in managing various input types and scales, and it will be crucial for maintaining control over data quality throughout the project lifecycle.
Enhanced Gradient Processing
Objective: Implement advanced techniques for gradient handling.
Explanation: Developing a sophisticated gradient processing strategy will improve the inversion results by ensuring that gradients are effectively utilized and interpreted. This may include techniques such as gradient clipping, adaptive learning rates, and noise reduction methods to enhance the stability and convergence of the optimization process, ultimately leading to more accurate inversion outcomes.
Multi-Scale Inversion Strategies (done!)
Objective: Introduce multi-scale approaches for improved inversion accuracy.
Explanation: Multi-scale inversion involves processing data at various scales to capture both large-scale trends and small-scale features effectively. Implementing this strategy will enhance the robustness of the inversion process, allowing for better resolution of subsurface structures. Techniques such as hierarchical modeling and wavelet analysis may be considered to achieve this goal, thus improving the overall quality of the inversion results.
Real Data Testing
Objective: Evaluate the performance and robustness of the developed methodologies using real-world datasets. Explanation: Conducting tests with actual data is crucial for validating the effectiveness of the Status algorithms. This will involve the following steps:
1. Dataset Selection: Identify relevant real-world datasets that reflect the complexities of the target applications. These datasets should include diverse scenarios and noise characteristics typical in field data.
2. Preprocessing: Apply necessary preprocessing techniques to ensure data quality and consistency. This may include data normalization, filtering, and handling missing or corrupted values.
3. Implementation: Utilize the developed algorithms on the selected datasets, monitoring their performance metrics such as accuracy, computational efficiency, and convergence behavior.
4. Comparison: Compare the results obtained from the Status methods against established benchmarks or existing methodologies to assess improvements.
5. Analysis: Analyze the outcomes to identify strengths and weaknesses, and document any discrepancies or unexpected behaviors. This analysis will help refine the algorithms and inform future iterations.
6. Reporting: Summarize the findings in a comprehensive report, detailing the testing procedures, results, and any implications for future work.
This actual data testing phase is essential for ensuring that the developed methodologies not only perform well in controlled environments but also translate effectively to real-world applications. It serves as a critical validation step before broader deployment and adoption.

🔰 Contact

Developed by Feng Liu at the University of Science and Technology of China (USTC) and Shanghai Jiao Tong University (SJTU).

The related paper Full Waveform Inversion of (An)Isotropic Wave Physics in a Versatile Deep Learning Framework is currently in preparation.

For any inquiries, please contact Liu Feng via email at: [email protected] or [email protected].

@software{ADFWI_LiuFeng_2024,
  author       = {Feng Liu, Haipeng Li, GuangYuan Zou and Junlun Li},
  title        = {ADFWI},
  month        = July,
  year         = 2024,
  version      = {v1.0},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Differentiation-Based Full Waveform Inversion

👩‍💻 Introduction

⚡️ Installation

👾 Examples