MPF-Bench: A Large-Scale Dataset for SciML of Multiphase Flows: Droplet and Bubble Dynamics

Mehdi Shadkhah¹, Ronak Tali¹, Ali Rabeh¹, Ethan Herron¹, Cheng-Hau Yang¹, Abhisek Upadhyaya², Adarsh Krishnamurthy¹, Chinmay Hegde², Aditya Balu¹, Baskar Ganapathysubramanian¹

¹Iowa State University, Ames, IA, USA   ²New York University, New York, NY, USA





Abstract

Multiphase fluid dynamics, such as falling droplets and rising bubbles, are critical to many industrial applications. However, simulating these phenomena efficiently is challenging due to the complexity of instabilities, wave patterns, and bubble breakup. This paper investigates the potential of scientific machine learning (SciML) to model these dynamics using neural operators and foundation models. We apply sequence-to-sequence techniques on a comprehensive dataset generated from 11,000 simulations, comprising 1 million time snapshots, produced with a well-validated Lattice Boltzmann method (LBM) framework. The results demonstrate the ability of machine learning models to capture transient dynamics and intricate fluid interactions, paving the way for more accurate and computationally efficient SciML-based solvers for multiphase applications.




Benchmark

We present the MPF-Bench dataset, encompassing 5,500 bubble-rise and 5,500 droplet-fall simulations: for each category, 5,000 2D simulations with 101 time snapshots each and 500 3D simulations with 51 time snapshots each. To our knowledge, this makes it two orders of magnitude larger, in terms of number of time snapshots, than any existing multiphase flow dataset. The dataset features 2D and 3D transient simulations capturing a spectrum of flow behaviors influenced by surface tension and density/viscosity ratios, from bubble oscillations with minor surface deformations to complete bubble breakup, offering a comprehensive resource for studying bubble-rise and droplet-fall dynamics.


In the MPF-Bench dataset, we present a comprehensive collection of simulations designed to capture the intricate physics of falling droplets in both two- and three-dimensional settings. The Droplet-2D dataset includes 5,000 simulations, each providing 101 time steps, offering a detailed view of how fluid properties such as density and viscosity ratios influence the behavior of droplets falling under gravity. The Droplet-3D dataset, though smaller at 500 simulations with 51 time steps each, provides a three-dimensional perspective on the same phenomena, allowing a more nuanced view of the interplay between gravity and surface tension. Both datasets are generated with the Lattice Boltzmann method, ensuring high fidelity in capturing droplet dynamics, including deformation and breakup, across a wide range of Reynolds and Bond numbers. They are a critical resource for training and validating Scientific Machine Learning (SciML) models, enabling more accurate predictions of multiphase flow behavior in engineering and scientific applications.


The Bubble dataset in MPF-Bench is an extensive collection of simulations that captures the dynamic behavior of rising bubbles in both two-dimensional and three-dimensional environments. The Bubble-2D dataset includes 5,000 simulations, each comprising 101 time steps, providing a detailed exploration of bubble dynamics under varying fluid properties, such as density and viscosity ratios. These simulations illustrate how bubbles rise through a denser fluid medium, influenced by factors like buoyancy and surface tension, leading to complex phenomena such as shape deformation and breakup. The Bubble-3D dataset, consisting of 500 simulations with 51 time steps each, offers a three-dimensional perspective, allowing for a more intricate examination of bubble behavior, including oscillations and path instability. Both datasets are generated using the Lattice Boltzmann method, ensuring high accuracy in capturing the essential physics of bubble dynamics across a wide range of Reynolds and Bond numbers. These datasets are instrumental for developing and testing Scientific Machine Learning (SciML) models, which aim to improve the prediction and understanding of multiphase flow phenomena in various scientific and industrial applications.

In fluid dynamics, the Reynolds number (Re) and Bond number (Bo) are two critical dimensionless quantities that characterize the flow regime of two-phase systems such as bubbles and droplets. The Reynolds number, Re, is a measure of the ratio of inertial forces to viscous forces within the fluid, and it indicates whether the flow will be laminar or turbulent. The Bond number, Bo, on the other hand, represents the ratio of gravitational forces to surface tension forces, determining the shape and stability of bubbles or droplets in a fluid.
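For concreteness, one common convention for bubble-rise and droplet-fall problems defines these numbers as

\[
Re = \frac{\rho_l U_r D}{\mu_l}, \qquad Bo = \frac{(\rho_l - \rho_g)\, g\, D^2}{\sigma},
\]

where \(\rho_l\) and \(\rho_g\) are the heavy- and light-phase densities, \(\mu_l\) is the heavy-phase viscosity, \(D\) is the initial bubble or droplet diameter, \(g\) is the gravitational acceleration, \(\sigma\) is the surface-tension coefficient, and \(U_r\) is a characteristic velocity, often taken as the gravitational scale \(U_r = \sqrt{gD}\). The exact reference scales used to non-dimensionalize MPF-Bench are defined in the accompanying paper and may differ from this convention.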

The MPF-Bench dataset spans a wide range of these two parameters, capturing flow regimes from simple, predictable behaviors to highly complex, turbulent ones. The figure below showcases four distinct cases from the 3D bubble dataset, highlighting scenarios with both low and high Reynolds and Bond numbers. As these numbers increase, the bubble undergoes significant deformation, making its behavior harder for models to predict accurately. The same trend holds for droplet dynamics, where increased Re and Bo lead to more intricate, harder-to-predict flows. By understanding the flow regimes represented in the dataset, users can choose easier or more challenging cases to train and assess their models, depending on their research or application needs.




Figure: Visualization of four different cases from the 3D bubble dataset, illustrating the impact of varying Reynolds (Re) and Bond (Bo) numbers on bubble deformation. As Re and Bo numbers increase, the bubble exhibits more complex and challenging dynamics, demonstrating the varying levels of difficulty in predicting bubble behavior across different flow regimes.

We provide a comprehensive table to help users select the most suitable cases for training and evaluating their models. Our dataset varies four key parameters: the density ratio, the viscosity ratio, the Bond number (Bo), and the Reynolds number (Re). The density ratio ranges from 10 to 1000, the viscosity ratio from 1 to 100, and Bo from 10 to 500. The Reynolds number spans 10 to 500 in the bubble simulations and 10 to 1000 in the droplet simulations. Because of the fluids' inherent properties, the viscosity ratio tracks the density ratio, so the two are typically either both high or both low. To support model development, we have introduced a normalized difficulty classification reflecting the complexity of capturing interface deformations; this relative rating guides case selection, and most scenarios follow the pattern in the table below.



| Density Ratio | Viscosity Ratio | \(Bo\) Number | \(Re\) Number | Difficulty |
| --- | --- | --- | --- | --- |
| High | High | High | High | Challenging |
| High | High | Low | Low | Easy |
| High | High | High | Low | Moderate |
| High | High | Low | High | Moderate |
| Low | Low | High | High | Challenging |
| Low | Low | Low | Low | Easy |
| Low | Low | High | Low | Easy |
| Low | Low | Low | High | Moderate |

The MPF-Bench dataset is extensive, comprising 11,000 simulations divided equally between bubble and droplet dynamics, with each category further broken down into 2D and 3D simulations. For both, we provide the input parameters (density ratio, viscosity ratio, Reynolds number, and Bond number) that determine the complex flow behaviors captured in the dataset. The simulation outputs include the interface indicator, the velocity components in all directions, pressure, and density, giving a comprehensive view of the multiphase flow dynamics. The dataset is structured for easy integration into Scientific Machine Learning (SciML) models, facilitating advances in the understanding and prediction of multiphase flow phenomena.




| Dataset | Dim. | Input Tensor | Output Tensor | Size |
| --- | --- | --- | --- | --- |
| Droplet | 2D | X[5000][\(\rho^*\), \(\mu^*\), Bo, Re] | Y[5000][101][c, u, v, p, \(\rho\)][y][x] | 566 GB |
| Bubble | 2D | X[5000][\(\rho^*\), \(\mu^*\), Bo, Re] | Y[5000][101][c, u, v, p, \(\rho\)][y][x] | 822 GB |
| Droplet | 3D | X[500][\(\rho^*\), \(\mu^*\), Bo, Re] | Y[500][51][c, u, v, w, p, \(\rho\)][z][y][x] | 1200 GB |
| Bubble | 3D | X[500][\(\rho^*\), \(\mu^*\), Bo, Re] | Y[500][51][c, u, v, w, p, \(\rho\)][z][y][x] | 1400 GB |

To facilitate easier access and experimentation with our extensive dataset, we have provided a down-sampled version. This sample dataset is specifically tailored for quick testing and prototyping, containing 100 simulations for each of the 2D categories and 5 simulations for each of the 3D categories. Each of these simulations is further simplified, with only 10 time steps included. This allows users to get a taste of the full dataset's capabilities without the need for extensive computational resources or time commitment. The down-sampled dataset serves as an excellent starting point for researchers and developers to explore the dynamics of droplet and bubble behaviors in both 2D and 3D, making it easier to test ideas and methodologies before scaling up to the full dataset.
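To illustrate how the tensors in the table above can be indexed, here is a minimal Python sketch. The file name and archive format are hypothetical (adjust them to the actual release files); only the array shapes follow the dataset table.

```python
import numpy as np

# Hypothetical file name/format -- substitute the actual files from the release.
data = np.load("droplet2d_sample.npz")
X = data["X"]  # shape (n_sims, 4): density ratio, viscosity ratio, Bo, Re
Y = data["Y"]  # shape (n_sims, n_steps, 5, ny, nx): channels c, u, v, p, rho

params = X[0]                    # input parameters of the first simulation
c_last = Y[0, -1, 0]             # interface indicator c at the final snapshot
u, v = Y[0, -1, 1], Y[0, -1, 2]  # velocity components at the final snapshot
print(params, c_last.shape)
```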







Neural Operators

In recent years, deep learning has transformed scientific computing, particularly the solution of complex partial differential equations (PDEs) and the modeling of multiphase flows. Several neural network architectures have been proposed, each leveraging different mathematical and computational principles to approximate solutions to these challenging problems. In this section, we introduce four neural operator models that have demonstrated significant success in approximating the behavior of complex systems, including those in our droplet and bubble datasets: UNet, DeepONet, the Fourier Neural Operator (FNO), and the Convolutional Neural Operator (CNO). We also describe Poseidon, a transformer-based foundation model evaluated alongside them. Each offers unique advantages and requires specific considerations for effective application.

Proposed in 2015 for image segmentation tasks, UNet is designed around the concept of semantic segmentation: assigning a class to every pixel in an image. It achieves this through a series of encoders and decoders, connected by bridge blocks and arranged in a sequential, symmetrical pattern resembling the letter "U", which gives UNet its name. A key advantage of UNet is its ability to extract a large amount of information from each image, enabling it to match the performance of conventional convolutional networks while requiring significantly fewer training samples.

Reference: Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2015 (pp. 234-241). Springer International Publishing.
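As a concrete illustration of the encoder-bridge-decoder pattern, here is a minimal one-level UNet sketch in PyTorch. It is not the architecture used in the benchmark; the channel counts are illustrative, and the five input/output channels (matching the 2D fields c, u, v, p, \(\rho\)) are an assumption.

```python
import torch
import torch.nn as nn

def conv_block(cin, cout):
    return nn.Sequential(
        nn.Conv2d(cin, cout, 3, padding=1), nn.ReLU(),
        nn.Conv2d(cout, cout, 3, padding=1), nn.ReLU())

class TinyUNet(nn.Module):
    """One encoder level, a bridge, and one decoder level with a skip
    connection -- the 'U' pattern in miniature."""
    def __init__(self, in_ch=5, out_ch=5, width=32):
        super().__init__()
        self.enc = conv_block(in_ch, width)
        self.down = nn.MaxPool2d(2)
        self.bridge = conv_block(width, 2 * width)
        self.up = nn.ConvTranspose2d(2 * width, width, 2, stride=2)
        self.dec = conv_block(2 * width, width)
        self.head = nn.Conv2d(width, out_ch, 1)

    def forward(self, x):
        e = self.enc(x)                                  # encoder features
        b = self.bridge(self.down(e))                    # bottleneck ("bridge")
        d = self.dec(torch.cat([self.up(b), e], dim=1))  # skip connection
        return self.head(d)

y = TinyUNet()(torch.randn(1, 5, 64, 64))                # -> (1, 5, 64, 64)
```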

Proposed in 2019, DeepONets are based on the Universal Approximation Theorem for operators. The core idea is to split the problem into two parts: the input function evaluations at specific locations are passed through a fully connected neural network called the branch net, while the location coordinates where outputs are desired are passed through another fully connected network called the trunk net. The outputs from these two networks are then combined through a dot product, which is compared with the actual outputs during training. The entire system is trained using a loss function and backpropagation. One of the appealing aspects of DeepONet is its simplicity; it doesn’t require complex hyperparameter tuning, as fairly general fully connected networks with a few hidden layers work effectively for most cases.

Reference: Lu, L., Jin, P., Pang, G., Zhang, Z., & Karniadakis, G. E. (2021). Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3, 218–229.
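A minimal sketch of the branch/trunk construction follows; the sensor count, coordinate dimension, and latent size \(p\) are illustrative choices, not the benchmark configuration.

```python
import torch
import torch.nn as nn

class MiniDeepONet(nn.Module):
    """Branch net encodes the input function sampled at fixed sensor
    locations; trunk net encodes the query coordinates; the operator
    output is their dot product."""
    def __init__(self, n_sensors=100, coord_dim=2, p=64):
        super().__init__()
        self.branch = nn.Sequential(
            nn.Linear(n_sensors, 128), nn.Tanh(), nn.Linear(128, p))
        self.trunk = nn.Sequential(
            nn.Linear(coord_dim, 128), nn.Tanh(), nn.Linear(128, p))

    def forward(self, u_sensors, coords):
        # u_sensors: (batch, n_sensors); coords: (n_query, coord_dim)
        b = self.branch(u_sensors)   # (batch, p)
        t = self.trunk(coords)       # (n_query, p)
        return b @ t.T               # (batch, n_query) field values

model = MiniDeepONet()
out = model(torch.randn(8, 100), torch.rand(256, 2))  # -> (8, 256)
```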

Proposed in 2020, the Fourier Neural Operator (FNO) is a neural operator designed to approximate continuous nonlinear operator solutions to partial differential equations (PDEs). It achieves this by stacking a sequence of linear integral operators, which are implemented using a combination of convolutional kernels and the Fast Fourier Transform (FFT). After applying FFT, high frequencies are truncated to respect the Nyquist limit, and then an Inverse Fast Fourier Transform (IFFT) is applied. This process is followed by nonlinear activation functions, creating a block that forms the core of the FNO architecture. For practitioners, several hyperparameters are crucial when working with FNOs: the number of operator blocks, the number of retained frequencies after FFT, and the number of hidden nodes.

Reference: Li, Z., Kovachki, N., Azizzadenesheli, K., Liu, B., Bhattacharya, K., Stuart, A., & Anandkumar, A. (2021). Fourier Neural Operator for Parametric Partial Differential Equations. arXiv preprint arXiv:2010.08895.
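The core spectral block can be sketched as below. This is a simplified single-block version: a full FNO also retains the negative-frequency block of the spectrum and adds a pointwise linear skip path, and the channel and mode counts here are illustrative.

```python
import torch
import torch.nn as nn

class SpectralConv2d(nn.Module):
    """Linear integral operator in Fourier space: FFT, keep the lowest
    `modes` frequencies, multiply by learned complex weights, inverse FFT."""
    def __init__(self, channels, modes):
        super().__init__()
        self.modes = modes
        scale = 1.0 / channels
        self.weight = nn.Parameter(
            scale * torch.randn(channels, channels, modes, modes,
                                dtype=torch.cfloat))

    def forward(self, x):                      # x: (batch, channels, h, w)
        x_ft = torch.fft.rfft2(x)              # (batch, channels, h, w//2+1)
        out_ft = torch.zeros_like(x_ft)
        m = self.modes
        out_ft[:, :, :m, :m] = torch.einsum(
            "bixy,ioxy->boxy", x_ft[:, :, :m, :m], self.weight)
        return torch.fft.irfft2(out_ft, s=x.shape[-2:])

# One FNO block: spectral convolution followed by a nonlinearity.
x = torch.randn(4, 32, 64, 64)
y = torch.relu(SpectralConv2d(channels=32, modes=12)(x))
```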

Proposed in 2023, the Convolutional Neural Operator (CNO) is an adaptation of the U-Net architecture specifically designed for operator learning. CNOs address a common issue found in Fourier Neural Operators (FNOs), where the application of nonlinearity can cause an explosion in the frequency spectrum, leading to violations of the Nyquist barrier. CNOs mitigate this problem through a carefully structured series of upsampling, downsampling, and residual blocks, which allow them to maintain a structure-preserving operator that satisfies the property of Continuous-Discrete Equivalence. When working with CNOs, the two primary hyperparameters to tune are the number of upsampling/downsampling blocks and the learning rate for the optimizer.

Reference: Raonić, B., Molinaro, R., De Ryck, T., Rohner, T., Bartolucci, F., Alaifari, R., Mishra, S., & de Bézenac, E. (2023). Convolutional Neural Operators for robust and accurate learning of PDEs. arXiv preprint arXiv:2302.01178.
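The up/downsampling residual pattern can be sketched schematically as below. Note that plain bilinear interpolation stands in for CNO's carefully band-limited (anti-aliased) resampling filters, so this illustrates the structure rather than faithfully implementing the method.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class UpDownResBlock(nn.Module):
    """Schematic CNO-style residual block: convolve, downsample, convolve,
    upsample back to the input resolution, then add a residual connection."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        h = F.leaky_relu(self.conv1(x))
        h = F.interpolate(h, scale_factor=0.5, mode="bilinear")   # down
        h = F.leaky_relu(self.conv2(h))
        h = F.interpolate(h, size=x.shape[-2:], mode="bilinear")  # up
        return x + h

y = UpDownResBlock(16)(torch.randn(2, 16, 64, 64))  # -> (2, 16, 64, 64)
```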

Proposed as a foundation model for scientific machine learning, Poseidon is based on the Scalable Operator Transformers (scOT) architecture, which combines hierarchical, multiscale vision transformers with U-Net-style encoder-decoder structures. Specifically, it integrates Swin Transformer V2 blocks with ConvNeXt residual blocks to capture multiscale dependencies efficiently. Trained on a large dataset of compressible Euler and incompressible Navier-Stokes equations, Poseidon is designed to generalize across complex fluid dynamics scenarios. The model is available in three sizes—tiny (20M parameters), base (160M parameters), and large (630M parameters). In this work, we utilize the tiny version of Poseidon (Poseidon-T), which provides a balance between computational efficiency and predictive accuracy for our experiments.

Reference: Herde, M., Raonić, B., Rohner, T., Käppeli, R., Molinaro, R., de Bézenac, E., & Mishra, S. (2024). Poseidon: Efficient Foundation Models for PDEs. arXiv preprint arXiv:2405.19101.





Experimental Results

In this section, we present the results of applying various neural operator models to four different scenarios within our dataset. These scenarios include sequence-to-field predictions and sequence-to-sequence predictions for both the droplet and bubble datasets. For each scenario, we evaluate model performance using two key metrics: mean squared error (MSE) and the \(L_2\) error, reported in the tables below. These metrics assess the models' accuracy and robustness in capturing the complex dynamics of multiphase flows. The results highlight the strengths and limitations of each model across different prediction tasks, offering valuable insights into their applicability for scientific machine learning in multiphase flow simulations.
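As a reference for reproducing such numbers, the sketch below shows one common way to compute the two metrics; the exact normalization used for the \(L_2\) error in our tables (absolute versus relative to the ground-truth norm) is an assumption here.

```python
import numpy as np

def mse(pred, true):
    """Mean squared error over all grid points and channels."""
    return np.mean((pred - true) ** 2)

def l2_error(pred, true, eps=1e-12):
    """L2 error, normalized by the ground-truth norm (one common convention)."""
    return np.sqrt(np.sum((pred - true) ** 2)) / (np.sqrt(np.sum(true ** 2)) + eps)
```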

The sequence-to-field evaluation tests how well machine learning models predict the interface field \(c\) of multiphase flows given different input sequences. We set up three data subsets: S1, S2, and S3. The models receive either a single time step (\(t_1\)), the sequence \(t_1\) through \(t_3\), or the sequence \(t_1\) through \(t_5\), and predict the corresponding next time snapshot for each subset. The figure below compares the ground truth with predictions from UNet, CNO, DeepONet, and Poseidon, where each row represents a different subset. S1 corresponds to a simpler case with minimal deformation, S2 involves moderate interface changes, and S3 represents a more challenging case with significant interface deformation. CNO achieves the best performance, maintaining accurate interface structures, while DeepONet struggles with high-deformation cases.


Figure: Comparison of sequence-to-field predictions for the concentration field \(c\) against the ground truth. Predictions are generated by four models: UNet, Convolutional Neural Operator (CNO), DeepONet, and Poseidon, across three data subsets. Each row corresponds to a different subset (S1, S2, or S3), while each column displays the predictions of the respective model.
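To illustrate how such input/target pairs can be assembled from the dataset tensors, the sketch below builds the S2 variant (inputs \(t_1\) through \(t_3\), target the following snapshot). The array layout follows the dataset table above, but the dummy data and the exact window indexing are assumptions for illustration.

```python
import numpy as np

# Y: (n_sims, n_steps, n_channels, ny, nx), as in the dataset table.
Y = np.random.rand(10, 101, 5, 64, 64).astype(np.float32)  # dummy stand-in

n_in = 3                  # S2: input sequence t1..t3 (assumed indexing)
inputs = Y[:, :n_in, 0]   # interface channel c over the input window
targets = Y[:, n_in, 0]   # next snapshot as the prediction target
print(inputs.shape, targets.shape)  # (10, 3, 64, 64) (10, 64, 64)
```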

| Model | Channel | S1 MSE | S1 \(L_2\) | S2 MSE | S2 \(L_2\) | S3 MSE | S3 \(L_2\) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| UNet | \(c\) | 2.60 × 10⁻² | 2.59 × 10⁻² | 8.40 × 10⁻³ | 8.07 × 10⁻³ | 9.56 × 10⁻³ | 9.04 × 10⁻³ |
| | \(u\) | 8.80 × 10⁻⁵ | 3.13 × 10⁰ | 8.00 × 10⁻⁶ | 2.04 × 10⁰ | 1.00 × 10⁻⁵ | 2.09 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 1.56 × 10⁰ | 1.00 × 10⁻⁶ | 7.84 × 10⁻¹ | 1.00 × 10⁻⁶ | 8.76 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 2.74 × 10² | 1.00 × 10⁻⁶ | 3.23 × 10² | 1.00 × 10⁻⁶ | 5.17 × 10² |
| DeepONet | \(c\) | 2.66 × 10⁻² | 2.65 × 10⁻² | 1.01 × 10⁻¹ | 1.01 × 10⁻¹ | 1.18 × 10⁻¹ | 1.18 × 10⁻¹ |
| | \(u\) | 9.10 × 10⁻⁵ | 6.71 × 10⁰ | 1.27 × 10⁻³ | 1.24 × 10⁰ | 1.71 × 10⁻³ | 9.18 × 10⁻¹ |
| | \(v\) | 1.00 × 10⁻⁶ | 8.68 × 10⁻¹ | 1.00 × 10⁻⁶ | 5.35 × 10⁻¹ | 1.00 × 10⁻⁶ | 8.89 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 1.89 × 10² | 1.00 × 10⁻⁶ | 2.43 × 10² | 1.00 × 10⁻⁶ | 1.66 × 10² |
| FNO | \(c\) | 2.72 × 10⁻² | 2.68 × 10⁻² | 9.73 × 10⁻³ | 8.97 × 10⁻³ | 2.10 × 10⁻² | 1.98 × 10⁻² |
| | \(u\) | 9.30 × 10⁻⁵ | 8.56 × 10⁰ | 1.00 × 10⁻⁵ | 2.77 × 10⁰ | 4.80 × 10⁻⁵ | 5.29 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 3.43 × 10⁰ | 1.00 × 10⁻⁶ | 1.09 × 10⁰ | 2.00 × 10⁻⁶ | 2.02 × 10⁰ |
| | \(p\) | 1.00 × 10⁻⁶ | 7.18 × 10² | 1.00 × 10⁻⁶ | 7.44 × 10² | 2.00 × 10⁻⁶ | 1.04 × 10³ |
| CNO | \(c\) | 2.62 × 10⁻² | 2.60 × 10⁻¹ | 5.89 × 10⁻³ | 5.65 × 10⁻³ | 9.41 × 10⁻³ | 9.00 × 10⁻³ |
| | \(u\) | 8.80 × 10⁻⁵ | 5.04 × 10⁰ | 4.00 × 10⁻⁶ | 1.60 × 10⁰ | 1.00 × 10⁻⁵ | 1.79 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 1.73 × 10⁰ | 1.00 × 10⁻⁶ | 5.07 × 10⁻¹ | 1.00 × 10⁻⁶ | 9.19 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 4.46 × 10² | 1.00 × 10⁻⁶ | 2.24 × 10² | 1.00 × 10⁻⁶ | 3.96 × 10² |
| scOT | \(c\) | 2.76 × 10⁻² | 2.68 × 10⁻² | 1.77 × 10⁻² | 1.68 × 10⁻² | 2.23 × 10⁻² | 2.17 × 10⁻² |
| | \(u\) | 1.29 × 10¹ | 1.29 × 10¹ | 3.50 × 10⁻⁵ | 3.91 × 10⁰ | 5.80 × 10⁻⁵ | 2.82 × 10⁰ |
| | \(v\) | 5.24 × 10⁰ | 5.24 × 10⁰ | 1.00 × 10⁻⁶ | 2.31 × 10⁰ | 1.00 × 10⁻⁶ | 1.75 × 10⁰ |
| | \(p\) | 9.65 × 10² | 9.65 × 10² | 2.00 × 10⁻⁶ | 8.12 × 10² | 2.00 × 10⁻⁶ | 7.80 × 10² |
| Poseidon | \(c\) | 2.87 × 10⁻² | 2.79 × 10⁻¹ | 3.01 × 10⁻² | 2.49 × 10⁻² | 2.49 × 10⁻² | 2.23 × 10⁻² |
| | \(u\) | 1.00 × 10⁻⁴ | 1.16 × 10¹ | 1.14 × 10⁻⁴ | 1.31 × 10¹ | 6.10 × 10⁻⁵ | 8.28 × 10⁰ |
| | \(v\) | 2.00 × 10⁻⁶ | 5.86 × 10⁰ | 1.00 × 10⁻⁶ | 4.05 × 10⁰ | 5.00 × 10⁻⁶ | 2.83 × 10⁰ |
| | \(p\) | 2.00 × 10⁻⁶ | 1.04 × 10³ | 6.00 × 10⁻⁶ | 2.35 × 10³ | 4.00 × 10⁻⁶ | 1.94 × 10³ |

The sequence-to-sequence evaluation extends the challenge by predicting a sequence of future states rather than a single snapshot. The models are trained on past sequences and tasked with predicting the system's evolution over three consecutive time steps. We define three test cases: S4, S5, and S6, whose input sequences range from \(t_1\) to \(t_3\), \(t_1\) to \(t_5\), and \(t_1\) to \(t_8\), respectively. The figure below compares predictions from the different models against the ground truth at time steps \(t_9\), \(t_{10}\), and \(t_{11}\), showing how well each model maintains accuracy over time. CNO again performs strongly, accurately capturing complex interface dynamics, while DeepONet exhibits growing errors, particularly in high-deformation regions. This benchmark provides insight into how well neural operators forecast longer-horizon fluid dynamics.


Figure: Sequence-to-sequence predictions for the concentration field \(c\) compared to the ground truth. Predictions are made by four models: UNet, Convolutional Neural Operator (CNO), DeepONet, and Poseidon. Each row represents a different time step (\(t_9\), \(t_{10}\), or \(t_{11}\)) from dataset S6, while each column shows the predictions of the respective model.



| Model | Channel | S4 MSE | S4 \(L_2\) | S5 MSE | S5 \(L_2\) | S6 MSE | S6 \(L_2\) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| UNet | \(c\) | 3.27 × 10⁻² | 2.87 × 10⁻² | 3.93 × 10⁻² | 3.31 × 10⁻² | 7.34 × 10⁻² | 6.58 × 10⁻² |
| | \(u\) | 1.35 × 10⁻⁴ | 3.54 × 10⁰ | 2.02 × 10⁻⁴ | 3.10 × 10⁰ | 6.65 × 10⁻⁴ | 3.90 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 1.38 × 10⁰ | 1.00 × 10⁻⁶ | 1.45 × 10⁰ | 1.00 × 10⁻⁶ | 9.54 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 6.76 × 10² | 1.00 × 10⁻⁶ | 6.30 × 10² | 1.00 × 10⁻⁶ | 7.79 × 10² |
| DeepONet | \(c\) | 1.64 × 10⁻¹ | 1.60 × 10⁻¹ | 2.03 × 10⁻¹ | 1.99 × 10⁻¹ | 1.94 × 10⁻¹ | 1.92 × 10⁻¹ |
| | \(u\) | 3.33 × 10⁻³ | 3.15 × 10⁰ | 5.04 × 10⁻³ | 1.64 × 10⁰ | 4.57 × 10⁻³ | 1.32 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 1.04 × 10⁰ | 1.00 × 10⁻⁶ | 7.59 × 10⁻¹ | 1.00 × 10⁻⁶ | 3.83 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 7.30 × 10² | 1.00 × 10⁻⁶ | 3.53 × 10² | 1.00 × 10⁻⁶ | 3.17 × 10² |
| FNO | \(c\) | 1.16 × 10⁻² | 1.10 × 10⁻² | 2.33 × 10⁻² | 2.20 × 10⁻² | 4.24 × 10⁻² | 4.00 × 10⁻² |
| | \(u\) | 1.70 × 10⁻⁵ | 1.00 × 10⁰ | 6.50 × 10⁻⁵ | 2.98 × 10⁰ | 2.28 × 10⁻⁴ | 1.13 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 4.50 × 10⁻¹ | 1.00 × 10⁻⁶ | 1.28 × 10⁰ | 1.00 × 10⁻⁶ | 3.98 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 1.38 × 10² | 1.00 × 10⁻⁶ | 7.87 × 10² | 1.00 × 10⁻⁶ | 2.69 × 10² |
| CNO | \(c\) | 1.74 × 10⁻² | 1.69 × 10⁻² | 1.72 × 10⁻² | 1.63 × 10⁻² | 3.78 × 10⁻² | 3.53 × 10⁻² |
| | \(u\) | 3.70 × 10⁻⁵ | 1.51 × 10⁰ | 3.60 × 10⁻⁵ | 1.37 × 10⁰ | 1.76 × 10⁻⁴ | 1.74 × 10⁰ |
| | \(v\) | 1.00 × 10⁻⁶ | 6.87 × 10⁻¹ | 1.00 × 10⁻⁶ | 6.85 × 10⁻¹ | 1.00 × 10⁻⁶ | 5.98 × 10⁻¹ |
| | \(p\) | 1.00 × 10⁻⁶ | 2.84 × 10² | 1.00 × 10⁻⁶ | 2.93 × 10² | 1.00 × 10⁻⁶ | 4.22 × 10² |
| scOT | \(c\) | 3.85 × 10⁻³ | 3.69 × 10⁰ | 3.79 × 10⁻³ | 6.27 × 10⁰ | 6.27 × 10⁻³ | 5.79 × 10⁰ |
| | \(u\) | 1.73 × 10⁻⁴ | 6.73 × 10⁰ | 6.73 × 10⁰ | 5.73 × 10⁰ | 4.85 × 10⁻⁴ | 5.31 × 10⁰ |
| | \(v\) | 3.00 × 10⁻⁶ | 2.78 × 10⁰ | 3.00 × 10⁻⁶ | 2.12 × 10⁰ | 2.00 × 10⁻⁶ | 1.93 × 10⁰ |
| | \(p\) | 3.00 × 10⁻⁶ | 1.48 × 10³ | 3.00 × 10⁻⁶ | 1.34 × 10³ | 1.00 × 10⁻⁶ | 1.21 × 10³ |
| Poseidon | \(c\) | 3.06 × 10⁻² | 2.84 × 10⁰ | 3.33 × 10⁻² | 3.17 × 10⁻² | 5.99 × 10⁻² | 5.79 × 10⁻¹ |
| | \(u\) | 1.01 × 10⁻⁴ | 8.26 × 10⁰ | 1.26 × 10⁻⁴ | 6.49 × 10⁰ | 4.30 × 10⁻⁴ | 7.79 × 10⁰ |
| | \(v\) | 5.00 × 10⁻⁶ | 3.51 × 10⁰ | 4.00 × 10⁻⁶ | 2.47 × 10⁰ | 5.00 × 10⁻⁶ | 2.16 × 10⁰ |
| | \(p\) | 5.00 × 10⁻⁶ | 1.71 × 10³ | 4.00 × 10⁻⁶ | 1.48 × 10³ | 5.00 × 10⁻⁶ | 1.81 × 10³ |

All models were trained on a single A100 80 GB GPU with a batch size of 4, using the Adam optimizer with a learning rate of \(10^{-3}\), for 400 epochs. The validation loss shown below, for CNO sequence-to-field prediction on the bubble dataset, saturates after roughly 300 epochs; the other networks exhibit similar behavior.


Figure: Validation Loss Curve: This plot illustrates the decline in validation loss over the course of training epochs, indicating the model's performance and improvement during the training process.
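For reference, the training configuration described above corresponds to a loop like the following sketch. The data here are dummy tensors, and the single convolution is a stand-in for any of the models sketched earlier; only the batch size, optimizer, learning rate, and epoch count follow the stated setup.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Dummy (input, target) snapshot pairs standing in for the real dataset.
ds = TensorDataset(torch.randn(16, 5, 64, 64), torch.randn(16, 5, 64, 64))
loader = DataLoader(ds, batch_size=4, shuffle=True)   # batch size 4

model = torch.nn.Conv2d(5, 5, 3, padding=1)           # stand-in for any model above
opt = torch.optim.Adam(model.parameters(), lr=1e-3)   # Adam, lr = 1e-3
loss_fn = torch.nn.MSELoss()

for epoch in range(400):                              # 400 epochs
    for x, y in loader:
        opt.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        opt.step()
```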

These dataset subsets contain all samples used in our paper, specifically those behind the results presented in the publication. Anyone interested in reproducing our results or benchmarking their methods can use this exact subset. While the full dataset is much larger, this selection includes all the data points referenced in our analysis and figures.





Dataset Download

DOWNLOAD



Contact

Do not hesitate to contact us with any questions about our project at [email protected]