DMIAN: Deep Learning-Based Multi-IMU Fusion for Enhanced Marine Aided Navigation

Matko Batoš  ·  Đula Nađ
University of Zagreb  ·  Faculty of Electrical Engineering and Computing
Laboratory for Underwater Systems and Technologies (LABUST)
Centre of Excellence in Maritime Robotics and Technologies for Sustainable Blue Economy (CoE MARBLE)
Velocity estimate
Position estimate

Estimated results from the proposed method. Left: velocity over time. Right: position over time.

4.3 ms
Inference time (CPU)
0.11
Vel. RMSE (m/s)
0.88
R² Score
0.804
Pos. RMSE (m)
54.6%
Improvement vs INS-DVL
27.5%
Improvement vs NN baseline

Abstract

Learned inertial odometry has advanced rapidly across domains, especially in GNSS-denied environments. This paper introduces a learning-based approach that combines multiple IMUs with a DVL to improve velocity estimation for marine vehicles. The proposed method employs a multi-head attention Long Short-Term Memory network to fuse temporally and spatially distributed inertial signals with aiding velocity measurements. The model outputs both velocity estimates and their corresponding covariances, which are integrated as measurement updates within an EKF. This hybrid design allows learned features to complement traditional state estimation while maintaining filter consistency. The system is implemented and validated on the H2OmniX platform through a diverse set of trajectories. The method takes less than 5 ms for inference both on the GPU and the CPU, demonstrating less than 0.11 m/s RMSE and more than 0.88 R² in unseen trajectories through all ablation studies. The multi-IMU and DVL fusion provides the most accurate results, whereas the models with other IMU configurations continue to deliver reliable estimations when additional data are unavailable.

Deep Learning Inertial Navigation Multi-IMU LSTM ASV DVL EKF

Contributions

System Design

H2OmniX sensor setup

H2OmniX ASV with the multi-IMU enclosure and DVL mounted underneath.

Platform. The H2OmniX is an omnidirectional ASV developed at LABUST. It measures 1 m in length, weighs approximately 15 kg, and runs navigation software on ROS2 Humble.

Sensor setup. Three Xsens MTi-630R IMUs are mounted orthogonally in a perpendicular triad inside an IP-rated enclosure at 100 Hz. A NavQuest 600 Micro DVL is mounted beneath the platform providing 3-axis velocity at 5 Hz.

EKF. A constant-acceleration motion model with state vector x = [p, Θ, v, ω, a]. All configurations share the same prediction model and process noise Q.

Method

Raw IMU and DVL data are transformed to a common body frame, then passed to the multi-head LSTM. Predicted velocities and diagonal covariances serve as EKF measurement updates.

System workflow

System workflow from raw sensor data to EKF state variables.

Architecture. Each of the three IMU branches and the DVL branch is an independent 2-layer LSTM with 64 hidden units. Branch outputs are concatenated and passed through fully connected fusion layers into a velocity head and an uncertainty head.

Loss. Two-stage training: 10 epochs of MSE to stabilize velocity regression, then Gaussian Negative Log-Likelihood (GNLL) to jointly learn per-axis uncertainty. The diagonal covariance structure ensures a positive-definite measurement noise matrix for EKF integration.

Results

Comparison of DMIAN against BeamsNet and classical INS-DVL across five evaluation trajectories.

2D trajectory grid all methods

2D trajectory comparison across all five evaluation trajectories. Rows: INS-DVL (top), BeamsNet (middle), DMIAN (bottom).

Table 1 — Average Position Performance

MetricBeamsNetINS-DVLDMIAN
MAE (m)0.826321.206090.73442
MSE (m²)0.877672.526720.74459
RMSE (m)0.910231.508360.80424

Table 2 — Average Velocity Performance

MetricBeamsNetINS-DVLDMIAN
MAE (m/s)0.116400.188170.09710
MSE (m²/s²)0.023890.064810.01252
RMSE (m/s)0.153680.245390.11137
0.756940.297770.87802
Position error

Position error over time — Trajectory 4.

Velocity

Velocity estimates — Trajectory 4.

Position

Position estimates — Trajectory 4.

Ablation Study

IMU count is varied from 1 to 3 while keeping architecture, training, and process noise Q identical. 1-IMU and 2-IMU results are averaged across all possible IMU combinations.

Velocity and Position Performance by IMU Configuration

Table 3 — Average Velocity (all with DVL)

Metric1 IMU + DVL2 IMU + DVL3 IMU + DVL
MAE (m/s)0.105100.101540.09587
MSE (m²/s²)0.014810.013500.01222
RMSE (m/s)0.120910.115720.10998
0.857810.868950.88017
Velocity ablation

Velocity RMSE distribution per IMU configuration.

Position ablation

Position error distribution per IMU configuration.

Drift rate CDF

Cumulative distribution of drift rate.

Position RMSE CDF

Cumulative distribution of position RMSE.

Citation

@article{batos2026dmian,
  title   = {DMIAN: Deep Learning-Based Multi-IMU Fusion for Enhanced Marine Aided Navigation},
  author  = {Bato\v{s}, Matko and Na\dj{}, \DJ{}ula},
  journal = {Submitted to Control Engineering Practice},
  year = {2026},
  volume = {},
  pages = {},
  issn = {},
  doi = {},
  url = {}
}
Citation details will be updated upon article publication.

Contact

Matko Batoš

matko.batos@fer.unizg.hr

Laboratory for Underwater Systems and Technologies (LABUST)
Faculty of Electrical Engineering and Computing, University of Zagreb
Unska ul. 3, 10000 Zagreb, Croatia