End-to-end Fusion3DGS

January 28, 2026
|

2 min read

End-to-end Fusion3DGS

Researchers from Southwest Jiaotong University have unveiled Fusion3DGS, a new end-to-end framework that shakes up the world of 3D instance segmentation. Moving away from the industry’s heavy reliance on expensive, densely annotated 3D point clouds, this novel approach harnesses the power of 3D Gaussian Splatting (3DGS) combined with multi-view RGB images. By utilizing 2D instance masks the system optimizes a compact scene representation. This method effectively “lifts” widely available 2D data into the 3D realm, creating a geometry-aware substrate that identifies and segments objects without needing a single manual 3D label.

SAM geospatial services

End-to-end Fusion3DGS
Sample gaussian splat via https://creativecommons.org/licenses/by-sa/4.0/

Traditional pipelines often struggle with the high computational cost and massive data requirements of pure 3D supervision. This framework bridges that gap by employing a “weight-sharing lock” that ties early 2D and 3D neural kernels together, ensuring spatial consistency while preventing the model from drifting. It also features an occlusion-aware cross-attention mechanism that fuses features only where they physically align, filtering out noise from blocked views. The result is a system that rivals state-of-the-art pure 3D methods on benchmarks but operates with significantly lower memory and hardware demands.

For those diving into the technical weeds, the pipeline starts with sparse point clouds from structure-from-motion (SfM). These are densified using a Voronoi-based method and refined with bilateral filtering to smooth high-frequency noise while preserving sharp object boundaries. To keep things efficient, the system groups primitives into “SuperPoints” to reduce the token count before fusion. The team also released OIRGB, a new in-house dataset for RGB-only indoor scenes, proving the model’s robustness in natural light conditions where 2D evidence often degrades. This blend of rendering-coupled optimization and lightweight architecture marks a promising step forward for autonomous navigation and robotics.

Read More: https://www.nature.com/articles/s41598-025-33840-8

Written by Adam Clark. Adam has spent the past 13 years exploring the world from above by using drones, satellites, and mapping tools to better understand our landscapes. Connect with him on LinkedIn: Adam Clark

Get Lidar News in Your Inbox

Weekly updates on lidar tech, geospatial industry news, case studies, and product reviews.

About The Author

NV5 GeoAgent
NV5 GeoAgent

Recent 3D Modeling Posts

Gaussian Splatting Reveals Ancient Graffiti

A team of researchers from Stockholm University and the University…

March 6, 2026

3D Modeling the Lighthouse of Alexandria

After centuries on the Mediterranean seafloor, the Lighthouse of Alexandria…

March 2, 2026
Cintoo 360 Edition Brings Ricoh 360 Data into 3D Workflows

Cintoo 360 Edition Brings Ricoh 360 Data into 3D Workflows

New solution converts 360 video walkthroughs into 3D data to…

February 17, 2026
Q3D Sensing’s AuraGO™ pocket-sized 3D reality capture device

Q3D AuraGO, Pocket-Sized Long-Range Scanner

Q3D Sensing’s AuraGO™ Creates New Category of Compact Mobile Long-Range…

February 12, 2026
Photo of London Tower Bridge

London Tower 3D Model

Artist Krzysztof Szkoda recently transformed archival photography into an immersive…

February 5, 2026

Orthos + Rasters: Viewing and Sharing Made Simple

Orthos + Rasters: Viewing and Sharing Made Simple Raster data…

February 1, 2026

Popular Posts

Get Lidar News in Your Inbox

Weekly updates on lidar tech, geospatial industry news, case studies, and product reviews.