SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision
Abstract
SpectralSplats addresses vanishing gradient issues in 3D Gaussian Splatting tracking by transforming the optimization objective to the frequency domain using spectral moments and implementing a frequency annealing schedule.
3D Gaussian Splatting (3DGS) enables real-time, photorealistic novel view synthesis, making it a highly attractive representation for model-based video tracking. However, leveraging the differentiability of the 3DGS renderer "in the wild" remains notoriously fragile. A fundamental bottleneck lies in the compact, local support of the Gaussian primitives. Standard photometric objectives implicitly rely on spatial overlap; if severe camera misalignment places the rendered object outside the target's local footprint, gradients strictly vanish, leaving the optimizer stranded. We introduce SpectralSplats, a robust tracking framework that resolves this "vanishing gradient" problem by shifting the optimization objective from the spatial to the frequency domain. By supervising the rendered image via a set of global complex sinusoidal features (Spectral Moments), we construct a global basin of attraction, ensuring that a valid, directional gradient toward the target exists across the entire image domain, even when pixel overlap is completely nonexistent. To harness this global basin without introducing periodic local minima associated with high frequencies, we derive a principled Frequency Annealing schedule from first principles, gracefully transitioning the optimizer from global convexity to precise spatial alignment. We demonstrate that SpectralSplats acts as a seamless, drop-in replacement for spatial losses across diverse deformation parameterizations (from MLPs to sparse control points), successfully recovering complex deformations even from severely misaligned initializations where standard appearance-based tracking catastrophically fails.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction (2026)
- Intrinsic Geometry-Appearance Consistency Optimization for Sparse-View Gaussian Splatting (2026)
- M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM (2026)
- UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images (2026)
- GVGS: Gaussian Visibility-Aware Multi-View Geometry for Accurate Surface Reconstruction (2026)
- VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm (2026)
- HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2603.24036 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper