Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 21 days ago • 83
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 24 days ago • 100
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published 30 days ago • 18
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29, 2025 • 43
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding Paper • 2503.10596 • Published Mar 13, 2025 • 18
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models Paper • 2503.08686 • Published Mar 11, 2025 • 19
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning Paper • 2503.07608 • Published Mar 10, 2025 • 23
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18, 2025 • 38
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published Feb 18, 2025 • 38
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Paper • 2501.01423 • Published Jan 2, 2025 • 44
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Paper • 2411.15139 • Published Nov 22, 2024 • 15