Min-Hung Chen

cmhungsteve

https://minhungchen.netlify.app/

AI & ML interests

Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning

Recent Activity

upvoted a paper 5 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 7 days ago • 23

upvoted 2 papers 6 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published Oct 22, 2025 • 30

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published 13 days ago • 18

upvoted 2 papers 11 days ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published 12 days ago • 30

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published 12 days ago • 29

upvoted a paper 12 days ago

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Paper • 2512.10927 • Published 25 days ago • 5

authored a paper 14 days ago

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published 18 days ago • 42

upvoted a paper 14 days ago

Generative Refocusing: Flexible Defocus Control from a Single Image

Paper • 2512.16923 • Published 18 days ago • 37

submitted a paper to Daily Papers 14 days ago

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published 18 days ago • 42

upvoted a paper 14 days ago

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published 18 days ago • 42

liked a model 18 days ago

nvidia/GR00T-N1.6-3B

Robotics • 3B • Updated 20 days ago • 9.8k • 19

authored a paper 19 days ago

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Paper • 2512.14273 • Published 20 days ago • 7

upvoted a paper 19 days ago

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in

Paper • 2512.14273 • Published 20 days ago • 7

upvoted a collection 26 days ago

Cosmos-Reason2

Collection

Cosmos Reason 2 is an open, customizable, reasoning vision language model (VLM) for physical AI and robotics • 14 items • Updated 11 days ago • 7

authored a paper about 1 month ago

BlurDM: A Blur Diffusion Model for Image Deblurring

Paper • 2512.03979 • Published Dec 3, 2025 • 3

upvoted a paper about 1 month ago

BlurDM: A Blur Diffusion Model for Image Deblurring

Paper • 2512.03979 • Published Dec 3, 2025 • 3

commented on a paper about 1 month ago

BlurDM: A Blur Diffusion Model for Image Deblurring

Paper • 2512.03979 • Published Dec 3, 2025 • 3

authored a paper about 2 months ago

VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models

Paper • 2511.07299 • Published Nov 10, 2025 • 5

upvoted a paper about 2 months ago

VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models

Paper • 2511.07299 • Published Nov 10, 2025 • 5

commented on a paper about 2 months ago

VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models

Paper • 2511.07299 • Published Nov 10, 2025 • 5
