Haiwen Diao

Paranioar

https://Paranioar.github.io/

AI & ML interests

Vision-and-Language, Parameter-efficient Transfer Learning, Multi-modal Large Language Model

Recent Activity

authored a paper 9 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 12 days ago • 61

commented on a paper 11 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 12 days ago • 61

upvoted a paper 11 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 12 days ago • 61

liked a dataset about 1 month ago

VLMEval/OpenVLMRecords

Updated Apr 8, 2025 • 3.27k • 12

upvoted 3 papers about 1 month ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 125

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published Nov 17, 2025 • 46

upvoted 2 papers about 2 months ago

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Paper • 2511.13648 • Published Nov 17, 2025 • 52

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published Nov 11, 2025 • 29

upvoted a paper 2 months ago

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28, 2025 • 40

updated 6 models 2 months ago

Paranioar/NEO1_0-2B-PT

Image-Text-to-Text • 3B • Updated Oct 21, 2025 • 9 • 1

Paranioar/NEO1_0-2B-MT

Image-Text-to-Text • 3B • Updated Oct 21, 2025 • 10

Paranioar/NEO1_0-2B-SFT

Image-Text-to-Text • 3B • Updated Oct 21, 2025 • 146 • 8

Paranioar/NEO1_0-9B-PT

Image-Text-to-Text • 10B • Updated Oct 21, 2025 • 4

Paranioar/NEO1_0-9B-MT

Image-Text-to-Text • 10B • Updated Oct 21, 2025 • 13

Paranioar/NEO1_0-9B-SFT

Image-Text-to-Text • 10B • Updated Oct 21, 2025 • 68 • 5

authored a paper 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 66
