8 33 178

JingyeChen22

https://jingyechen.github.io

JingyeChen

AI & ML interests

OCR, Document Analysis, Text-to-X

Recent Activity

upvoted a paper 12 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

upvoted a paper 17 days ago

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

commented on a paper 3 months ago

DocReward: A Document Reward Model for Structuring and Stylizing

View all activity

Organizations

None yet

upvoted a paper 12 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Paper • 2512.20618 • Published 13 days ago • 53

upvoted a paper 17 days ago

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Paper • 2512.16924 • Published 18 days ago • 25

upvoted a paper 3 months ago

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13, 2025 • 27

upvoted 3 papers 4 months ago

upvoted a paper 5 months ago

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published Jul 28, 2025 • 31

upvoted 3 papers 6 months ago

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 58

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published Jul 10, 2025 • 33

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published Jun 30, 2025 • 37

upvoted 2 papers 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published May 26, 2025 • 18

upvoted 3 papers 9 months ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11, 2025 • 42

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8, 2025 • 64

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79

upvoted a paper about 1 year ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

upvoted a collection about 1 year ago

RoLoRA

Collection

[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization • 3 items • Updated Sep 26, 2024 • 3

upvoted 3 papers about 1 year ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

WonderJourney: Going from Anywhere to Everywhere

Paper • 2312.03884 • Published Dec 6, 2023 • 1

MM-Ego: Towards Building Egocentric Multimodal LLMs

Paper • 2410.07177 • Published Oct 9, 2024 • 22

JingyeChen22

AI & ML interests

Recent Activity

Organizations

JingyeChen22's activity