to read
updated
GenEx: Generating an Explorable World
Paper • 2412.09624 • Published • 97
Image-to-Video • Updated • 157 • 610
Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
Paper • 2412.06016 • Published • 20
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published • 108
Paper • 2412.15115 • Published • 376
Alibaba-NLP/gte-multilingual-mlm-base
Fill-Mask • 0.3B • Updated • 431 • 15
answerdotai/ModernBERT-large
Fill-Mask • 0.4B • Updated • 69.6k • 440
Parallelized Autoregressive Visual Generation
Paper • 2412.15119 • Published • 53
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Paper • 2412.15322 • Published • 20
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Paper • 2412.16112 • Published • 23
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 95
Fill-Mask • 2B • Updated • 372 • 63
"Principal Components" Enable A New Language of Images
Paper • 2503.08685 • Published • 12
Causal-Copilot: An Autonomous Causal Analysis Agent
Paper • 2504.13263 • Published • 7
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Paper • 2504.17192 • Published • 120
Vid2World: Crafting Video Diffusion Models to Interactive World Models
Paper • 2505.14357 • Published • 27
PixNerd: Pixel Neural Field Diffusion
Paper • 2507.23268 • Published • 51