arxiv:2410.01744
Kaixin Ma
kaixinm
AI & ML interests
NLP, ML
Recent Activity
upvoted a paper 2 days ago
SO-Bench: A Structural Output Evaluation of Multimodal LLMs
upvoted a paper 12 days ago
NarrativeTrack: Evaluating Video Language Models Beyond the Frame
upvoted a paper over 1 year ago
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Organizations
None yet