sunblaze-ucb

https://github.com/sunblaze-ucb

Recent Activity

Xuandong

authored a paper about 4 hours ago

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Paper • 2601.00575 • Published 4 days ago • 1

LEAFERx

updated a dataset about 7 hours ago

sunblaze-ucb/verina

Viewer • Updated about 7 hours ago • 189 • 228 • 6

Xuandong

submitted a paper to Daily Papers about 13 hours ago

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Paper • 2601.00575 • Published 4 days ago • 1

dylanx26

updated a dataset 16 days ago

sunblaze-ucb/AgentSynth

Viewer • Updated Sep 1, 2025 • 1.21k • 76 • 5

Dongwei

authored a paper 5 months ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13, 2025 • 53

Xuandong

authored a paper 7 months ago

Learning to Reason without External Rewards

Paper • 2505.19590 • Published May 26, 2025 • 29

stneng

authored a paper 9 months ago

Progent: Programmable Privilege Control for LLM Agents

Paper • 2504.11703 • Published Apr 16, 2025 • 6

Xuandong

authored a paper 9 months ago

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs

Paper • 2504.04715 • Published Apr 7, 2025 • 13

Xuandong

authored a paper about 1 year ago

Multimodal Situational Safety

Paper • 2410.06172 • Published Oct 8, 2024 • 12

Dongwei

authored 3 papers over 1 year ago

Benchmarking Language Model Creativity: A Case Study on Code Generation

Paper • 2407.09007 • Published Jul 12, 2024 • 4

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1, 2024 • 35

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 39

Xuandong

authored 4 papers almost 2 years ago

Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective Distillation

Paper • 2203.07687 • Published Mar 15, 2022

Protecting Language Generation Models via Invisible Watermarking

Paper • 2302.03162 • Published Feb 6, 2023

Provable Robust Watermarking for AI-Generated Text

Paper • 2306.17439 • Published Jun 30, 2023

Weak-to-Strong Jailbreaking on Large Language Models

Paper • 2401.17256 • Published Jan 30, 2024 • 16