
Victor Jotham Ashioya

ashioyajotham

https://ashioyajotham.github.io/

AI & ML interests

Hallucination in LLMs, AI Safety: alignment, red-teaming

Recent Activity

updated a Space 5 days ago

ashioyajotham/medgemma-clinical-reasoning

published a Space 5 days ago

ashioyajotham/medgemma-clinical-reasoning

updated a collection 4 months ago

LLM Reasoning


Organizations

None yet

updated a Space 5 days ago

MedGemma Clinical Reasoning AI

🏥

published a Space 5 days ago

MedGemma Clinical Reasoning AI

🏥

updated a collection 4 months ago

LLM Reasoning

Collection

11 items • Updated Sep 17, 2025

upvoted a paper 4 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15, 2025 • 28


upvoted 3 papers 4 months ago

upvoted 2 papers 5 months ago

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Paper • 2508.03501 • Published Aug 5, 2025 • 59

Tool-integrated Reinforcement Learning for Repo Deep Search

Paper • 2508.03012 • Published Aug 5, 2025 • 20

liked a model 5 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.43M • 4.32k

updated a collection 5 months ago

LLM Reasoning

Collection

11 items • Updated Sep 17, 2025

upvoted a paper 5 months ago

UloRL: An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities

Paper • 2507.19766 • Published Jul 26, 2025 • 14

updated 2 collections 5 months ago

LLM Reasoning

Collection

11 items • Updated Sep 17, 2025

Scale

Collection

2 items • Updated Jul 30, 2025

updated a collection 8 months ago

LLM Reasoning

Collection

11 items • Updated Sep 17, 2025

upvoted a paper 8 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5, 2025 • 33

upvoted a paper 9 months ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89

liked a Space 11 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.25k

Generate high-quality text data for LLMs using FineWeb

updated a Space 11 months ago

Falcon 7b Coder

🌖
