iseesaw

Follow

🎯

Focusing

Kaiyan Zhang iseesaw

🎯

Focusing

Follow

PhD Candidate at Tsinghua University.

82 followers · 14 following

@TsinghuaC3I
Beijing
14:14 (UTC +08:00)
https://iseesaw.github.io
@OkhayIea

Achievements

Achievements

Organizations

iseesaw/README.md

Hi there 👋

For more information, please visit my homepage.

Pinned Loading

TsinghuaC3I/Awesome-RL-for-LRMs TsinghuaC3I/Awesome-RL-for-LRMs Public

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2.1k 118
PRIME-RL/TTRL PRIME-RL/TTRL Public

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

Python 916 65
TsinghuaC3I/MARTI TsinghuaC3I/MARTI Public

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 370 42
TsinghuaC3I/SSRL TsinghuaC3I/SSRL Public

SSRL: Self-Search Reinforcement Learning

Python 157 6
TsinghuaC3I/UltraMedical TsinghuaC3I/UltraMedical Public

[NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine

Python 94 4
TsinghuaC3I/Awesome-Memory-for-Agents TsinghuaC3I/Awesome-Memory-for-Agents Public

A Collection of Papers about Memory for Language Agents

179 2