kuotient (Jisoo Kim)

upvoted 2 papers about 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 134

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 73

upvoted an article 2 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Aug 26

• 34

upvoted a paper 2 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27 • 36

upvoted 2 articles 4 months ago

Article

The Rise of Agentic Data Generation

Jul 15

• 75

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 100

upvoted a paper 4 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 155

upvoted an article 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

upvoted a paper 5 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

upvoted a collection 5 months ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Sep 18 • 346

upvoted a collection 6 months ago

Alpha Llama-3 collection

Collection

5 items • Updated Jun 20 • 2

upvoted an article 7 months ago

Article

Can We Train Chat Models with Raw Data?

Apr 25

• 17

upvoted a paper 8 months ago

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23
