varuy322 (wangrui)

upvoted a paper 19 days ago

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published 22 days ago • 47

upvoted an article 20 days ago

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 59

upvoted 2 papers about 1 month ago

Erasing Conceptual Knowledge from Language Models

Paper • 2410.02760 • Published Oct 3 • 12

LML: Language Model Learning a Dataset for Data-Augmented Prediction

Paper • 2409.18957 • Published Sep 27 • 9

upvoted 2 collections about 2 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 17 days ago • 453

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 15 items • Updated 8 days ago • 73

upvoted an article about 2 months ago

Article

An Introduction to Deep Reinforcement Learning

May 4, 2022

• 2

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 258

upvoted a paper 5 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

upvoted an article 5 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 65

upvoted 3 collections 5 months ago

upvoted an article 6 months ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 114

upvoted an article 7 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 94

wangrui

AI & ML interests

Organizations

varuy322's activity

Baichuan Alignment Technical Report

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Erasing Conceptual Knowledge from Language Models

LML: Language Model Learning a Dataset for Data-Augmented Prediction

Llama 3.2

LLM Reasoning Papers

An Introduction to Deep Reinforcement Learning

SmolLM - blazingly fast and remarkably powerful

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Qwen2

LLMs

📀 Dataset comparison models

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Illustrating Reinforcement Learning from Human Feedback (RLHF)