Jamba-1.5 Collection The AI21 Jamba family of models consists of state-of-the-art, hybrid SSM-Transformer, instruction-following foundation models • 2 items • Updated Aug 22 • 80
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 602
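For context on the title: a ternary weight takes one of three values {-1, 0, +1}, i.e. log2(3) ≈ 1.58 bits of information per weight. Below is a minimal NumPy sketch (not the paper's code) of the absmean-style ternary quantization the paper describes; the function name and tensor shapes are illustrative.

```python
# Hedged sketch of ternary (1.58-bit) weight quantization, assuming the
# "absmean" scheme from the BitNet b1.58 paper: scale by the mean absolute
# value, round, then clip every weight to {-1, 0, +1}.
import numpy as np

def absmean_ternary_quantize(w: np.ndarray, eps: float = 1e-8):
    """Quantize a weight matrix to {-1, 0, +1} with a per-tensor scale."""
    scale = np.abs(w).mean() + eps              # gamma: mean absolute value
    w_q = np.clip(np.round(w / scale), -1, 1)   # round, then clip to ternary
    return w_q.astype(np.int8), scale           # dequantize later as w_q * scale

w = np.random.randn(4, 4).astype(np.float32)
w_q, scale = absmean_ternary_quantize(w)
print(w_q)            # entries are only -1, 0, or 1
print(np.log2(3))     # ≈ 1.58 bits of information per ternary weight
```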
Granite Code Models Collection A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base (pretrained) and instruct models. • 23 items • Updated 6 days ago • 176
Article Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU! By lyogavin • Apr 21 • 43
📀 Dataset comparison models Collection 1.8B-parameter models trained on 350B tokens to compare different pretraining datasets • 8 items • Updated Jun 12 • 30
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Paper • 2308.13137 • Published Aug 25, 2023 • 17
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers Paper • 2210.17323 • Published Oct 31, 2022 • 7
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 30
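As a quick reminder of the idea behind LoRA: the pretrained weight W is frozen and a low-rank update B @ A is learned, so the adapted layer computes (W + (alpha/r) · B @ A) x with only A and B trainable. A minimal NumPy sketch, with illustrative shapes and names (not the paper's reference implementation):

```python
# Hedged sketch of a LoRA-adapted linear layer: W stays frozen, only the
# low-rank factors A and B are trained. B is zero-initialized so the adapter
# starts as a no-op, as described in the LoRA paper.
import numpy as np

d_out, d_in, r, alpha = 64, 64, 8, 16
W = np.random.randn(d_out, d_in) * 0.02   # frozen pretrained weight
A = np.random.randn(r, d_in) * 0.01       # trainable, Gaussian init
B = np.zeros((d_out, r))                  # trainable, zero init => no change at start

def lora_forward(x: np.ndarray) -> np.ndarray:
    """Adapted layer: (W + (alpha / r) * B @ A) @ x, computed without forming the sum."""
    return W @ x + (alpha / r) * (B @ (A @ x))

x = np.random.randn(d_in)
print(lora_forward(x).shape)   # (64,)
```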
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning Paper • 2308.03526 • Published Aug 7, 2023 • 25
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization Paper • 2308.02151 • Published Aug 4, 2023 • 18