Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
13.5
TFLOPS
37
35
64
Marc Sun
marcsun13
Follow
Chunte's profile picture
nbroad's profile picture
osanseviero's profile picture
87 followers
·
129 following
_marcsun
SunMarc
AI & ML interests
LLM, Quantization, Training, Inference
Articles
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
2 days ago
•
102
Accelerate 1.0.0
7 days ago
•
31
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
Jul 23
•
193
quanto: a pytorch quantization toolkit
Mar 18
•
28
Overview of natively supported quantization schemes in 🤗 Transformers
Sep 12, 2023
•
10
Making LLMs lighter with AutoGPTQ and transformers
Aug 23, 2023
•
25
Organizations
marcsun13
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
about 2 months ago
Upload folder using huggingface_hub
2
#4 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 2 months ago
Upload folder using huggingface_hub
2
#8 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
about 2 months ago
Update original/mp8/README.md
#2 opened about 2 months ago by
marcsun13
Update original/mp16/README.md
#1 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B
about 2 months ago
Update original/mp16/README.md
#5 opened about 2 months ago by
marcsun13
Update original/mp8/README.md
#4 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
about 2 months ago
Upload folder using huggingface_hub
2
#2 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 2 months ago
Upload folder using huggingface_hub
2
#7 opened about 2 months ago by
marcsun13
[WIP] Upload folder using huggingface_hub (multi-commit 015597a9a84fd3a9cd8c9844ceb2b85ce89bb1a387968fd94159cb19e4200044)
#6 opened about 2 months ago by
marcsun13
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
2 months ago
Upload folder using huggingface_hub
2
#4 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#3 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#2 opened 2 months ago by
marcsun13
Upload folder using huggingface_hub
2
#1 opened 2 months ago by
marcsun13
New activity in
google/flan-t5-xxl
6 months ago
ValueError: Need either a `state_dict` or a `save_folder` containing offloaded weights.
5
#53 opened about 1 year ago by
tuannguyends
New activity in
huggingface/documentation-images
6 months ago
Upload NousResearch-Llama-2-7b-hf_Perplexity.png
1
#292 opened 6 months ago by
marcsun13
Upload NousResearch-Llama-2-7b-hf_Perplexity.png
#291 opened 6 months ago by
marcsun13
New activity in
mlx-community/Llama-2-7b-chat-4-bit
9 months ago
Update README.md
1
#4 opened 9 months ago by
marcsun13
Update README.md
#3 opened 9 months ago by
marcsun13
New activity in
mlx-community/Mistral-7B-Instruct-v0.2-4-bit
9 months ago
Update README.md
#5 opened 9 months ago by
marcsun13
Update config.json
1
#4 opened 9 months ago by
marcsun13
New activity in
mistralai/Mixtral-8x7B-Instruct-v0.1
9 months ago
Intuition for quality decrease after quantization
4
#23 opened 9 months ago by
krumeto
New activity in
mistralai/Mistral-7B-v0.1
10 months ago
Adding `safetensors` variant of this model
2
#91 opened 10 months ago by
lcahill
New activity in
hf-accelerate/model-memory-usage
11 months ago
Llama-2 models don't work since they have auth token required. I have an auth token but it is not doisplaying
7
#16 opened 12 months ago by
sayambhu
Determining Minimum GPU Memory and Input Text Length Calculation in Model Training
2
#19 opened 11 months ago by
kobe8-24
New activity in
mistralai/Mistral-7B-v0.1
11 months ago
Does Mistral support accelerate library?
4
#65 opened 11 months ago by
Sp1der
New activity in
marcsun13/Llama-2-13B-AWQ
11 months ago
Update config.json
#1 opened 11 months ago by
ybelkada
New activity in
marcsun13/opt-125m-awq
11 months ago
Update config.json
#3 opened 11 months ago by
ybelkada
Update config.json
#2 opened 11 months ago by
ybelkada
Update config.json
#1 opened 11 months ago by
ybelkada
New activity in
huggingface/documentation-images
about 1 year ago
Upload A100_use_cache_True.jpg
#181 opened about 1 year ago by
marcsun13
add images to 163_overview-quantization-transformers
1
#180 opened about 1 year ago by
marcsun13
add overview-quantization-transformers blog images
#179 opened about 1 year ago by
marcsun13
add images for overview-quantization-transformers blog
1
#178 opened about 1 year ago by
marcsun13
overview-quantization-transformers blog images
#177 opened about 1 year ago by
marcsun13
add images to overview-quantization-transformers folder
#176 opened about 1 year ago by
marcsun13
New activity in
hf-accelerate/model-memory-usage
about 1 year ago
Add link to the access token
1
#5 opened about 1 year ago by
marcsun13
Model Memory Consumption of Llama-2 models, access granted
8
#4 opened about 1 year ago by
arkoi