37 35 64

Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Articles

Organizations

marcsun13's activity

New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 2 months ago

Upload folder using huggingface_hub

#4 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 2 months ago

Upload folder using huggingface_hub

#8 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B-Instruct about 2 months ago

Update original/mp8/README.md

#2 opened about 2 months ago by

marcsun13

Update original/mp16/README.md

#1 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B about 2 months ago

Update original/mp16/README.md

#5 opened about 2 months ago by

marcsun13

Update original/mp8/README.md

#4 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 2 months ago

Upload folder using huggingface_hub

#2 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 2 months ago

Upload folder using huggingface_hub

#7 opened about 2 months ago by

marcsun13

[WIP] Upload folder using huggingface_hub (multi-commit 015597a9a84fd3a9cd8c9844ceb2b85ce89bb1a387968fd94159cb19e4200044)

#6 opened about 2 months ago by

marcsun13

New activity in meta-llama/Meta-Llama-3.1-405B-FP8 2 months ago

Upload folder using huggingface_hub

#4 opened 2 months ago by

marcsun13

Upload folder using huggingface_hub

#3 opened 2 months ago by

marcsun13

Upload folder using huggingface_hub

#2 opened 2 months ago by

marcsun13

Upload folder using huggingface_hub

#1 opened 2 months ago by

marcsun13

New activity in google/flan-t5-xxl 6 months ago

ValueError: Need either a `state_dict` or a `save_folder` containing offloaded weights.

#53 opened about 1 year ago by

tuannguyends

New activity in huggingface/documentation-images 6 months ago

Upload NousResearch-Llama-2-7b-hf_Perplexity.png

#292 opened 6 months ago by

marcsun13

Upload NousResearch-Llama-2-7b-hf_Perplexity.png

#291 opened 6 months ago by

marcsun13

New activity in mlx-community/Llama-2-7b-chat-4-bit 9 months ago

Update README.md

#4 opened 9 months ago by

marcsun13

Update README.md

#3 opened 9 months ago by

marcsun13

New activity in mlx-community/Mistral-7B-Instruct-v0.2-4-bit 9 months ago

Update README.md

#5 opened 9 months ago by

marcsun13

Update config.json

#4 opened 9 months ago by

marcsun13

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 9 months ago

Intuition for quality decrease after quantization

#23 opened 9 months ago by

krumeto

New activity in mistralai/Mistral-7B-v0.1 10 months ago

Adding `safetensors` variant of this model

#91 opened 10 months ago by

lcahill

New activity in hf-accelerate/model-memory-usage 11 months ago

Llama-2 models don't work since they have auth token required. I have an auth token but it is not doisplaying

#16 opened 12 months ago by

sayambhu

Determining Minimum GPU Memory and Input Text Length Calculation in Model Training

#19 opened 11 months ago by

kobe8-24

New activity in mistralai/Mistral-7B-v0.1 11 months ago

Does Mistral support accelerate library?

#65 opened 11 months ago by

Sp1der

New activity in marcsun13/Llama-2-13B-AWQ 11 months ago

Update config.json

#1 opened 11 months ago by

ybelkada

New activity in marcsun13/opt-125m-awq 11 months ago

Update config.json

#3 opened 11 months ago by

ybelkada

Update config.json

#2 opened 11 months ago by

ybelkada

Update config.json

#1 opened 11 months ago by

ybelkada

New activity in huggingface/documentation-images about 1 year ago

Upload A100_use_cache_True.jpg

#181 opened about 1 year ago by

marcsun13

add images to 163_overview-quantization-transformers

#180 opened about 1 year ago by

marcsun13

add overview-quantization-transformers blog images

#179 opened about 1 year ago by

marcsun13

add images for overview-quantization-transformers blog

#178 opened about 1 year ago by

marcsun13

overview-quantization-transformers blog images

#177 opened about 1 year ago by

marcsun13

add images to overview-quantization-transformers folder

#176 opened about 1 year ago by

marcsun13

New activity in hf-accelerate/model-memory-usage about 1 year ago

Add link to the access token

#5 opened about 1 year ago by

marcsun13

Model Memory Consumption of Llama-2 models, access granted

#4 opened about 1 year ago by

arkoi

Marc Sun

AI & ML interests

Articles

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Accelerate 1.0.0

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

quanto: a pytorch quantization toolkit

Overview of natively supported quantization schemes in 🤗 Transformers

Making LLMs lighter with AutoGPTQ and transformers

Organizations

marcsun13's activity

Upload folder using huggingface_hub

Upload folder using huggingface_hub

Update original/mp8/README.md

Update original/mp16/README.md

Update original/mp16/README.md

Update original/mp8/README.md

Upload folder using huggingface_hub

Upload folder using huggingface_hub

[WIP] Upload folder using huggingface_hub (multi-commit 015597a9a84fd3a9cd8c9844ceb2b85ce89bb1a387968fd94159cb19e4200044)

Upload folder using huggingface_hub

Upload folder using huggingface_hub

Upload folder using huggingface_hub

Upload folder using huggingface_hub

ValueError: Need either a `state_dict` or a `save_folder` containing offloaded weights.

Upload NousResearch-Llama-2-7b-hf_Perplexity.png

Upload NousResearch-Llama-2-7b-hf_Perplexity.png

Update README.md

Update README.md

Update README.md

Update config.json

Intuition for quality decrease after quantization

Adding `safetensors` variant of this model

Llama-2 models don't work since they have auth token required. I have an auth token but it is not doisplaying

Determining Minimum GPU Memory and Input Text Length Calculation in Model Training

Does Mistral support accelerate library?

Update config.json

Update config.json

Update config.json

Update config.json

Upload A100_use_cache_True.jpg

add images to 163_overview-quantization-transformers

add overview-quantization-transformers blog images

add images for overview-quantization-transformers blog

overview-quantization-transformers blog images

add images to overview-quantization-transformers folder

Add link to the access token

Model Memory Consumption of Llama-2 models, access granted