Improving Hugging Face Training Efficiency Through Packing with Flash Attention about 1 month ago • 19
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4 Text Generation • Updated 7 days ago • 38.9k • 33
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 Text Generation • Updated about 1 month ago • 60.3k • 164