
LLaMA 3.2 3B Instruct

LLaMA 3.2 3B Instruct is a multilingual instruction-tuned language model with 3.21 billion parameters. Designed for diverse multilingual dialogue and summarization tasks, it offers effective performance on a range of NLP benchmarks.

Model Information

  • Name: LLaMA 3.2 3B Instruct
  • Parameter Size: 3B (3.21B)
  • Model Family: LLaMA 3.2
  • Architecture: Auto-regressive Transformer with Grouped-Query Attention (GQA)
  • Purpose: Multilingual dialogue generation, text generation, and summarization.
  • Training Data: A mix of publicly available multilingual data, covering up to 9T tokens.
  • Supported Languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
  • Release Date: September 25, 2024
  • Context Length: 128k tokens
  • Knowledge Cutoff: December 2023

Quantized Model Files

  • Available Formats:
    • ggml-model-q8_0.gguf: 8-bit quantization for resource efficiency and good performance.
    • ggml-model-f16.gguf: Half-precision (16-bit) floating-point format for enhanced precision.
  • Quantization Library: llama.cpp
  • Use Cases: Multilingual dialogue, summarization, and text generation.
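
The GGUF files can be run directly with llama.cpp or its Python bindings. The snippet below is a minimal sketch using llama-cpp-python; the local file path, context size, and prompt are assumptions for illustration, not part of this repository.

from llama_cpp import Llama

# Load the 8-bit quantized file (path assumes it has been downloaded locally).
llm = Llama(
    model_path="./ggml-model-q8_0.gguf",
    n_ctx=8192,  # working context; the model supports up to 128k tokens
)

# Chat-style completion using the chat template embedded in the GGUF metadata.
response = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a helpful multilingual assistant."},
        {"role": "user", "content": "Summarize this model card in two sentences."},
    ],
    max_tokens=256,
)
print(response["choices"][0]["message"]["content"])

The 16-bit file can be loaded the same way by swapping the model_path; it trades higher memory use for slightly better output fidelity.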

Core Library

LLaMA 3.2 3B Instruct can be deployed using llama.cpp or transformers, with a focus on streamlined integration into the Hugging Face ecosystem.

  • Primary Framework: llama.cpp
  • Alternate Frameworks:
    • transformers for Hugging Face model support.
    • vLLM for optimized inference and low-latency deployments.
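
For workflows outside llama.cpp, the upstream (non-quantized) weights can be loaded with transformers. The sketch below assumes the meta-llama/Llama-3.2-3B-Instruct model ID and standard generation settings; the GGUF files in this repository are intended for llama.cpp rather than transformers.

import torch
from transformers import pipeline

# Chat-style text generation with the upstream instruct model (assumed model ID).
pipe = pipeline(
    "text-generation",
    model="meta-llama/Llama-3.2-3B-Instruct",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Translate 'good morning' into German, French, and Thai."},
]
outputs = pipe(messages, max_new_tokens=128)
print(outputs[0]["generated_text"][-1]["content"])

vLLM exposes a similar chat interface (and an OpenAI-compatible server) when low-latency, high-throughput serving is needed.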

Safety and Responsible Use

LLaMA 3.2 3B has been designed with safety in mind but may still produce biased, harmful, or unpredictable outputs, especially for languages with limited coverage or for certain prompts.

  • Testing and Risk Assessment: Initial testing has primarily focused on English; coverage for other languages is ongoing.
  • Limitations: LLaMA 3.2 may not fully adhere to user instructions or safety guidelines, and may exhibit unexpected behaviors.
  • Responsible Use Guidelines: Refer to the Responsible Use Guide for more details.