The maximum number of tokens to generate. Lower values return responses faster.
A decimal number that controls the degree of randomness in the response; higher values produce more varied output.
An alternative to sampling with temperature, where the model samples from the smallest set of tokens whose cumulative probability mass reaches top_p.
The top-k parameter limits the model's predictions to the top k most probable tokens at each step of generation.
The num_beams parameter controls beam search, a decoding method that affects the quality and diversity of generated text.
Template for formatting the prompt. Can be an arbitrary string, but must contain the substring `{prompt}`.
A system prompt sets the behavior and context for an AI assistant in a conversation, such as modifying its personality.
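To make the top-k and top-p parameters concrete, here is a minimal, self-contained sketch of how the two filters combine over a toy next-token distribution (plain Python, no SDK required; the vocabulary and probabilities are illustrative only, not actual model outputs):

```python
def top_k_top_p_filter(probs, k=50, p=0.95):
    """Keep only the top-k tokens, then the smallest prefix of those
    whose cumulative probability reaches p, and renormalize."""
    # Rank tokens from most to least probable, truncating to the top k.
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    kept, total = [], 0.0
    for token, prob in ranked:
        kept.append((token, prob))
        total += prob
        if total >= p:  # nucleus (top_p) cutoff reached
            break
    # Renormalize so the kept probabilities sum to 1.
    return {token: prob / total for token, prob in kept}

# Toy next-token distribution (illustrative values only).
probs = {"the": 0.5, "a": 0.3, "an": 0.15, "cat": 0.05}
filtered = top_k_top_p_filter(probs, k=3, p=0.9)
```

With `k=3` and `p=0.9`, the long-tail token `"cat"` is dropped and sampling proceeds over the renormalized remainder; lowering either parameter shrinks the candidate set further.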
Introduction
Llama-3.2-3B-Instruct SLM is part of Meta's Llama 3.2 collection of multilingual large language models (LLMs). This instruction-tuned model is optimized for a wide range of multilingual dialogue applications, including retrieval, summarization, and conversational agents. With 3.21 billion parameters, this model is tuned for tasks requiring natural language understanding and generation in multiple languages, outperforming many other models on industry benchmarks. It supports both text input and output in various languages, making it highly versatile for global applications.
Llama-3.2-11B-Vision-Instruct
Although the Llama-3.2 family also includes multimodal models such as the Llama-3.2-11B-Vision-Instruct, which handles text and image inputs, the Llama-3.2-3B-Instruct SLM is a text-only model. It specializes in multilingual natural language processing tasks, allowing developers to fine-tune it for text-based use cases that require high-quality, nuanced language understanding and generation.
Run Llama 3.2 with an API
Running the API with Clarifai's Python SDK
You can run the Llama 3.2 Model API using Clarifai’s Python SDK.
Export your PAT as an environment variable. Then, import and initialize the API Client.
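As a sketch of the export step, the Clarifai Python SDK reads the Personal Access Token from the `CLARIFAI_PAT` environment variable (the token value below is a placeholder; substitute your own PAT from the Clarifai portal):

```shell
# Placeholder value; replace with your own Personal Access Token.
export CLARIFAI_PAT="YOUR_PERSONAL_ACCESS_TOKEN"
```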
from clarifai.client.model import Model

prompt = "what's the future of AI?"
inference_params = dict(temperature=0.7, max_tokens=200, top_k=50, top_p=0.95)

# Model Predict
model_prediction = Model(
    "https://clarifai.com/meta/Llama-3/models/llama-3_2-3b-instruct"
).predict_by_bytes(prompt.encode(), input_type="text", inference_params=inference_params)

print(model_prediction.outputs[0].data.text.raw)
You can also run the Llama 3.2-3b-Instruct API using other Clarifai client libraries, such as Java, cURL, NodeJS, and PHP.
Aliases: Llama 3.2-3b-Instruct, llama 3.2
Use Cases
Llama-3.2-3B-Instruct SLM is suitable for a variety of use cases, especially in multilingual environments:
Multilingual Customer Support: Real-time responses to customer inquiries across multiple languages.
Summarization: Condensing large text bodies in multiple languages into concise, coherent summaries.
Knowledge Retrieval: Extracting relevant information from large text databases for research, customer service, or academic use.
Conversational Agents: Powering intelligent chatbots capable of interacting in multiple languages with coherent and contextually appropriate responses.
Content Rewriting and Translation: Rewriting or translating text content into different languages while maintaining the original meaning and style.
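As a minimal sketch of the summarization use case, the prompt template mechanism described above (an arbitrary string containing the `{prompt}` placeholder) can be filled in with plain Python string formatting; the template text and article below are illustrative only:

```python
# Illustrative template; any string containing "{prompt}" works.
template = "Summarize the following text in two sentences:\n\n{prompt}"

article = "Llama 3.2 is a collection of multilingual models released by Meta."

# Substitute the user's text into the placeholder to build the final prompt.
full_prompt = template.format(prompt=article)
```

The resulting `full_prompt` string is what would be sent to the model (for example, via `predict_by_bytes` as shown earlier), with the system prompt supplied separately if the use case calls for one.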
Evaluation and Benchmark Results
English Text Benchmarks (Base Pretrained Models)
Category | Benchmark | # Shots | Metric | Llama 3.2 1B | Llama 3.2 3B | Llama 3.1 8B
General | MMLU | 5 | macro_avg/acc_char | 32.2 | 58.0 | 66.7
General | AGIEval English | 3-5 | average/acc_char | 23.3 | 39.2 | 47.8
General | ARC-Challenge | 25 | acc_char | 32.8 | 69.1 | 79.7
Reading comprehension | SQuAD | 1 | em | 49.2 | 67.7 | 77.0
Reading comprehension | QuAC | 1 | f1 | 37.9 | 42.9 | 44.9
Reading comprehension | DROP | 3 | f1 | 28.0 | 45.2 | 59.5
Long Context | Needle in Haystack | 0 | em | 96.8 | 1.0 | 1.0
Dataset
Llama-3.2 models were trained on a new mix of publicly available online data. The text-only version of the model uses up to 9 trillion tokens across various languages, enabling it to excel in multilingual understanding. The dataset includes multilingual text and code, making the model proficient in generating both natural language and programming languages.
Advantages
Multilingual Support: Officially supports 8 languages, with the ability to fine-tune for more. This allows for extensive applicability in multilingual dialogue systems and global contexts.
Optimized for Instruction Following: Llama-3.2-3B-Instruct SLM excels at tasks involving clear instruction-following, making it useful for tasks like question-answering, summarization, and content rewriting.
High Performance on Industry Benchmarks: Outperforms several open-source and proprietary models on common benchmarks, demonstrating its high utility in both general and instruction-tuned scenarios.
Scalable: The 3.21B parameter architecture balances performance with scalability, making it ideal for developers and businesses looking for a high-performing yet efficient model.
Limitations
Knowledge Cutoff: As the training data is current up to December 2023, the model may not be aware of events or data after this period, limiting its use in real-time knowledge applications.
Multimodal Limitations: While the Llama-3.2 family includes multimodal models, the 3B-Instruct SLM is a text-only model, limiting its utility in applications requiring image or multimodal input.
Language Coverage: Though officially supporting 8 languages, performance may degrade for languages that fall outside this core set. Fine-tuning may be required for optimal performance in additional languages.
ID:
Model Type ID: Text To Text
Input Type: text
Output Type: text
Description: Llama-3.2-3B-Instruct SLM is a multilingual, instruction-tuned LLM optimized for dialogue and text generation.