32k or 128k?

by ChuckMcSneed - opened Jul 24

Jul 24

config.json says "max_position_embeddings": 32768, and readme says 128k

ckg

Jul 24

+1
To run the model at 128k, should we extend the max_position_embeddings or is there some sort of RoPE Scaling configuration that we should apply?

JinChe

Jul 25

+1
Currently only support 32k with vllm deployment test.

Jul 25

Dear developers. Please clarify. In your blog you write about 128k, but the model says “max_position_embeddings”: 32768.
What to believe?

Mistral AI_ org Jul 25

ChuckMcSneed changed discussion status to closed Jul 25

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment