Text Generation
Transformers
PyTorch
llama
text-generation-inference
Inference Endpoints
rcaulk commited on
Commit
37c9e5d
1 Parent(s): f34c0e4

Update config.json

Browse files

Llama2 allows 4096 context length, should be increased here.

Files changed (1) hide show
  1. config.json +1 -1
config.json CHANGED
@@ -9,7 +9,7 @@
9
  "hidden_size": 5120,
10
  "initializer_range": 0.02,
11
  "intermediate_size": 13824,
12
- "max_position_embeddings": 2048,
13
  "model_type": "llama",
14
  "num_attention_heads": 40,
15
  "num_hidden_layers": 40,
 
9
  "hidden_size": 5120,
10
  "initializer_range": 0.02,
11
  "intermediate_size": 13824,
12
+ "max_position_embeddings": 4096,
13
  "model_type": "llama",
14
  "num_attention_heads": 40,
15
  "num_hidden_layers": 40,