infCapital
/

llama2-7b-chatvi

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

hungeni commited on Sep 29, 2023

Commit

d25a484

•

1 Parent(s): 1b85f85

Update README.md

Files changed (1) hide show

README.md +7 -3

README.md CHANGED Viewed

@@ -5,6 +5,10 @@ language:
 - vi
 ---
-Base Model: LLaMa2 7B Chat HF
-+ Continual Pre-Train with 2B tokens Vietnamese
-+ Trainning profile: LoRa

 - vi
 ---
+## Base Model: LLaMa2 7B Chat HF
++ Extend vocab to 44,800 for better Vietnamese understanding
++ Continual Pre-Train with >2B tokens Vietnamese
++ Trainning profile: LoRa (rank=32, alpha=128, 16fp), 1 epoch, block size = 512. Takes 300GPU Hours x RXT4090 24GB
+## Can be better use for
++ Futher training / Fine-tuning for Vietnamese tasks