Habana/roberta-large

Update README.md

#3

by astachowicz - opened Jun 25

base: refs/heads/main ← from: refs/pr/3

README.md +4 -2

@@ -24,7 +24,7 @@ The only difference is that there are a few new training arguments specific to H
 [Here](https://github.com/huggingface/optimum-habana/blob/main/examples/question-answering/run_qa.py) is a question-answering example script to fine-tune a model on SQuAD. You can run it with RoBERTa Large with the following command:
 ```bash
-python run_qa.py \
+PT_HPU_LAZY_MODE=0 python run_qa.py \
   --model_name_or_path roberta-large \
   --gaudi_config_name Habana/roberta-large \
   --dataset_name squad \
@@ -37,7 +37,9 @@ python run_qa.py \
   --max_seq_length 384 \
   --output_dir /tmp/squad/ \
   --use_habana \
-  --use_lazy_mode \
+  --torch_compile_backend hpu_backend \
+  --torch_compile \
+  --use_lazy_mode false \
   --throughput_warmup_steps 3 \
   --bf16
 ```
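
The updated command prefixes `python` with `PT_HPU_LAZY_MODE=0`. As a minimal sketch, assuming a POSIX-compatible shell, the snippet below illustrates that this prefix form sets the variable only for the one command it precedes and leaves the calling shell unchanged. The variable name and value are taken from the diff; the `inside` and `after` names are hypothetical helpers for illustration, and how the Habana software stack interprets the variable is not shown here.

```bash
# Start from a clean state so the result does not depend on the caller's environment.
unset PT_HPU_LAZY_MODE

# The prefix assignment is exported only into the environment of this one child process.
inside=$(PT_HPU_LAZY_MODE=0 sh -c 'printf "%s" "${PT_HPU_LAZY_MODE:-unset}"')

# Back in the calling shell, the variable was never set.
after=${PT_HPU_LAZY_MODE:-unset}

echo "inside the command: $inside"
echo "after the command:  $after"
```

To keep the setting for every later command in the same session, `export PT_HPU_LAZY_MODE=0` could be used instead, at the cost of affecting unrelated runs.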