rmihaylov
/

bert-base-theseus-bg

Model card Files Files and versions Community

rmihaylov commited on Apr 17, 2022

Commit

a942e66

•

1 Parent(s): bc6adbe

Update README.md

Files changed (1) hide show

README.md +35 -4

README.md CHANGED Viewed

@@ -20,9 +20,40 @@ between bulgarian and Bulgarian. The training data is Bulgarian text from [OSCAR
 The model was compressed via [progressive module replacing](https://arxiv.org/abs/2002.02925).
-## Intended uses & limitations
-You can use the raw model for:
-- fill-mask task
-Or fine-tune it to a downstream task.

 The model was compressed via [progressive module replacing](https://arxiv.org/abs/2002.02925).
+### How to use
+Here is how to use this model in PyTorch:
+```python
+>>> from transformers import pipeline
+>>>
+>>> model = pipeline(
+>>>     'fill-mask',
+>>>     model='rmihaylov/bert-base-theseus-bg',
+>>>     tokenizer='rmihaylov/bert-base-theseus-bg',
+>>>     device=0,
+>>>     revision=None)
+>>> output = model("София е [MASK] на България.")
+>>> print(output)
+[{'score': 0.1586454212665558,
+  'sequence': 'София е столица на България.',
+  'token': 76074,
+  'token_str': 'столица'},
+ {'score': 0.12992817163467407,
+  'sequence': 'София е  столица на България.',
+  'token': 2659,
+  'token_str': 'столица'},
+ {'score': 0.06064048036932945,
+  'sequence': 'София е Перлата на България.',
+  'token': 102146,
+  'token_str': 'Перлата'},
+ {'score': 0.034687548875808716,
+  'sequence': 'София е представителката на България.',
+  'token': 105456,
+  'token_str': 'представителката'},
+ {'score': 0.03053216263651848,
+  'sequence': 'София е присъединяването на България.',
+  'token': 18749,
+  'token_str': 'присъединяването'}]
+```