Update README.md
name: Solution Exact Match
---

# Phi-3 Mini 4K Verbalized Rebus Solver - GGUF Q8_0 🇮🇹

This model is a parameter-efficient fine-tuned version of Phi-3 Mini 4K trained for verbalized rebus solving in Italian, as part of the [release](https://huggingface.co/collections/gsarti/verbalized-rebus-clic-it-2024-66ab8f11cb04e68bdf4fb028) for our paper [Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses](https://arxiv.org/abs/2408.00584). The task of verbalized rebus solving consists of converting an encrypted sequence of letters and crossword definitions into a solution phrase matching the word lengths specified in the solution key. An example is provided below.
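The "solution key" constraint can be pictured with a small check. The helper below is a hypothetical illustration, not part of the release code; it assumes the key is simply the list of expected word lengths for the solution phrase.

```python
def matches_key(phrase: str, key: list[int]) -> bool:
    """Return True iff the words of `phrase` have exactly the lengths in `key`."""
    return [len(word) for word in phrase.split()] == key
```

For instance, a two-word candidate with lengths 3 and 5 satisfies the key `[3, 5]` but not `[4, 4]`.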

The model was trained in 4-bit precision for 5070 steps on the verbalized subset of the [EurekaRebus](https://huggingface.co/datasets/gsarti/eureka-rebus) dataset using QLoRA via [Unsloth](https://github.com/unslothai/unsloth) and [TRL](https://github.com/huggingface/trl). This repository contains the GGUF export of the model checkpoint in `Q8_0` format, together with the `Modelfile` for usage with [Ollama](https://ollama.com/) (see below).
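As a rough picture of what `Q8_0` storage means, the sketch below quantizes weights into blocks of 32 int8 values with one scale per block and then dequantizes them. This is a simplified illustration of the general idea only (block size and layout assumed from the GGUF `Q8_0` scheme), not llama.cpp's actual implementation.

```python
import numpy as np

BLOCK = 32  # Q8_0 groups weights into blocks of 32 values

def q8_0_roundtrip(weights: np.ndarray) -> np.ndarray:
    """Quantize a 1-D float array to int8 blocks with a per-block scale,
    then dequantize, illustrating the precision loss of Q8_0-style storage."""
    out = np.empty_like(weights, dtype=np.float32)
    for i in range(0, len(weights), BLOCK):
        block = weights[i:i + BLOCK].astype(np.float32)
        scale = np.abs(block).max() / 127.0 or 1.0  # guard all-zero blocks
        q = np.round(block / scale).astype(np.int8)  # what gets stored
        out[i:i + BLOCK] = q.astype(np.float32) * scale  # reconstruction
    return out
```

The per-block reconstruction error is bounded by half of the block's scale, which is why `Q8_0` is considered a near-lossless quantization level.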

If you use this model in your work, please cite our paper as follows:

```bibtex
@article{sarti-etal-2024-rebus,
    title = "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses",
    author = "Sarti, Gabriele and Caselli, Tommaso and Nissim, Malvina and Bisazza, Arianna",
    journal = "ArXiv",
    month = jul,
    year = "2024",
    volume = {abs/2408.00584},
    url = {https://arxiv.org/abs/2408.00584},
}
```

## Acknowledgements