NBoukachab
commited on
Commit
•
315757a
1
Parent(s):
6ec7ae5
Change files
Browse files- .gitattributes +1 -5
- README.md +22 -19
- meta.json +50 -0
- ner/cfg +18 -0
- model → ner/model +0 -0
- ner/moves +1 -0
- tokenizer +3 -3
- vectors +0 -3
- vocab/key2row +1 -0
- cfg → vocab/lookups.bin +2 -2
- moves → vocab/lookups_extra.bin +2 -2
- vocab/strings.json +0 -0
- vocab/vectors +0 -0
.gitattributes
CHANGED
@@ -32,8 +32,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
32 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
33 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
34 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
35 |
-
model filter=lfs diff=lfs merge=lfs -text
|
36 |
-
cfg filter=lfs diff=lfs merge=lfs -text
|
37 |
-
moves filter=lfs diff=lfs merge=lfs -text
|
38 |
-
tokenizer filter=lfs diff=lfs merge=lfs -text
|
39 |
-
vectors filter=lfs diff=lfs merge=lfs -text
|
|
|
32 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
33 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
34 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
35 |
+
model filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
README.md
CHANGED
@@ -5,28 +5,31 @@ tags:
|
|
5 |
- Spacy
|
6 |
language:
|
7 |
- 'lat'
|
|
|
|
|
8 |
---
|
9 |
|
10 |
## Model description
|
11 |
-
|
12 |
-
The model has been trained using the Spacy library on the [
|
13 |
-
|
14 |
# Cite us!
|
15 |
-
|
16 |
```bibtex
|
17 |
@inproceedings{10.1007/978-3-031-06555-2_29,
|
18 |
-
author = {Monroc, Claire Bizon and Miret, Blanche and Bonhomme, Marie-Laurence and Kermorvant, Christopher},
|
19 |
-
title = {A Comprehensive Study Of Open-Source Libraries For Named Entity Recognition On Handwritten Historical Documents},
|
20 |
-
year = {2022},
|
21 |
-
isbn = {978-3-031-06554-5},
|
22 |
-
publisher = {Springer-Verlag},
|
23 |
-
address = {Berlin, Heidelberg},
|
24 |
-
url = {https://doi.org/10.1007/978-3-031-06555-2_29},
|
25 |
-
doi = {10.1007/978-3-031-06555-2_29},
|
26 |
-
abstract = {In this paper, we propose an evaluation of several state-of-the-art open-source natural language processing (NLP) libraries for named entity recognition (NER) on handwritten historical documents: spaCy, Stanza and Flair. The comparison is carried out on three low-resource multilingual datasets of handwritten historical documents: HOME (a multilingual corpus of medieval charters), Balsac (a corpus of parish records from Quebec), and Esposalles (a corpus of marriage records in Catalan). We study the impact of the document recognition processes (text line detection and handwriting recognition) on the performance of the NER. We show that current off-the-shelf NER libraries yield state-of-the-art results, even on low-resource languages or multilingual documents using multilingual models. We show, in an end-to-end evaluation, that text line detection errors have a greater impact than handwriting recognition errors. Finally, we also report state-of-the-art results on the public Esposalles dataset.},
|
27 |
-
booktitle = {Document Analysis Systems: 15th IAPR International Workshop, DAS 2022, La Rochelle, France, May 22–25, 2022, Proceedings},
|
28 |
-
pages = {429–444},
|
29 |
-
numpages = {16},
|
30 |
-
keywords = {Text line detection, Named entity recognition, Handwritten historical documents},
|
31 |
-
location = {La Rochelle, France}
|
32 |
-
}
|
|
|
|
5 |
- Spacy
|
6 |
language:
|
7 |
- 'lat'
|
8 |
+
version:
|
9 |
+
- 'Spacy v2'
|
10 |
---
|
11 |
|
12 |
## Model description
|
13 |
+
|
14 |
+
The model has been trained using the Spacy v2 library on the [HOME-Alcar](https://zenodo.org/record/5600884) document annotations. The model is compatible with version 2.3.5 of Spacy and incompatible with versions 3.x.x
|
15 |
+
|
16 |
# Cite us!
|
17 |
+
|
18 |
```bibtex
|
19 |
@inproceedings{10.1007/978-3-031-06555-2_29,
|
20 |
+
author = {Monroc, Claire Bizon and Miret, Blanche and Bonhomme, Marie-Laurence and Kermorvant, Christopher},
|
21 |
+
title = {A Comprehensive Study Of Open-Source Libraries For Named Entity Recognition On Handwritten Historical Documents},
|
22 |
+
year = {2022},
|
23 |
+
isbn = {978-3-031-06554-5},
|
24 |
+
publisher = {Springer-Verlag},
|
25 |
+
address = {Berlin, Heidelberg},
|
26 |
+
url = {https://doi.org/10.1007/978-3-031-06555-2_29},
|
27 |
+
doi = {10.1007/978-3-031-06555-2_29},
|
28 |
+
abstract = {In this paper, we propose an evaluation of several state-of-the-art open-source natural language processing (NLP) libraries for named entity recognition (NER) on handwritten historical documents: spaCy, Stanza and Flair. The comparison is carried out on three low-resource multilingual datasets of handwritten historical documents: HOME (a multilingual corpus of medieval charters), Balsac (a corpus of parish records from Quebec), and Esposalles (a corpus of marriage records in Catalan). We study the impact of the document recognition processes (text line detection and handwriting recognition) on the performance of the NER. We show that current off-the-shelf NER libraries yield state-of-the-art results, even on low-resource languages or multilingual documents using multilingual models. We show, in an end-to-end evaluation, that text line detection errors have a greater impact than handwriting recognition errors. Finally, we also report state-of-the-art results on the public Esposalles dataset.},
|
29 |
+
booktitle = {Document Analysis Systems: 15th IAPR International Workshop, DAS 2022, La Rochelle, France, May 22–25, 2022, Proceedings},
|
30 |
+
pages = {429–444},
|
31 |
+
numpages = {16},
|
32 |
+
keywords = {Text line detection, Named entity recognition, Handwritten historical documents},
|
33 |
+
location = {La Rochelle, France}
|
34 |
+
}
|
35 |
+
```
|
meta.json
ADDED
@@ -0,0 +1,50 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"lang":"xx",
|
3 |
+
"pipeline":[
|
4 |
+
"ner"
|
5 |
+
],
|
6 |
+
"spacy_version":">=2.3.2",
|
7 |
+
"speed":{
|
8 |
+
"nwords":22118,
|
9 |
+
"cpu":14078.2482701708,
|
10 |
+
"gpu":36967.2712529745
|
11 |
+
},
|
12 |
+
"accuracy":{
|
13 |
+
"ents_f":81.7114729269,
|
14 |
+
"ents_p":82.5554705432,
|
15 |
+
"ents_r":80.8845577211,
|
16 |
+
"ents_per_type":{
|
17 |
+
"LOC":{
|
18 |
+
"p":93.4687953556,
|
19 |
+
"r":87.027027027,
|
20 |
+
"f":90.132960112
|
21 |
+
},
|
22 |
+
"PER":{
|
23 |
+
"p":71.1538461538,
|
24 |
+
"r":74.0,
|
25 |
+
"f":72.5490196078
|
26 |
+
},
|
27 |
+
"DAT":{
|
28 |
+
"p":60.8695652174,
|
29 |
+
"r":63.6363636364,
|
30 |
+
"f":62.2222222222
|
31 |
+
}
|
32 |
+
},
|
33 |
+
"token_acc":100.0
|
34 |
+
},
|
35 |
+
"vectors":{
|
36 |
+
"width":0,
|
37 |
+
"vectors":0,
|
38 |
+
"keys":0,
|
39 |
+
"name":"spacy_pretrained_vectors"
|
40 |
+
},
|
41 |
+
"name":"model0",
|
42 |
+
"version":"0.0.0",
|
43 |
+
"labels":{
|
44 |
+
"ner":[
|
45 |
+
"DAT",
|
46 |
+
"LOC",
|
47 |
+
"PER"
|
48 |
+
]
|
49 |
+
}
|
50 |
+
}
|
ner/cfg
ADDED
@@ -0,0 +1,18 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"beam_width":1,
|
3 |
+
"beam_density":0.0,
|
4 |
+
"beam_update_prob":1.0,
|
5 |
+
"cnn_maxout_pieces":3,
|
6 |
+
"nr_feature_tokens":6,
|
7 |
+
"nr_class":14,
|
8 |
+
"hidden_depth":1,
|
9 |
+
"token_vector_width":96,
|
10 |
+
"hidden_width":64,
|
11 |
+
"maxout_pieces":2,
|
12 |
+
"pretrained_vectors":null,
|
13 |
+
"bilstm_depth":0,
|
14 |
+
"self_attn_depth":0,
|
15 |
+
"conv_depth":4,
|
16 |
+
"conv_window":1,
|
17 |
+
"embed_size":2000
|
18 |
+
}
|
model → ner/model
RENAMED
File without changes
|
ner/moves
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
��movesٸ{"0":{},"1":{"PER":11554,"DAT":5976,"LOC":5662},"2":{"PER":11554,"DAT":5976,"LOC":5662},"3":{"PER":11554,"DAT":5976,"LOC":5662},"4":{"PER":11554,"DAT":5976,"LOC":5662,"":1},"5":{"":1}}
|
tokenizer
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
1 |
+
��prefix_search��^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^[¦©®°҂֍֎؎؏۞۩۽۾߶৺୰௳-௸௺౿൏൹༁-༃༓༕-༗༚-༟༴༶༸྾-࿅࿇-࿌࿎࿏࿕-࿘႞႟᎐-᎙᥀᧞-᧿᭡-᭪᭴-᭼℀℁℃-℆℈℉℔№℗℞-℣℥℧℩℮℺℻⅊⅌⅍⅏↊↋↕-↙↜-↟↡↢↤↥↧-↭↯-⇍⇐⇑⇓⇕-⇳⌀-⌇⌌-⌟⌢-⌨⌫-⍻⍽-⎚⎴-⏛⏢-␦⑀-⑊⒜-ⓩ─-▶▸-◀◂-◷☀-♮♰-❧➔-➿⠀-⣿⬀-⬯⭅⭆⭍-⭳⭶-⮕⮘-⯈⯊-⯾⳥-⳪⺀-⺙⺛-⻳⼀-⿕⿰-⿻〄〒〓〠〶〷〾〿㆐㆑㆖-㆟㇀-㇣㈀-㈞㈪-㉇㉐㉠-㉿㊊-㊰㋀-㋾㌀-㏿䷀-䷿꒐-꓆꠨-꠫꠶꠷꠹꩷-꩹﷽¦│■○�𐄷-𐄿𐅹-𐆉𐆌-𐆎𐆐-𐆛𐆠𐇐-𐇼𐡷𐡸𐫈𑜿𖬼-𖬿𖭅𛲜𝀀-𝃵𝄀-𝄦𝄩-𝅘𝅥𝅲𝅪-𝅬𝆃𝆄𝆌-𝆩𝆮-𝇨𝈀-𝉁𝉅𝌀-𝍖𝠀-𝧿𝨷-𝨺𝩭-𝩴𝩶-𝪃𝪅𝪆𞲬🀀-🀫🀰-🂓🂠-🂮🂱-🂿🃁-🃏🃑-🃵🄐-🅫🅰-🆬🇦-🈂🈐-🈻🉀-🉈🉐🉑🉠-🉥🌀-🏺🐀-🛔🛠-🛬🛰-🛹🜀-🝳🞀-🟘🠀-🠋🠐-🡇🡐-🡙🡠-🢇🢐-🢭🤀-🤋🤐-🤾🥀-🥰🥳-🥶🥺🥼-🦢🦰-🦹🧀-🧂🧐-🧿🩠-🩭]�suffix_search��…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[¦©®°҂֍֎؎؏۞۩۽۾߶৺୰௳-௸௺౿൏൹༁-༃༓༕-༗༚-༟༴༶༸྾-࿅࿇-࿌࿎࿏࿕-࿘႞႟᎐-᎙᥀᧞-᧿᭡-᭪᭴-᭼℀℁℃-℆℈℉℔№℗℞-℣℥℧℩℮℺℻⅊⅌⅍⅏↊↋↕-↙↜-↟↡↢↤↥↧-↭↯-⇍⇐⇑⇓⇕-⇳⌀-⌇⌌-⌟⌢-⌨⌫-⍻⍽-⎚⎴-⏛⏢-␦⑀-⑊⒜-ⓩ─-▶▸-◀◂-◷☀-♮♰-❧➔-➿⠀-⣿⬀-⬯⭅⭆⭍-⭳⭶-⮕⮘-⯈⯊-⯾⳥-⳪⺀-⺙⺛-⻳⼀-⿕⿰-⿻〄〒〓〠〶〷〾〿㆐㆑㆖-㆟㇀-㇣㈀-㈞㈪-㉇㉐㉠-㉿㊊-㊰㋀-㋾㌀-㏿䷀-䷿꒐-꓆꠨-꠫꠶꠷꠹꩷-꩹﷽¦│■○�𐄷-𐄿𐅹-𐆉𐆌-𐆎𐆐-𐆛𐆠𐇐-𐇼𐡷𐡸𐫈𑜿𖬼-𖬿𖭅𛲜𝀀-𝃵𝄀-𝄦𝄩-𝅘𝅥𝅲𝅪-𝅬𝆃𝆄𝆌-𝆩𝆮-𝇨𝈀-𝉁𝉅𝌀-𝍖𝠀-𝧿𝨷-𝨺𝩭-𝩴𝩶-𝪃𝪅𝪆𞲬🀀-🀫🀰-🂓🂠-🂮🂱-🂿🃁-🃏🃑-🃵🄐-🅫🅰-🆬🇦-🈂🈐-🈻🉀-🉈🉐🉑🉠-🉥🌀-🏺🐀-🛔🛠-🛬🛰-🛹🜀-🝳🞀-🟘🠀-🠋🠐-🡇🡐-🡙🡠-🢇🢐-🢭🤀-🤋🤐-🤾🥀-🥰🥳-🥶🥺🥼-🦢🦰-🦹🧀-🧂🧐-🧿🩠-🩭]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-za-zß-öø-ÿāăąćĉċčďđēĕėęěĝğġģĥħĩīĭįıijĵķĸĺļľŀłńņňʼnŋōŏőœŕŗřśŝşšţťŧũūŭůűųŵŷźżžſƀƃƅƈƌƍƒƕƙ-ƛƞơƣƥƨƪƫƭưƴƶƹƺƽ-ƿdžljnjǎǐǒǔǖǘǚǜǝǟǡǣǥǧǩǫǭǯǰdzǵǹǻǽǿȁȃȅȇȉȋȍȏȑȓȕȗșțȝȟȡȣȥȧȩȫȭȯȱȳ-ȹȼȿɀɂɇɉɋɍɏⱡⱥⱦⱨⱪⱬⱱⱳⱴⱶ-ⱻꜣꜥꜧꜩꜫꜭꜯ-ꜱꜳꜵꜷꜹꜻꜽꜿꝁꝃꝅꝇꝉꝋꝍꝏꝑꝓꝕꝗꝙꝛꝝꝟꝡꝣꝥꝧꝩꝫꝭꝯꝱ-ꝸꝺꝼꝿꞁꞃꞅꞇꞌꞎꞑꞓ-ꞕꞗꞙꞛꞝꞟꞡꞣꞥꞧꞩꞯꞵꞷꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḁḃḅḇḉḋḍḏḑḓḕḗḙḛḝḟḡḣḥḧḩḫḭḯḱḳḵḷḹḻḽḿṁṃṅṇṉṋṍṏṑṓṕṗṙṛṝṟṡṣṥṧṩṫṭṯṱṳṵṷṹṻṽṿẁẃẅẇẉẋẍẏẑẓẕ-ẝẟạảấầẩẫậắằẳẵặẹẻẽếềểễệỉịọỏốồổỗộớờởỡợụủứừửữựỳỵỷỹỻỽỿёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿���-︰-﹏🈀-丽-%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-ZA-ZÀ-ÖØ-ÞĀĂĄĆĈĊČĎĐĒĔĖĘĚĜĞĠĢĤĦĨĪĬĮİIJĴĶĹĻĽĿŁŃŅŇŊŌŎŐŒŔŖŘŚŜŞŠŢŤŦŨŪŬŮŰŲŴŶŸŹŻŽƁƂƄƆƇƉ-ƋƎ-ƑƓƔƖ-ƘƜƝƟƠƢƤƦƧƩƬƮƯƱ-ƳƵƷƸƼDŽLJNJǍǏǑǓǕǗǙǛǞǠǢǤǦǨǪǬǮDZǴǶ-ǸǺǼǾȀȂȄȆȈȊȌȎȐȒȔȖȘȚȜȞȠȢȤȦȨȪȬȮȰȲȺȻȽȾɁɃ-ɆɈɊɌɎⱠⱢ-ⱤⱧⱩⱫⱭ-ⱰⱲⱵⱾⱿꜢꜤꜦꜨꜪꜬꜮꜲꜴꜶꜸꜺꜼꜾꝀꝂꝄꝆꝈꝊꝌꝎꝐꝒꝔꝖꝘꝚꝜꝞꝠꝢꝤꝦꝨꝪꝬꝮꝹꝻꝽꝾꞀꞂꞄꞆꞋꞍꞐꞒꞖꞘꞚꞜꞞꞠꞢꞤꞦꞨꞪ-ꞮꞰ-ꞴꞶꞸḀḂḄḆḈḊḌḎḐḒḔḖḘḚḜḞḠḢḤḦḨḪḬḮḰḲḴḶḸḺḼḾṀṂṄṆṈṊṌṎṐṒṔṖṘṚṜṞṠṢṤṦṨṪṬṮṰṲṴṶṸṺṼṾẀẂẄẆẈẊẌẎẐẒẔẞẠẢẤẦẨẪẬẮẰẲẴẶẸẺẼẾỀỂỄỆỈỊỌỎỐỒỔỖỘỚỜỞỠỢỤỦỨỪỬỮỰỲỴỶỸỺỼỾЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-][A-ZA-ZÀ-ÖØ-ÞĀĂĄĆĈĊČĎĐĒĔĖĘĚĜĞĠĢĤĦĨĪĬĮİIJĴĶĹĻĽĿŁŃŅŇŊŌŎŐŒŔŖŘŚŜŞŠŢŤŦŨŪŬŮŰŲŴŶŸŹŻŽƁƂƄƆƇƉ-ƋƎ-ƑƓƔƖ-ƘƜƝƟƠƢƤƦƧƩƬƮƯƱ-ƳƵƷƸƼDŽLJNJǍǏǑǓǕǗǙǛǞǠǢǤǦǨǪǬǮDZǴǶ-ǸǺǼǾȀȂȄȆȈȊȌȎȐȒȔȖȘȚȜȞȠȢȤȦȨȪȬȮȰȲȺȻȽȾɁɃ-ɆɈɊɌɎⱠⱢ-ⱤⱧⱩⱫⱭ-ⱰⱲⱵⱾⱿꜢꜤꜦꜨꜪꜬꜮꜲꜴꜶꜸꜺꜼꜾꝀꝂꝄꝆꝈꝊꝌꝎꝐꝒꝔꝖꝘꝚꝜꝞꝠꝢꝤꝦꝨꝪꝬꝮꝹꝻꝽꝾꞀꞂꞄꞆꞋꞍꞐꞒꞖꞘꞚꞜꞞꞠꞢꞤꞦꞨꞪ-ꞮꞰ-ꞴꞶꞸḀḂḄḆḈḊḌḎḐḒḔḖḘḚḜḞḠḢḤḦḨḪḬḮḰḲḴḶḸḺḼḾṀṂṄṆṈṊṌṎṐṒṔṖṘṚṜṞṠṢṤṦṨṪṬṮṰṲṴṶṸṺṼṾẀẂẄẆẈẊẌẎẐẒẔẞẠẢẤẦẨẪẬẮẰẲẴẶẸẺẼẾỀỂỄỆỈỊỌỎỐỒỔỖỘỚỜỞỠỢỤỦỨỪỬỮỰỲỴỶỸỺỼỾЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-])\.$�infix_finditer�s\.\.+|…|[¦©®°҂֍֎؎؏۞۩۽۾߶৺୰௳-௸௺౿൏൹༁-༃༓༕-༗༚-༟༴༶༸྾-࿅࿇-࿌࿎࿏࿕-࿘႞႟᎐-᎙᥀᧞-᧿᭡-᭪᭴-᭼℀℁℃-℆℈℉℔№℗℞-℣℥℧℩℮℺℻⅊⅌⅍⅏↊↋↕-↙↜-↟↡↢↤↥↧-↭↯-⇍⇐⇑⇓⇕-⇳⌀-⌇⌌-⌟⌢-⌨⌫-⍻⍽-⎚⎴-⏛⏢-␦⑀-⑊⒜-ⓩ─-▶▸-◀◂-◷☀-♮♰-❧➔-➿⠀-⣿⬀-⬯⭅⭆⭍-⭳⭶-⮕⮘-⯈⯊-⯾⳥-⳪⺀-⺙⺛-⻳⼀-⿕⿰-⿻〄〒〓〠〶〷〾〿㆐㆑㆖-㆟㇀-㇣㈀-㈞㈪-㉇㉐㉠-㉿㊊-㊰㋀-㋾㌀-㏿䷀-䷿꒐-꓆꠨-꠫꠶꠷꠹꩷-꩹﷽¦│■○�𐄷-𐄿𐅹-𐆉𐆌-𐆎𐆐-𐆛𐆠𐇐-𐇼𐡷𐡸𐫈𑜿𖬼-𖬿𖭅𛲜𝀀-𝃵𝄀-𝄦𝄩-𝅘𝅥𝅲𝅪-𝅬𝆃𝆄𝆌-𝆩𝆮-𝇨𝈀-𝉁𝉅𝌀-𝍖𝠀-𝧿𝨷-𝨺𝩭-𝩴𝩶-𝪃𝪅𝪆𞲬🀀-🀫🀰-🂓🂠-🂮🂱-🂿🃁-🃏🃑-🃵🄐-🅫🅰-🆬🇦-🈂🈐-🈻🉀-🉈🉐🉑🉠-🉥🌀-🏺🐀-🛔🛠-🛬🛰-🛹🜀-🝳🞀-🟘🠀-🠋🠐-🡇🡐-🡙🡠-🢇🢐-🢭🤀-🤋🤐-🤾🥀-🥰🥳-🥶🥺🥼-🦢🦰-🦹🧀-🧂🧐-🧿🩠-🩭]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-za-zß-öø-ÿāăąćĉċčďđēĕėęěĝğġģĥħĩīĭįıijĵķĸĺļľŀłńņňʼnŋōŏőœŕŗřśŝşšţťŧũūŭůűųŵŷźżžſƀƃƅƈƌƍƒƕƙ-ƛƞơƣƥƨƪƫƭưƴƶƹƺƽ-ƿdžljnjǎǐǒǔǖǘǚǜǝǟǡǣǥǧǩǫǭǯǰdzǵǹǻǽǿȁȃȅȇȉȋȍȏȑȓȕȗșțȝȟȡȣȥȧȩȫȭȯȱȳ-ȹȼȿɀɂɇɉɋɍɏⱡⱥⱦⱨⱪⱬⱱⱳⱴⱶ-ⱻꜣꜥꜧꜩꜫꜭꜯ-ꜱꜳꜵꜷꜹꜻꜽꜿꝁꝃꝅꝇꝉꝋꝍꝏꝑꝓꝕꝗꝙꝛꝝꝟꝡꝣꝥꝧꝩꝫꝭꝯꝱ-ꝸꝺꝼꝿꞁꞃꞅꞇꞌꞎꞑꞓ-ꞕꞗꞙꞛꞝꞟꞡꞣꞥꞧꞩꞯꞵꞷꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḁḃḅḇḉḋḍḏḑḓḕḗḙḛḝḟḡḣḥḧḩḫḭḯḱḳḵḷḹḻḽḿṁṃṅṇṉṋṍṏṑṓṕṗṙṛṝṟṡṣṥṧṩṫṭṯṱṳṵṷṹṻṽṿẁẃẅẇẉẋẍẏẑẓẕ-ẝẟạảấầẩẫậắằẳẵặẹẻẽếềểễệỉịọỏốồổỗộớờởỡợụủứừ���ữựỳỵỷỹỻỽỿёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-ZA-ZÀ-ÖØ-ÞĀĂĄĆĈĊČĎĐĒĔĖĘĚĜĞĠĢĤĦĨĪĬĮİIJĴĶĹĻĽĿŁŃŅŇŊŌŎŐŒŔŖŘŚŜŞŠŢŤŦŨŪŬŮŰŲŴŶŸŹŻŽƁƂƄƆƇƉ-ƋƎ-ƑƓƔƖ-ƘƜƝƟƠƢƤƦƧƩƬƮƯƱ-ƳƵƷƸƼDŽLJNJǍǏǑǓǕǗǙǛǞǠǢǤǦǨǪǬǮDZǴǶ-ǸǺǼǾȀȂȄȆȈȊȌȎȐȒȔȖȘȚȜȞȠȢȤȦȨȪȬȮȰȲȺȻȽȾɁɃ-ɆɈɊɌɎⱠⱢ-ⱤⱧⱩⱫⱭ-ⱰⱲⱵⱾⱿꜢꜤꜦꜨꜪꜬꜮꜲꜴꜶꜸꜺꜼꜾꝀꝂꝄꝆꝈꝊꝌꝎꝐꝒꝔꝖꝘꝚꝜꝞꝠꝢꝤꝦꝨꝪꝬꝮꝹꝻꝽꝾꞀꞂꞄꞆꞋꞍꞐꞒꞖꞘꞚꞜꞞꞠꞢꞤꞦꞨꞪ-ꞮꞰ-ꞴꞶꞸḀḂḄḆḈḊḌḎḐḒḔḖḘḚḜḞḠḢḤḦḨḪḬḮḰḲḴḶḸḺḼḾṀṂṄṆṈṊṌṎṐṒṔṖṘṚṜṞṠṢṤṦṨṪṬṮṰṲṴṶṸṺṼṾẀẂẄẆẈẊẌẎẐẒẔẞẠẢẤẦẨẪẬẮẰẲẴẶẸẺẼẾỀỂỄỆỈỊỌỎỐỒỔỖỘỚỜỞỠỢỤỦỨỪỬỮỰỲỴỶỸỺỼỾЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿꜢ-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-]),(?=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿꜢ-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-])|(?<=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿꜢ-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-])(?:-|–|—|--|---|——|~)(?=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿꜢ-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-])|(?<=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿꜢ-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-0-9])[:<>=/](?=[A-Za-zA-Za-zÀ-ÖØ-öø-ÿĀ-ſƀ-ƿDŽ-ɏⱠ-ⱻⱾⱿ���-ꝯꝱ-ꞇꞋ-ꞎꞐ-ꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḀ-ỿёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-])�token_match��url_match� (?u)^(?:(?:[\w\+\-\.]{2,})://)?(?:\S+(?::\S*)?@)?(?:(?!(?:10|127)(?:\.\d{1,3}){3})(?!(?:169\.254|192\.168)(?:\.\d{1,3}){2})(?!172\.(?:1[6-9]|2\d|3[0-1])(?:\.\d{1,3}){2})(?:[1-9]\d?|1\d\d|2[01]\d|22[0-3])(?:\.(?:1?\d{1,2}|2[0-4]\d|25[0-5])){2}(?:\.(?:[1-9]\d?|1\d\d|2[0-4]\d|25[0-4]))|(?:(?:[A-Za-z0-9¡-][A-Za-z0-9¡-_-]{0,62})?[A-Za-z0-9¡-]\.)+(?:[a-za-zß-öø-ÿāăąćĉċčďđēĕėęěĝğġģĥħĩīĭįıijĵķĸĺļľŀłńņňʼnŋōŏőœŕŗřśŝşšţťŧũūŭůűųŵŷźżžſƀƃƅƈƌƍƒƕƙ-ƛƞơƣƥƨƪƫƭưƴƶƹƺƽ-ƿdžljnjǎǐǒǔǖǘǚǜǝǟǡǣǥǧǩǫǭǯǰdzǵǹǻǽǿȁȃȅȇȉȋȍȏȑȓȕȗșțȝȟȡȣȥȧȩȫȭȯȱȳ-ȹȼȿɀɂɇɉɋɍɏⱡⱥⱦⱨⱪⱬⱱⱳⱴⱶ-ⱻꜣꜥꜧꜩꜫꜭꜯ-ꜱꜳꜵꜷꜹꜻꜽꜿꝁꝃꝅꝇꝉꝋꝍꝏꝑꝓꝕꝗꝙꝛꝝꝟꝡꝣꝥꝧꝩꝫꝭꝯꝱ-ꝸꝺꝼꝿꞁꞃꞅꞇꞌꞎꞑꞓ-ꞕꞗꞙꞛꞝꞟꞡꞣꞥꞧꞩꞯꞵꞷꞹꟺꬰ-ꭚꭠ-ꭤɐ-ʯᴀ-ᴥᵫ-ᵷᵹ-ᶚḁḃḅḇḉḋḍḏḑḓḕḗḙḛḝḟḡḣḥḧḩḫḭḯḱḳḵḷḹḻḽḿṁṃṅṇṉṋṍṏṑṓṕṗṙṛṝṟṡṣṥṧṩṫṭṯṱṳṵṷṹṻṽṿẁẃẅẇẉẋẍẏẑẓẕ-ẝẟạảấầẩẫậắằẳẵặẹẻẽếềểễệỉịọỏốồổỗộớờởỡợụủứừửữựỳỵỷỹỻỽỿёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґঀ-֑-״יִ-ﭏؠ-يٮ-ەۥ-ۿݐ-ݿࢠ-ࢽﭐ-ﮱﯓ-ﴽﵐ-ﷇﷰ-ﷻﹰ-ﻼ𞸀-𞺻-ऀ-ॿಀ--ఀ-౿가-ᄀ-ᇿ一-拿挀-矿砀-賿贀-鿿㐀-䶿𠀀-𡗿𡘀-𣃿𣄀-𤗿𤘀-𦃿𦄀-𧗿𧘀-𩃿𩄀-𪛟𪜀-𫝀-𫠠-𬺰-⺀-⼀-⿰- -〿㇀-㈀-㋿㌀-㏿豈-︰-﹏🈀-丽-]{2,63}))(?::\d{2,5})?(?:[/?#]\S*)?$�exceptions� �� ��A� JgK�_SP�
|
2 |
+
��A�
|
3 |
+
JgK�_SP� ��A� JgK�_SP�")��A�")�'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�\")��A�\")�\n��A�\nJgK�_SP�\t��A�\tJgK�_SP�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�b.��A�b.�c.��A�c.�d.��A�d.�e.��A�e.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l.��A�l.�m.��A�m.�n.��A�n.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�q.��A�q.�r.��A�r.�s.��A�s.�t.��A�t.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� JgI� K�_SP�¯\(ツ)/¯��A�¯\(ツ)/¯�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
|
vectors
DELETED
@@ -1,3 +0,0 @@
|
|
1 |
-
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
|
3 |
-
size 128
|
|
|
|
|
|
|
|
vocab/key2row
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
�
|
cfg → vocab/lookups.bin
RENAMED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5bac7887f37e66e55721c09112b3ccbe56866fe1969146df08fc41cb86a4b18f
|
3 |
+
size 14
|
moves → vocab/lookups_extra.bin
RENAMED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:479d77559672b791f58e372a4e000da8efb92f443b6964d82684bbe2d324d28b
|
3 |
+
size 47
|
vocab/strings.json
ADDED
The diff for this file is too large to render.
See raw diff
|
|
vocab/vectors
ADDED
Binary file (128 Bytes). View file
|
|