Text Classification
fastText
language-identification
Commit 564c73c · 0 Parent(s)
Committed by lopatnov and kargaranamir

Duplicate from cis-lmu/glotlid

Co-authored-by: Amir Hossein Kargaran <kargaranamir@users.noreply.huggingface.co>

Files changed (7)
  1. .gitattributes +35 -0
  2. LICENSE +253 -0
  3. README.md +2253 -0
  4. model.bin +3 -0
  5. model_v1.bin +3 -0
  6. model_v2.bin +3 -0
  7. model_v3.bin +3 -0
.gitattributes ADDED
@@ -0,0 +1,35 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
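
Note on the rules above: the `*.bin` pattern is what routes the four model files in this commit through Git LFS. A rough way to check which of the changed files these patterns would catch is sketched below in Python. It is only an approximation: it applies `fnmatch`-style globbing to the base name rather than Git's own attribute-matching rules, it copies only a handful of the patterns, and the names `LFS_PATTERNS`, `COMMIT_FILES`, and `is_lfs_tracked` are made up for illustration.

```python
from fnmatch import fnmatchcase

# A few of the patterns from the .gitattributes diff (not the full list).
LFS_PATTERNS = ["*.bin", "*.ftz", "*.safetensors", "*.tar.*", "*tfevents*"]

# Files touched by this commit, as listed under "Files changed".
COMMIT_FILES = [
    ".gitattributes",
    "LICENSE",
    "README.md",
    "model.bin",
    "model_v1.bin",
    "model_v2.bin",
    "model_v3.bin",
]


def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Approximate check: glob-match the file's base name against each pattern.

    Assumption: this ignores directory-anchored rules such as
    `saved_model/**/*`, which Git handles with its own matching logic.
    """
    name = path.rsplit("/", 1)[-1]
    return any(fnmatchcase(name, pattern) for pattern in patterns)


lfs_files = [f for f in COMMIT_FILES if is_lfs_tracked(f)]
print(lfs_files)
```

Under these assumptions the sketch selects only the four `model*.bin` files, which fits their `+3 -0` entries in the file list, since a Git LFS pointer file is three lines long.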
LICENSE ADDED
@@ -0,0 +1,253 @@
+ Apache License 2.0 plus notices
+
+ ---
+
+ ### Additional Notice on Training Data and Model Behavior
+
+ This software includes a machine learning model for language identification
+ based on statistical representations (e.g., character n-grams and linear classification).
+
+ The model was trained on a mixture of publicly available data sources, which may
+ include materials subject to copyright or other rights.
+
+ The model does not store or encode expressive text content. Instead, it represents
+ aggregated statistical patterns (such as n-gram frequency weights) that are not
+ intended to reproduce original source material.
+
+ No training data is included or distributed with this software.
+
+ Due to its design, this model performs classification only and is not capable of
+ generating or reconstructing source texts.
+
+ ---
+
+ ### Good-Faith Notice
+
+ If you are a rights holder and believe that your content has been included in the
+ training data and wish to request its exclusion from future versions, please contact
+ the authors. Reasonable efforts will be made to review and address such requests.
+
+ ---
+
+ ### Limitation and Responsibility
+
+ No representation or warranty is made regarding the licensing status of the
+ underlying training data.
+
+ Users are solely responsible for ensuring that their use of this software complies
+ with applicable laws and licensing requirements.
+
+ ---
+
+ ### Disclaimer
+
+ Unless required by applicable law or agreed to in writing, this software is
+ provided "AS IS", WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express
+ or implied, including, without limitation, any warranties of TITLE,
+ NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE.
+
+ ---
+
+ ### Full Apache License Text
+
+ Apache License
+ Version 2.0, January 2004
+ http://www.apache.org/licenses/
+
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
+
+ 1. Definitions.
+
+ "License" shall mean the terms and conditions for use, reproduction,
+ and distribution as defined by Sections 1 through 9 of this document.
+
+ "Licensor" shall mean the copyright owner or entity authorized by
+ the copyright owner that is granting the License.
+
+ "Legal Entity" shall mean the union of the acting entity and all
+ other entities that control, are controlled by, or are under common
+ control with that entity. For the purposes of this definition,
+ "control" means (i) the power, direct or indirect, to cause the
+ direction or management of such entity, whether by contract or
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
+ outstanding shares, or (iii) beneficial ownership of such entity.
+
+ "You" (or "Your") shall mean an individual or Legal Entity
+ exercising permissions granted by this License.
+
+ "Source" form shall mean the preferred form for making modifications,
+ including but not limited to software source code, documentation
+ source, and configuration files.
+
+ "Object" form shall mean any form resulting from mechanical
+ transformation or translation of a Source form, including but
+ not limited to compiled object code, generated documentation,
+ and conversions to other media types.
+
+ "Work" shall mean the work of authorship, whether in Source or
+ Object form, made available under the License, as indicated by a
+ copyright notice that is included in or attached to the work
+ (an example is provided in the Appendix below).
+
+ "Derivative Works" shall mean any work, whether in Source or Object
+ form, that is based on (or derived from) the Work and for which the
+ editorial revisions, annotations, elaborations, or other modifications
+ represent, as a whole, an original work of authorship. For the purposes
+ of this License, Derivative Works shall not include works that remain
+ separable from, or merely link (or bind by name) to the interfaces of,
+ the Work and Derivative Works thereof.
+
+ "Contribution" shall mean any work of authorship, including
+ the original version of the Work and any modifications or additions
+ to that Work or Derivative Works thereof, that is intentionally
+ submitted to Licensor for inclusion in the Work by the copyright owner
+ or by an individual or Legal Entity authorized to submit on behalf of
+ the copyright owner. For the purposes of this definition, "submitted"
+ means any form of electronic, verbal, or written communication sent
+ to the Licensor or its representatives, including but not limited to
+ communication on electronic mailing lists, source code control systems,
+ and issue tracking systems that are managed by, or on behalf of, the
+ Licensor for the purpose of discussing and improving the Work, but
+ excluding communication that is conspicuously marked or otherwise
+ designated in writing by the copyright owner as "Not a Contribution."
+
+ "Contributor" shall mean Licensor and any individual or Legal Entity
+ on behalf of whom a Contribution has been received by Licensor and
+ subsequently incorporated within the Work.
+
+ 2. Grant of Copyright License. Subject to the terms and conditions of
+ this License, each Contributor hereby grants to You a perpetual,
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+ copyright license to reproduce, prepare Derivative Works of,
+ publicly display, publicly perform, sublicense, and distribute the
+ Work and such Derivative Works in Source or Object form.
+
+ 3. Grant of Patent License. Subject to the terms and conditions of
+ this License, each Contributor hereby grants to You a perpetual,
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+ (except as stated in this section) patent license to make, have made,
+ use, offer to sell, sell, import, and otherwise transfer the Work,
+ where such license applies only to those patent claims licensable
+ by such Contributor that are necessarily infringed by their
+ Contribution(s) alone or by combination of their Contribution(s)
+ with the Work to which such Contribution(s) was submitted. If You
+ institute patent litigation against any entity (including a
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
+ or a Contribution incorporated within the Work constitutes direct
+ or contributory patent infringement, then any patent licenses
+ granted to You under this License for that Work shall terminate
+ as of the date such litigation is filed.
+
+ 4. Redistribution. You may reproduce and distribute copies of the
+ Work or Derivative Works thereof in any medium, with or without
+ modifications, and in Source or Object form, provided that You
+ meet the following conditions:
+
+ (a) You must give any other recipients of the Work or
+ Derivative Works a copy of this License; and
+
+ (b) You must cause any modified files to carry prominent notices
+ stating that You changed the files; and
+
+ (c) You must retain, in the Source form of any Derivative Works
+ that You distribute, all copyright, patent, trademark, and
+ attribution notices from the Source form of the Work,
+ excluding those notices that do not pertain to any part of
+ the Derivative Works; and
+
+ (d) If the Work includes a "NOTICE" text file as part of its
+ distribution, then any Derivative Works that You distribute must
+ include a readable copy of the attribution notices contained
+ within such NOTICE file, excluding those notices that do not
+ pertain to any part of the Derivative Works, in at least one
+ of the following places: within a NOTICE text file distributed
+ as part of the Derivative Works; within the Source form or
+ documentation, if provided along with the Derivative Works; or,
+ within a display generated by the Derivative Works, if and
+ wherever such third-party notices normally appear. The contents
+ of the NOTICE file are for informational purposes only and
+ do not modify the License. You may add Your own attribution
+ notices within Derivative Works that You distribute, alongside
+ or as an addendum to the NOTICE text from the Work, provided
+ that such additional attribution notices cannot be construed
+ as modifying the License.
+
+ You may add Your own copyright statement to Your modifications and
+ may provide additional or different license terms and conditions
+ for use, reproduction, or distribution of Your modifications, or
+ for any such Derivative Works as a whole, provided Your use,
+ reproduction, and distribution of the Work otherwise complies with
+ the conditions stated in this License.
+
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
+ any Contribution intentionally submitted for inclusion in the Work
+ by You to the Licensor shall be under the terms and conditions of
+ this License, without any additional terms or conditions.
+ Notwithstanding the above, nothing herein shall supersede or modify
+ the terms of any separate license agreement you may have executed
+ with Licensor regarding such Contributions.
+
+ 6. Trademarks. This License does not grant permission to use the trade
+ names, trademarks, service marks, or product names of the Licensor,
+ except as required for reasonable and customary use in describing the
+ origin of the Work and reproducing the content of the NOTICE file.
+
+ 7. Disclaimer of Warranty. Unless required by applicable law or
+ agreed to in writing, Licensor provides the Work (and each
+ Contributor provides its Contributions) on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
+ implied, including, without limitation, any warranties or conditions
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
+ PARTICULAR PURPOSE. You are solely responsible for determining the
+ appropriateness of using or redistributing the Work and assume any
+ risks associated with Your exercise of permissions under this License.
+
+ 8. Limitation of Liability. In no event and under no legal theory,
+ whether in tort (including negligence), contract, or otherwise,
+ unless required by applicable law (such as deliberate and grossly
+ negligent acts) or agreed to in writing, shall any Contributor be
+ liable to You for damages, including any direct, indirect, special,
+ incidental, or consequential damages of any character arising as a
+ result of this License or out of the use or inability to use the
+ Work (including but not limited to damages for loss of goodwill,
+ work stoppage, computer failure or malfunction, or any and all
+ other commercial damages or losses), even if such Contributor
+ has been advised of the possibility of such damages.
+
+ 9. Accepting Warranty or Additional Liability. While redistributing
+ the Work or Derivative Works thereof, You may choose to offer,
+ and charge a fee for, acceptance of support, warranty, indemnity,
+ or other liability obligations and/or rights consistent with this
+ License. However, in accepting such obligations, You may act only
+ on Your own behalf and on Your sole responsibility, not on behalf
+ of any other Contributor, and only if You agree to indemnify,
+ defend, and hold each Contributor harmless for any liability
+ incurred by, or claims asserted against, such Contributor by reason
+ of your accepting any such warranty or additional liability.
+
+ END OF TERMS AND CONDITIONS
+
+ APPENDIX: How to apply the Apache License to your work.
+
+ To apply the Apache License to your work, attach the following
+ boilerplate notice, with the fields enclosed by brackets "[]"
+ replaced with your own identifying information. (Don't include
+ the brackets!) The text should be enclosed in the appropriate
+ comment syntax for the file format. We also recommend that a
+ file or class name and description of purpose be included on the
+ same "printed page" as the copyright notice for easier
+ identification within third-party archives.
+
+ Copyright [2023] [AUTHORS]
+
+ Licensed under the Apache License, Version 2.0 (the "License");
+ you may not use this file except in compliance with the License.
+ You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
README.md ADDED
@@ -0,0 +1,2253 @@
+ ---
+ license: other
+ license_name: apache-2.0-plus-notices
+ license_link: LICENSE
+ language:
+ - aah
+ - aai
+ - aak
+ - aau
+ - aaz
+ - ab
+ - aba
+ - abi
+ - abk
+ - abn
+ - abq
+ - abs
+ - abt
+ - abx
+ - aby
+ - abz
+ - aca
+ - acd
+ - ace
+ - acf
+ - ach
+ - acm
+ - acn
+ - acq
+ - acr
+ - acu
+ - ada
+ - ade
+ - adh
+ - adi
+ - adj
+ - adl
+ - adx
+ - ady
+ - adz
+ - aeb
+ - aer
+ - aeu
+ - aey
+ - af
+ - afb
+ - afh
+ - afr
+ - agd
+ - agg
+ - agm
+ - agn
+ - agr
+ - agt
+ - agu
+ - agw
+ - agx
+ - aha
+ - ahk
+ - aia
+ - aii
+ - aim
+ - ain
+ - ajg
+ - aji
+ - ajp
+ - ajz
+ - ak
+ - aka
+ - akb
+ - ake
+ - akh
+ - akl
+ - akp
+ - ald
+ - alj
+ - aln
+ - alp
+ - alq
+ - als
+ - alt
+ - aly
+ - alz
+ - am
+ - ame
+ - amf
+ - amh
+ - ami
+ - amk
+ - amm
+ - amn
+ - amp
+ - amr
+ - amu
+ - amx
+ - an
+ - ang
+ - anm
+ - ann
+ - anp
+ - anv
+ - any
+ - aoc
+ - aoi
+ - aoj
+ - aom
+ - aon
+ - aoz
+ - apb
+ - apc
+ - ape
+ - apn
+ - apr
+ - apt
+ - apu
+ - apw
+ - apy
+ - apz
+ - aqz
+ - ar
+ - ara
+ - arb
+ - are
+ - arg
+ - arh
+ - arl
+ - arn
+ - arp
+ - arq
+ - arr
+ - ars
+ - ary
+ - arz
+ - as
+ - asg
+ - asm
+ - aso
+ - ast
+ - ata
+ - atb
+ - atd
+ - atg
+ - ati
+ - atj
+ - atq
+ - att
+ - auc
+ - aui
+ - auy
+ - av
+ - ava
+ - avk
+ - avn
+ - avt
+ - avu
+ - awa
+ - awb
+ - awi
+ - awx
+ - ay
+ - aym
+ - ayo
+ - ayp
+ - ayr
+ - az
+ - azb
+ - aze
+ - azg
+ - azj
+ - azz
+ - ba
+ - bak
+ - bal
+ - bam
+ - ban
+ - bao
+ - bar
+ - bas
+ - bav
+ - bba
+ - bbb
+ - bbc
+ - bbj
+ - bbk
+ - bbo
+ - bbr
+ - bcc
+ - bch
+ - bci
+ - bcl
+ - bco
+ - bcw
+ - bdd
+ - bdh
+ - bdq
+ - be
+ - bea
+ - bef
+ - bel
+ - bem
+ - ben
+ - beq
+ - ber
+ - bew
+ - bex
+ - bfd
+ - bfo
+ - bfz
+ - bg
+ - bgr
+ - bgs
+ - bgt
+ - bgz
+ - bhg
+ - bhl
+ - bho
+ - bhp
+ - bhw
+ - bhz
+ - bi
+ - bib
+ - big
+ - bih
+ - bik
+ - bim
+ - bin
+ - bis
+ - biu
+ - biv
+ - bjn
+ - bjp
+ - bjr
+ - bjv
+ - bkd
+ - bkl
+ - bkq
+ - bku
+ - bkv
+ - bla
+ - blh
+ - blk
+ - blt
+ - blw
+ - blz
+ - bm
+ - bmb
+ - bmh
+ - bmk
+ - bmq
+ - bmr
+ - bmu
+ - bmv
+ - bn
+ - bnj
+ - bno
+ - bnp
+ - bo
+ - boa
+ - bod
+ - boj
+ - bom
+ - bon
+ - bor
+ - bos
+ - bov
+ - box
+ - bpr
+ - bps
+ - bpy
+ - bqc
+ - bqj
+ - bqp
+ - br
+ - bre
+ - brh
+ - bru
+ - brx
+ - bs
+ - bsc
+ - bsn
+ - bsp
+ - bsq
+ - bss
+ - btd
+ - btg
+ - bth
+ - bts
+ - btt
+ - btx
+ - bua
+ - bud
+ - bug
+ - buk
+ - bul
+ - bum
+ - bus
+ - bvc
+ - bvd
+ - bvr
+ - bvy
+ - bvz
+ - bwd
+ - bwi
+ - bwq
+ - bwu
+ - bxh
+ - bxr
+ - byr
+ - byv
+ - byx
+ - bzd
+ - bzh
+ - bzi
+ - bzj
+ - bzt
+ - ca
+ - caa
+ - cab
+ - cac
+ - caf
+ - cag
+ - cak
+ - cao
+ - cap
+ - caq
+ - car
+ - cas
+ - cat
+ - cav
+ - cax
+ - cbc
+ - cbi
+ - cbk
+ - cbr
+ - cbs
+ - cbt
+ - cbu
+ - cbv
+ - cce
+ - cco
+ - ccp
+ - cdf
+ - ce
+ - ceb
+ - ceg
+ - cek
+ - ces
+ - cfm
+ - cgc
+ - cgg
+ - ch
+ - cha
+ - chd
+ - che
+ - chf
+ - chj
+ - chk
+ - chn
+ - cho
+ - chq
+ - chr
+ - chu
+ - chv
+ - chw
+ - chz
+ - cjk
+ - cjo
+ - cjp
+ - cjs
+ - cjv
+ - ckb
+ - ckm
+ - cko
+ - ckt
+ - cle
+ - clu
+ - cly
+ - cme
+ - cmi
+ - cmn
+ - cmo
+ - cmr
+ - cnh
+ - cni
+ - cnk
+ - cnl
+ - cnr
+ - cnt
+ - cnw
+ - co
+ - coe
+ - cof
+ - cok
+ - con
+ - cop
+ - cor
+ - cos
+ - cot
+ - cou
+ - cpa
+ - cpb
+ - cpc
+ - cpu
+ - cpy
+ - crh
+ - cri
+ - crj
+ - crk
+ - crl
+ - crm
+ - crn
+ - crq
+ - crs
+ - crt
+ - crx
+ - cs
+ - csb
+ - csk
+ - cso
+ - csw
+ - csy
+ - cta
+ - ctd
+ - cto
+ - ctp
+ - ctu
+ - cu
+ - cub
+ - cuc
+ - cui
+ - cuk
+ - cul
+ - cut
+ - cux
+ - cv
+ - cwd
+ - cwe
+ - cwt
+ - cy
+ - cya
+ - cym
+ - czt
+ - da
+ - daa
+ - dad
+ - daf
+ - dag
+ - dah
+ - dak
+ - dan
+ - dar
+ - dbq
+ - ddg
+ - ddn
+ - de
+ - ded
+ - des
+ - deu
+ - dga
+ - dgc
+ - dgi
+ - dgr
+ - dgz
+ - dhg
+ - dhm
+ - dhv
+ - did
+ - dig
+ - dik
+ - din
+ - dip
+ - diq
+ - dis
+ - diu
+ - div
+ - dje
+ - djk
+ - djr
+ - dks
+ - dln
+ - dng
+ - dnj
+ - dnw
+ - dob
+ - doi
+ - dop
+ - dos
+ - dow
+ - drg
+ - drt
+ - dru
+ - dsb
+ - dsh
+ - dtb
+ - dtp
+ - dts
+ - dty
+ - dua
+ - due
+ - dug
+ - duo
+ - dur
+ - dv
+ - dwr
+ - dws
+ - dww
+ - dyi
+ - dyo
+ - dyu
+ - dz
+ - dzo
+ - ebk
+ - ee
+ - efi
+ - egl
+ - eka
+ - ekk
+ - eko
+ - el
+ - ell
+ - eme
+ - emi
+ - eml
+ - emp
+ - en
+ - enb
+ - eng
+ - enl
+ - enm
+ - enq
+ - enx
+ - eo
+ - epo
+ - eri
+ - es
+ - ese
+ - esi
+ - esk
+ - ess
+ - est
+ - esu
+ - et
+ - eto
+ - etr
+ - etu
+ - eu
+ - eus
+ - eve
+ - evn
+ - ewe
+ - ewo
+ - ext
+ - eza
+ - fa
+ - faa
+ - fad
+ - fai
+ - fal
+ - fan
+ - fao
+ - far
+ - fas
+ - fat
+ - ffm
+ - fi
+ - fij
+ - fil
+ - fin
+ - fit
+ - fj
+ - fkv
+ - fmp
+ - fmu
+ - fo
+ - fon
+ - for
+ - fr
+ - fra
+ - frd
+ - frm
+ - fro
+ - frp
+ - frr
+ - fry
+ - fub
+ - fud
+ - fue
+ - fuf
+ - fuh
+ - fuq
+ - fur
+ - fuv
+ - fy
+ - ga
+ - gaa
+ - gag
+ - gah
+ - gai
+ - gam
+ - gaw
+ - gaz
+ - gba
+ - gbi
+ - gbo
+ - gbr
+ - gcf
+ - gcr
+ - gd
+ - gde
+ - gdg
+ - gdn
+ - gdr
+ - geb
+ - gej
+ - gfk
+ - ghe
+ - ghs
+ - gid
+ - gil
+ - giz
+ - gjn
+ - gkn
+ - gkp
+ - gl
+ - gla
+ - gle
+ - glg
+ - glk
+ - glv
+ - gmh
+ - gmv
+ - gn
+ - gna
+ - gnb
+ - gnd
+ - gng
+ - gnn
+ - gnw
+ - goa
+ - gof
+ - gog
+ - goh
+ - gom
+ - gor
+ - gos
+ - got
+ - gqr
+ - grc
+ - grn
+ - grt
+ - gso
+ - gsw
+ - gu
+ - gub
+ - guc
+ - gud
+ - gug
+ - guh
+ - gui
+ - guj
+ - guk
+ - gul
+ - gum
+ - gun
+ - guo
+ - guq
+ - gur
+ - guu
+ - guw
+ - gux
+ - guz
+ - gv
+ - gvc
+ - gvf
+ - gvl
+ - gvn
+ - gwi
+ - gwr
+ - gxx
+ - gya
+ - gym
+ - gyr
+ - ha
+ - hac
+ - hae
+ - hag
+ - hak
+ - hat
+ - hau
+ - hav
+ - haw
+ - hay
+ - hbo
+ - hbs
+ - hch
+ - he
+ - heb
+ - heg
+ - heh
+ - her
+ - hi
+ - hif
+ - hig
+ - hil
+ - hin
+ - hix
+ - hla
+ - hlt
+ - hmn
+ - hmo
+ - hmr
+ - hne
+ - hnj
+ - hnn
+ - hns
+ - ho
+ - hoc
+ - hop
+ - hot
+ - hr
+ - hra
+ - hrv
+ - hrx
+ - hsb
+ - ht
+ - hto
+ - hu
+ - hub
+ - hui
+ - hun
+ - hus
+ - huu
+ - huv
+ - hvn
+ - hwc
+ - hy
+ - hye
+ - hyw
+ - hz
+ - ia
+ - ian
+ - iba
+ - ibg
+ - ibo
+ - icr
+ - id
+ - ido
+ - idu
+ - ie
+ - ifa
+ - ifb
+ - ife
+ - ifk
+ - ifu
+ - ify
+ - ig
+ - ige
+ - ign
+ - igs
+ - ii
+ - iii
+ - ijc
+ - ike
+ - ikk
+ - ikt
+ - ikw
+ - ilb
+ - ile
+ - ilo
+ - imo
+ - ina
+ - inb
+ - ind
+ - inh
+ - ino
+ - io
+ - iou
+ - ipi
+ - iqw
+ - iri
+ - irk
+ - iry
+ - is
+ - isd
+ - ish
+ - isl
+ - iso
+ - it
+ - ita
+ - itl
+ - its
+ - itv
+ - ium
+ - ivb
+ - ivv
+ - iws
+ - ixl
+ - izh
+ - izr
+ - izz
+ - ja
+ - jaa
+ - jac
+ - jae
+ - jam
+ - jav
+ - jbo
+ - jbu
+ - jdt
+ - jic
+ - jiv
+ - jmc
+ - jmx
+ - jpa
+ - jpn
+ - jra
+ - jun
+ - jv
+ - jvn
+ - ka
+ - kaa
+ - kab
+ - kac
+ - kak
+ - kal
+ - kam
+ - kan
+ - kao
+ - kap
+ - kaq
+ - kas
+ - kat
+ - kaz
+ - kbc
+ - kbd
+ - kbh
+ - kbm
+ - kbo
+ - kbp
+ - kbq
+ - kbr
+ - kby
+ - kca
+ - kcg
+ - kck
+ - kdc
+ - kde
+ - kdh
+ - kdi
+ - kdj
+ - kdl
+ - kdp
+ - kdr
+ - kea
+ - kei
+ - kek
+ - ken
+ - keo
+ - ker
+ - kew
+ - kex
+ - kez
+ - kff
+ - kg
+ - kgf
+ - kgk
+ - kgp
+ - kgr
+ - kha
+ - khg
+ - khk
+ - khm
+ - khq
+ - khs
+ - khy
+ - khz
+ - ki
+ - kia
+ - kij
+ - kik
+ - kin
+ - kir
+ - kiu
+ - kix
+ - kj
+ - kjb
+ - kje
+ - kjh
+ - kjs
+ - kk
+ - kkc
+ - kki
+ - kkj
+ - kkl
+ - kl
+ - kle
+ - kln
+ - klt
+ - klv
+ - km
+ - kma
+ - kmb
+ - kmd
+ - kmg
+ - kmh
+ - kmk
+ - kmm
+ - kmo
+ - kmr
+ - kms
+ - kmu
+ - kmy
+ - kn
+ - knc
+ - kne
+ - knf
+ - kng
+ - knj
+ - knk
+ - kno
+ - knv
+ - knx
+ - kny
+ - ko
+ - kog
+ - koi
+ - kom
+ - kon
+ - koo
+ - kor
+ - kos
+ - kpe
+ - kpf
+ - kpg
+ - kpj
+ - kpq
+ - kpr
+ - kpv
+ - kpw
+ - kpx
+ - kpz
+ - kqa
+ - kqc
+ - kqe
+ - kqf
+ - kql
+ - kqn
+ - kqo
+ - kqp
+ - kqs
+ - kqw
+ - kqy
+ - krc
+ - kri
+ - krj
+ - krl
+ - kru
+ - krx
+ - ks
+ - ksb
+ - ksc
+ - ksd
+ - ksf
+ - ksh
+ - ksj
+ - ksp
+ - ksr
+ - kss
+ - ksw
+ - ktb
+ - ktj
+ - ktm
+ - kto
+ - ktu
+ - ktz
+ - kua
+ - kub
+ - kud
+ - kue
+ - kuj
+ - kum
+ - kup
+ - kus
+ - kv
+ - kvg
+ - kvj
+ - kvn
+ - kw
+ - kwd
+ - kwf
+ - kwi
+ - kwj
+ - kwn
+ - kwy
+ - kxc
+ - kxm
+ - kxw
+ - ky
+ - kyc
+ - kyf
+ - kyg
+ - kyq
+ - kyu
+ - kyz
+ - kze
+ - kzf
+ - kzj
+ - kzn
1009
+ - la
1010
+ - lac
1011
+ - lad
1012
+ - lai
1013
+ - laj
1014
+ - lam
1015
+ - lao
1016
+ - lap
1017
+ - las
1018
+ - lat
1019
+ - lav
1020
+ - law
1021
+ - lb
1022
+ - lbb
1023
+ - lbe
1024
+ - lbj
1025
+ - lbk
1026
+ - lch
1027
+ - lcm
1028
+ - lcp
1029
+ - ldi
1030
+ - ldn
1031
+ - lea
1032
+ - led
1033
+ - lee
1034
+ - lef
1035
+ - leh
1036
+ - lem
1037
+ - leu
1038
+ - lew
1039
+ - lex
1040
+ - lez
1041
+ - lfn
1042
+ - lg
1043
+ - lgg
1044
+ - lgl
1045
+ - lgm
1046
+ - lhi
1047
+ - lhm
1048
+ - lhu
1049
+ - li
1050
+ - lia
1051
+ - lid
1052
+ - lif
1053
+ - lij
1054
+ - lim
1055
+ - lin
1056
+ - lip
1057
+ - lir
1058
+ - lis
1059
+ - lit
1060
+ - liv
1061
+ - ljp
1062
+ - lki
1063
+ - llb
1064
+ - lld
1065
+ - llg
1066
+ - lln
1067
+ - lmk
1068
+ - lmo
1069
+ - lmp
1070
+ - ln
1071
+ - lnd
1072
+ - lo
1073
+ - lob
1074
+ - loe
1075
+ - log
1076
+ - lok
1077
+ - lol
1078
+ - lom
1079
+ - loq
1080
+ - loz
1081
+ - lrc
1082
+ - lsi
1083
+ - lsm
1084
+ - lt
1085
+ - ltg
1086
+ - ltz
1087
+ - lu
1088
+ - lua
1089
+ - lub
1090
+ - luc
1091
+ - lud
1092
+ - lue
1093
+ - lug
1094
+ - lun
1095
+ - luo
1096
+ - lus
1097
+ - lut
1098
+ - lv
1099
+ - lvs
1100
+ - lwg
1101
+ - lwo
1102
+ - lww
1103
+ - lzh
1104
+ - lzz
1105
+ - maa
1106
+ - mad
1107
+ - maf
1108
+ - mag
1109
+ - mah
1110
+ - mai
1111
+ - maj
1112
+ - mak
1113
+ - mal
1114
+ - mam
1115
+ - maq
1116
+ - mar
1117
+ - mas
1118
+ - mau
1119
+ - mav
1120
+ - maw
1121
+ - max
1122
+ - maz
1123
+ - mbb
1124
+ - mbc
1125
+ - mbd
1126
+ - mbf
1127
+ - mbh
1128
+ - mbi
1129
+ - mbj
1130
+ - mbl
1131
+ - mbs
1132
+ - mbt
1133
+ - mca
1134
+ - mcb
1135
+ - mcd
1136
+ - mcf
1137
+ - mck
1138
+ - mcn
1139
+ - mco
1140
+ - mcp
1141
+ - mcq
1142
+ - mcu
1143
+ - mda
1144
+ - mdf
1145
+ - mdy
1146
+ - med
1147
+ - mee
1148
+ - meh
1149
+ - mej
1150
+ - mek
1151
+ - men
1152
+ - meq
1153
+ - mer
1154
+ - met
1155
+ - meu
1156
+ - mev
1157
+ - mfa
1158
+ - mfe
1159
+ - mfg
1160
+ - mfh
1161
+ - mfi
1162
+ - mfk
1163
+ - mfq
1164
+ - mfy
1165
+ - mfz
1166
+ - mg
1167
+ - mgc
1168
+ - mgh
1169
+ - mgm
1170
+ - mgo
1171
+ - mgr
1172
+ - mgv
1173
+ - mh
1174
+ - mhi
1175
+ - mhl
1176
+ - mhr
1177
+ - mhw
1178
+ - mhx
1179
+ - mhy
1180
+ - mi
1181
+ - mib
1182
+ - mic
1183
+ - mie
1184
+ - mif
1185
+ - mig
1186
+ - mih
1187
+ - mik
1188
+ - mil
1189
+ - mim
1190
+ - min
1191
+ - mio
1192
+ - mip
1193
+ - miq
1194
+ - mir
1195
+ - mit
1196
+ - miy
1197
+ - miz
1198
+ - mjc
1199
+ - mjw
1200
+ - mk
1201
+ - mkd
1202
+ - mkl
1203
+ - mkn
1204
+ - mks
1205
+ - mkz
1206
+ - ml
1207
+ - mlg
1208
+ - mlh
1209
+ - mlp
1210
+ - mlt
1211
+ - mlu
1212
+ - mmn
1213
+ - mmo
1214
+ - mmx
1215
+ - mn
1216
+ - mna
1217
+ - mnb
1218
+ - mnf
1219
+ - mni
1220
+ - mnk
1221
+ - mns
1222
+ - mnw
1223
+ - mnx
1224
+ - mny
1225
+ - moa
1226
+ - moc
1227
+ - mog
1228
+ - moh
1229
+ - mon
1230
+ - mop
1231
+ - mor
1232
+ - mos
1233
+ - mox
1234
+ - mpg
1235
+ - mph
1236
+ - mpm
1237
+ - mpp
1238
+ - mps
1239
+ - mpt
1240
+ - mpx
1241
+ - mqb
1242
+ - mqj
1243
+ - mqy
1244
+ - mr
1245
+ - mrg
1246
+ - mri
1247
+ - mrj
1248
+ - mrq
1249
+ - mrv
1250
+ - mrw
1251
+ - ms
1252
+ - msa
1253
+ - msb
1254
+ - msc
1255
+ - mse
1256
+ - msk
1257
+ - msm
1258
+ - msy
1259
+ - mt
1260
+ - mta
1261
+ - mtg
1262
+ - mti
1263
+ - mtj
1264
+ - mto
1265
+ - mtp
1266
+ - mua
1267
+ - mug
1268
+ - muh
1269
+ - mui
1270
+ - mup
1271
+ - mur
1272
+ - mus
1273
+ - mux
1274
+ - muy
1275
+ - mva
1276
+ - mvn
1277
+ - mvp
1278
+ - mwc
1279
+ - mwf
1280
+ - mwl
1281
+ - mwm
1282
+ - mwn
1283
+ - mwp
1284
+ - mwq
1285
+ - mwv
1286
+ - mww
1287
+ - mxb
1288
+ - mxp
1289
+ - mxq
1290
+ - mxt
1291
+ - mxv
1292
+ - my
1293
+ - mya
1294
+ - myb
1295
+ - myk
1296
+ - myu
1297
+ - myv
1298
+ - myw
1299
+ - myx
1300
+ - myy
1301
+ - mza
1302
+ - mzh
1303
+ - mzk
1304
+ - mzl
1305
+ - mzm
1306
+ - mzn
1307
+ - mzw
1308
+ - mzz
1309
+ - nab
1310
+ - naf
1311
+ - nah
1312
+ - nak
1313
+ - nan
1314
+ - nap
1315
+ - naq
1316
+ - nas
1317
+ - nav
1318
+ - naw
1319
+ - nb
1320
+ - nba
1321
+ - nbc
1322
+ - nbe
1323
+ - nbl
1324
+ - nbq
1325
+ - nbu
1326
+ - nca
1327
+ - nch
1328
+ - ncj
1329
+ - ncl
1330
+ - ncq
1331
+ - nct
1332
+ - ncu
1333
+ - ncx
1334
+ - nd
1335
+ - ndc
1336
+ - nde
1337
+ - ndh
1338
+ - ndi
1339
+ - ndj
1340
+ - ndo
1341
+ - ndp
1342
+ - nds
1343
+ - ndy
1344
+ - ndz
1345
+ - ne
1346
+ - neb
1347
+ - nep
1348
+ - new
1349
+ - nfa
1350
+ - nfr
1351
+ - ng
1352
+ - ngb
1353
+ - ngc
1354
+ - ngl
1355
+ - ngp
1356
+ - ngu
1357
+ - nhd
1358
+ - nhe
1359
+ - nhg
1360
+ - nhi
1361
+ - nhk
1362
+ - nho
1363
+ - nhr
1364
+ - nhu
1365
+ - nhw
1366
+ - nhx
1367
+ - nhy
1368
+ - nia
1369
+ - nif
1370
+ - nii
1371
+ - nij
1372
+ - nim
1373
+ - nin
1374
+ - nio
1375
+ - niq
1376
+ - niu
1377
+ - niy
1378
+ - njb
1379
+ - njm
1380
+ - njn
1381
+ - njo
1382
+ - njz
1383
+ - nka
1384
+ - nkf
1385
+ - nki
1386
+ - nko
1387
+ - nl
1388
+ - nla
1389
+ - nlc
1390
+ - nld
1391
+ - nlg
1392
+ - nma
1393
+ - nmf
1394
+ - nmh
1395
+ - nmo
1396
+ - nmw
1397
+ - nmz
1398
+ - nn
1399
+ - nnb
1400
+ - nng
1401
+ - nnh
1402
+ - nnl
1403
+ - nno
1404
+ - nnp
1405
+ - nnq
1406
+ - nnw
1407
+ - no
1408
+ - noa
1409
+ - nob
1410
+ - nod
1411
+ - nog
1412
+ - non
1413
+ - nop
1414
+ - nor
1415
+ - not
1416
+ - nou
1417
+ - nov
1418
+ - nph
1419
+ - npi
1420
+ - npl
1421
+ - npo
1422
+ - npy
1423
+ - nqo
1424
+ - nr
1425
+ - nre
1426
+ - nrf
1427
+ - nri
1428
+ - nrm
1429
+ - nsa
1430
+ - nse
1431
+ - nsm
1432
+ - nsn
1433
+ - nso
1434
+ - nss
1435
+ - nst
1436
+ - nsu
1437
+ - ntp
1438
+ - ntr
1439
+ - ntu
1440
+ - nuj
1441
+ - nus
1442
+ - nuy
1443
+ - nuz
1444
+ - nv
1445
+ - nvm
1446
+ - nwb
1447
+ - nwi
1448
+ - nwx
1449
+ - nxd
1450
+ - ny
1451
+ - nya
1452
+ - nyf
1453
+ - nyk
1454
+ - nyn
1455
+ - nyo
1456
+ - nyu
1457
+ - nyy
1458
+ - nza
1459
+ - nzb
1460
+ - nzi
1461
+ - nzm
1462
+ - obo
1463
+ - oc
1464
+ - oci
1465
+ - ogo
1466
+ - oj
1467
+ - ojb
1468
+ - oji
1469
+ - ojs
1470
+ - oke
1471
+ - oku
1472
+ - okv
1473
+ - old
1474
+ - olo
1475
+ - om
1476
+ - omb
1477
+ - omw
1478
+ - ong
1479
+ - ons
1480
+ - ood
1481
+ - opm
1482
+ - or
1483
+ - ori
1484
+ - orm
1485
+ - orv
1486
+ - ory
1487
+ - os
1488
+ - oss
1489
+ - ota
1490
+ - otd
1491
+ - ote
1492
+ - otm
1493
+ - otn
1494
+ - oto
1495
+ - otq
1496
+ - ots
1497
+ - otw
1498
+ - oym
1499
+ - ozm
1500
+ - pa
1501
+ - pab
1502
+ - pad
1503
+ - pag
1504
+ - pah
1505
+ - pam
1506
+ - pan
1507
+ - pao
1508
+ - pap
1509
+ - pau
1510
+ - pbb
1511
+ - pbc
1512
+ - pbi
1513
+ - pbl
1514
+ - pbt
1515
+ - pcd
1516
+ - pck
1517
+ - pcm
1518
+ - pdc
1519
+ - pdt
1520
+ - pem
1521
+ - pes
1522
+ - pez
1523
+ - pfe
1524
+ - pfl
1525
+ - phm
1526
+ - pib
1527
+ - pid
1528
+ - pih
1529
+ - pio
1530
+ - pir
1531
+ - pis
1532
+ - pjt
1533
+ - pkb
1534
+ - pl
1535
+ - plg
1536
+ - pls
1537
+ - plt
1538
+ - plu
1539
+ - plw
1540
+ - pma
1541
+ - pmf
1542
+ - pmq
1543
+ - pms
1544
+ - pmx
1545
+ - pnb
1546
+ - pne
1547
+ - pnt
1548
+ - pny
1549
+ - poe
1550
+ - poh
1551
+ - poi
1552
+ - pol
1553
+ - pon
1554
+ - por
1555
+ - pos
1556
+ - pot
1557
+ - pov
1558
+ - poy
1559
+ - ppk
1560
+ - ppl
1561
+ - ppo
1562
+ - pps
1563
+ - prf
1564
+ - prg
1565
+ - pri
1566
+ - prk
1567
+ - prq
1568
+ - prs
1569
+ - ps
1570
+ - pse
1571
+ - pss
1572
+ - pt
1573
+ - ptp
1574
+ - ptu
1575
+ - pua
1576
+ - pui
1577
+ - pus
1578
+ - pwg
1579
+ - pwn
1580
+ - pww
1581
+ - pxm
1582
+ - qu
1583
+ - qub
1584
+ - quc
1585
+ - que
1586
+ - quf
1587
+ - qug
1588
+ - quh
1589
+ - qul
1590
+ - qup
1591
+ - qus
1592
+ - quw
1593
+ - quy
1594
+ - quz
1595
+ - qva
1596
+ - qvc
1597
+ - qve
1598
+ - qvh
1599
+ - qvi
1600
+ - qvm
1601
+ - qvn
1602
+ - qvo
1603
+ - qvs
1604
+ - qvw
1605
+ - qvz
1606
+ - qwh
1607
+ - qxh
1608
+ - qxl
1609
+ - qxn
1610
+ - qxo
1611
+ - qxr
1612
+ - qya
1613
+ - rad
1614
+ - rai
1615
+ - rap
1616
+ - rar
1617
+ - rav
1618
+ - raw
1619
+ - rcf
1620
+ - rej
1621
+ - rel
1622
+ - rgu
1623
+ - rhg
1624
+ - ria
1625
+ - rif
1626
+ - rim
1627
+ - rjs
1628
+ - rkb
1629
+ - rm
1630
+ - rmc
1631
+ - rme
1632
+ - rml
1633
+ - rmn
1634
+ - rmo
1635
+ - rmq
1636
+ - rmy
1637
+ - rn
1638
+ - rnd
1639
+ - rng
1640
+ - rnl
1641
+ - ro
1642
+ - roh
1643
+ - rom
1644
+ - ron
1645
+ - roo
1646
+ - rop
1647
+ - row
1648
+ - rro
1649
+ - rtm
1650
+ - ru
1651
+ - rub
1652
+ - rue
1653
+ - ruf
1654
+ - rug
1655
+ - run
1656
+ - rup
1657
+ - rus
1658
+ - rw
1659
+ - rwo
1660
+ - sa
1661
+ - sab
1662
+ - sag
1663
+ - sah
1664
+ - saj
1665
+ - san
1666
+ - sas
1667
+ - sat
1668
+ - say
1669
+ - sba
1670
+ - sbd
1671
+ - sbe
1672
+ - sbl
1673
+ - sbs
1674
+ - sby
1675
+ - sc
1676
+ - sck
1677
+ - scn
1678
+ - sco
1679
+ - sd
1680
+ - sda
1681
+ - sdc
1682
+ - sdh
1683
+ - sdo
1684
+ - sdq
1685
+ - se
1686
+ - seh
1687
+ - sel
1688
+ - ses
1689
+ - sey
1690
+ - sfw
1691
+ - sg
1692
+ - sgb
1693
+ - sgc
1694
+ - sgh
1695
+ - sgs
1696
+ - sgw
1697
+ - sgz
1698
+ - sh
1699
+ - shi
1700
+ - shk
1701
+ - shn
1702
+ - shp
1703
+ - shr
1704
+ - shs
1705
+ - shu
1706
+ - shy
1707
+ - si
1708
+ - sid
1709
+ - sig
1710
+ - sil
1711
+ - sim
1712
+ - sin
1713
+ - sja
1714
+ - sjn
1715
+ - sjo
1716
+ - sju
1717
+ - sk
1718
+ - skg
1719
+ - skr
1720
+ - sl
1721
+ - sld
1722
+ - slk
1723
+ - sll
1724
+ - slv
1725
+ - sm
1726
+ - sma
1727
+ - sme
1728
+ - smj
1729
+ - smk
1730
+ - sml
1731
+ - smn
1732
+ - smo
1733
+ - sms
1734
+ - smt
1735
+ - sn
1736
+ - sna
1737
+ - snc
1738
+ - snd
1739
+ - snf
1740
+ - snn
1741
+ - snp
1742
+ - snw
1743
+ - sny
1744
+ - so
1745
+ - soe
1746
+ - som
1747
+ - sop
1748
+ - soq
1749
+ - sot
1750
+ - soy
1751
+ - spa
1752
+ - spl
1753
+ - spm
1754
+ - spp
1755
+ - sps
1756
+ - spy
1757
+ - sq
1758
+ - sqi
1759
+ - sr
1760
+ - srd
1761
+ - sri
1762
+ - srm
1763
+ - srn
1764
+ - srp
1765
+ - srq
1766
+ - srr
1767
+ - ss
1768
+ - ssd
1769
+ - ssg
1770
+ - ssw
1771
+ - ssx
1772
+ - st
1773
+ - stn
1774
+ - stp
1775
+ - stq
1776
+ - su
1777
+ - sua
1778
+ - suc
1779
+ - sue
1780
+ - suk
1781
+ - sun
1782
+ - sur
1783
+ - sus
1784
+ - sux
1785
+ - suz
1786
+ - sv
1787
+ - sw
1788
+ - swa
1789
+ - swb
1790
+ - swc
1791
+ - swe
1792
+ - swg
1793
+ - swh
1794
+ - swk
1795
+ - swp
1796
+ - sxb
1797
+ - sxn
1798
+ - syb
1799
+ - syc
1800
+ - syl
1801
+ - szb
1802
+ - szl
1803
+ - szy
1804
+ - ta
1805
+ - tab
1806
+ - tac
1807
+ - tah
1808
+ - taj
1809
+ - tam
1810
+ - tap
1811
+ - taq
1812
+ - tar
1813
+ - tat
1814
+ - tav
1815
+ - taw
1816
+ - tay
1817
+ - tbc
1818
+ - tbg
1819
+ - tbk
1820
+ - tbl
1821
+ - tbo
1822
+ - tbw
1823
+ - tby
1824
+ - tbz
1825
+ - tca
1826
+ - tcc
1827
+ - tcf
1828
+ - tcs
1829
+ - tcy
1830
+ - tcz
1831
+ - tdt
1832
+ - tdx
1833
+ - te
1834
+ - ted
1835
+ - tee
1836
+ - tel
1837
+ - tem
1838
+ - teo
1839
+ - ter
1840
+ - tet
1841
+ - tew
1842
+ - tfr
1843
+ - tg
1844
+ - tgk
1845
+ - tgl
1846
+ - tgo
1847
+ - tgp
1848
+ - th
1849
+ - tha
1850
+ - thk
1851
+ - thl
1852
+ - thv
1853
+ - ti
1854
+ - tif
1855
+ - tig
1856
+ - tih
1857
+ - tik
1858
+ - tim
1859
+ - tir
1860
+ - tiv
1861
+ - tiy
1862
+ - tk
1863
+ - tke
1864
+ - tkl
1865
+ - tkr
1866
+ - tku
1867
+ - tl
1868
+ - tlb
1869
+ - tlf
1870
+ - tlh
1871
+ - tlj
1872
+ - tll
1873
+ - tly
1874
+ - tmc
1875
+ - tmd
1876
+ - tmr
1877
+ - tn
1878
+ - tna
1879
+ - tnc
1880
+ - tnk
1881
+ - tnn
1882
+ - tnp
1883
+ - tnr
1884
+ - to
1885
+ - tob
1886
+ - toc
1887
+ - tod
1888
+ - tog
1889
+ - toh
1890
+ - toi
1891
+ - toj
1892
+ - tok
1893
+ - ton
1894
+ - too
1895
+ - top
1896
+ - tos
1897
+ - tpa
1898
+ - tpi
1899
+ - tpm
1900
+ - tpn
1901
+ - tpp
1902
+ - tpt
1903
+ - tpw
1904
+ - tpz
1905
+ - tqb
1906
+ - tqo
1907
+ - tr
1908
+ - trc
1909
+ - trn
1910
+ - tro
1911
+ - trp
1912
+ - trq
1913
+ - trs
1914
+ - trv
1915
+ - ts
1916
+ - tsc
1917
+ - tsg
1918
+ - tsn
1919
+ - tso
1920
+ - tsw
1921
+ - tsz
1922
+ - tt
1923
+ - ttc
1924
+ - tte
1925
+ - ttj
1926
+ - ttq
1927
+ - tts
1928
+ - tuc
1929
+ - tue
1930
+ - tuf
1931
+ - tui
1932
+ - tuk
1933
+ - tul
1934
+ - tum
1935
+ - tuo
1936
+ - tur
1937
+ - tuv
1938
+ - tvk
1939
+ - tvl
1940
+ - tw
1941
+ - twb
1942
+ - twi
1943
+ - twu
1944
+ - twx
1945
+ - txq
1946
+ - txu
1947
+ - ty
1948
+ - tyv
1949
+ - tzh
1950
+ - tzj
1951
+ - tzl
1952
+ - tzm
1953
+ - tzo
1954
+ - ubr
1955
+ - ubu
1956
+ - udm
1957
+ - udu
1958
+ - ug
1959
+ - uig
1960
+ - uk
1961
+ - ukr
1962
+ - umb
1963
+ - und
1964
+ - upv
1965
+ - ur
1966
+ - ura
1967
+ - urb
1968
+ - urd
1969
+ - urh
1970
+ - uri
1971
+ - urk
1972
+ - urt
1973
+ - urw
1974
+ - ury
1975
+ - usa
1976
+ - usp
1977
+ - uth
1978
+ - uvh
1979
+ - uvl
1980
+ - uz
1981
+ - uzb
1982
+ - uzn
1983
+ - uzs
1984
+ - vag
1985
+ - vap
1986
+ - var
1987
+ - ve
1988
+ - vec
1989
+ - ven
1990
+ - vep
1991
+ - vgt
1992
+ - vi
1993
+ - vid
1994
+ - vie
1995
+ - viv
1996
+ - vls
1997
+ - vmk
1998
+ - vmw
1999
+ - vmy
2000
+ - vo
2001
+ - vol
2002
+ - vot
2003
+ - vro
2004
+ - vun
2005
+ - vut
2006
+ - wa
2007
+ - waj
2008
+ - wal
2009
+ - wap
2010
+ - war
2011
+ - wat
2012
+ - way
2013
+ - wba
2014
+ - wbm
2015
+ - wbp
2016
+ - wca
2017
+ - wed
2018
+ - wer
2019
+ - wes
2020
+ - wew
2021
+ - whg
2022
+ - whk
2023
+ - wib
2024
+ - wim
2025
+ - wiu
2026
+ - wln
2027
+ - wls
2028
+ - wlv
2029
+ - wlx
2030
+ - wmt
2031
+ - wmw
2032
+ - wnc
2033
+ - wnu
2034
+ - wo
2035
+ - wob
2036
+ - wol
2037
+ - wos
2038
+ - wrk
2039
+ - wrs
2040
+ - wsg
2041
+ - wsk
2042
+ - wuu
2043
+ - wuv
2044
+ - wwa
2045
+ - xal
2046
+ - xav
2047
+ - xbi
2048
+ - xbr
2049
+ - xed
2050
+ - xh
2051
+ - xho
2052
+ - xla
2053
+ - xmf
2054
+ - xmm
2055
+ - xmv
2056
+ - xnn
2057
+ - xog
2058
+ - xon
2059
+ - xpe
2060
+ - xrb
2061
+ - xsb
2062
+ - xsi
2063
+ - xsm
2064
+ - xsr
2065
+ - xsu
2066
+ - xtd
2067
+ - xtm
2068
+ - xtn
2069
+ - xum
2070
+ - xuo
2071
+ - yaa
2072
+ - yad
2073
+ - yal
2074
+ - yam
2075
+ - yan
2076
+ - yao
2077
+ - yap
2078
+ - yaq
2079
+ - yas
2080
+ - yat
2081
+ - yaz
2082
+ - ybb
2083
+ - yby
2084
+ - ycn
2085
+ - ydd
2086
+ - yi
2087
+ - yid
2088
+ - yim
2089
+ - yka
2090
+ - yle
2091
+ - yli
2092
+ - yml
2093
+ - yo
2094
+ - yom
2095
+ - yon
2096
+ - yor
2097
+ - yrb
2098
+ - yre
2099
+ - yrk
2100
+ - yrl
2101
+ - yss
2102
+ - yua
2103
+ - yue
2104
+ - yuj
2105
+ - yup
2106
+ - yut
2107
+ - yuw
2108
+ - yuz
2109
+ - yva
2110
+ - zaa
2111
+ - zab
2112
+ - zac
2113
+ - zad
2114
+ - zae
2115
+ - zai
2116
+ - zam
2117
+ - zao
2118
+ - zar
2119
+ - zas
2120
+ - zat
2121
+ - zav
2122
+ - zaw
2123
+ - zca
2124
+ - zdj
2125
+ - zea
2126
+ - zgh
2127
+ - zh
2128
+ - zho
2129
+ - zia
2130
+ - ziw
2131
+ - zlm
2132
+ - zne
2133
+ - zoc
2134
+ - zom
2135
+ - zos
2136
+ - zpa
2137
+ - zpc
2138
+ - zpd
2139
+ - zpf
2140
+ - zpg
2141
+ - zpi
2142
+ - zpj
2143
+ - zpl
2144
+ - zpm
2145
+ - zpo
2146
+ - zpq
2147
+ - zpt
2148
+ - zpu
2149
+ - zpv
2150
+ - zpz
2151
+ - zsm
2152
+ - zsr
2153
+ - ztq
2154
+ - zty
2155
+ - zu
2156
+ - zul
2157
+ - zxx
2158
+ - zyb
2159
+ - zyp
2160
+ - zza
2161
+
2162
+ tags:
2163
+ - text-classification
2164
+ - language-identification
2165
+ library_name: fasttext
2166
+ datasets:
2167
+ - cis-lmu/GlotSparse
2168
+ - cis-lmu/GlotStoryBook
2169
+ - cis-lmu/glotlid-corpus
2170
+ metrics:
2171
+ - f1
2172
+ ---
2173
+
2174
+ # GlotLID
2175
+
2176
+ [![GlotLID](https://img.shields.io/badge/🤗-Open%20In%20Spaces-blue.svg)](https://huggingface.co/spaces/cis-lmu/glotlid-space)
2177
+
2178
+ ## Description
2179
+
2180
+ **GlotLID** is a fastText language identification (LID) model that supports more than **2000 labels**.
2181
+
2182
+
2183
+ **Latest:** GlotLID is now updated to **V3**. V3 supports **2102 labels** (three-letter ISO codes with script). For more details on the supported languages and performance, as well as significant changes from previous versions, please refer to [https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md](https://github.com/cisnlp/GlotLID/blob/main/languages-v3.md).
2184
+
2185
+
2186
+ - **Demo:** [huggingface](https://huggingface.co/spaces/cis-lmu/glotlid-space)
2187
+ - **Repository:** [github](https://github.com/cisnlp/GlotLID)
2188
+ - **Paper:** [paper](https://arxiv.org/abs/2310.16248) (EMNLP 2023)
2189
+ - **Point of Contact:** amir@cis.lmu.de
2190
+
2191
+
2192
+
2193
+ ### How to use
2194
+
2195
+ Here is how to use this model to detect the language of a given text:
2196
+
2197
+ ```python
2198
+ >>> import fasttext
2199
+ >>> from huggingface_hub import hf_hub_download
2200
+
2201
+ # model.bin always points to the latest version
2202
+ >>> model_path = hf_hub_download(repo_id="cis-lmu/glotlid", filename="model.bin")
2203
+ >>> model = fasttext.load_model(model_path)
2204
+ >>> model.predict("Hello, world!")
2205
+ ```
2206
+
2207
+ If you prefer not to use huggingface_hub, download the model directly:
2208
+
2209
+ ```bash
2210
+ wget https://huggingface.co/cis-lmu/glotlid/resolve/main/model.bin
2211
+ ```
2212
+
2213
+ ```python
2214
+ >>> import fasttext
2215
+
2216
+ >>> model = fasttext.load_model("/path/to/model.bin")
2217
+ >>> model.predict("Hello, world!")
2218
+ ```
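The `predict` call above returns a pair of label and probability sequences. fastText prefixes each label with `__label__`, and the V3 labels described earlier pair a three-letter ISO code with a script. A minimal sketch for splitting such a label, assuming it looks like `__label__eng_Latn`; the helper name `parse_label` is hypothetical and not part of fastText or GlotLID:

```python
from typing import Tuple

LABEL_PREFIX = "__label__"  # fastText's default label prefix


def parse_label(label: str) -> Tuple[str, str]:
    """Split a label such as '__label__eng_Latn' into (language, script).

    Assumes the '<iso639-3>_<Script>' layout of GlotLID V3 labels; if no
    underscore is present, the script part comes back as an empty string.
    """
    if label.startswith(LABEL_PREFIX):
        label = label[len(LABEL_PREFIX):]
    language, _, script = label.partition("_")
    return language, script


# Typical use with a loaded model (not run here, since it needs model.bin):
#   labels, probs = model.predict("Hello, world!", k=3)
#   for label, prob in zip(labels, probs):
#       print(parse_label(label), float(prob))
```

Passing `k` to `predict` asks fastText for the top-k labels, which helps when closely related languages receive similar scores.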
2219
+
2220
+
2221
+ ## License
2222
+
2223
+ The model is distributed under the Apache License, Version 2.0 plus notices (see LICENSE file for full terms).
2224
+
2225
+ ## Version
2226
+
2227
+ Previous versions of GlotLID remain available in the repository.
2228
+
2229
+ To access a specific version, simply append the version number to the `filename`.
2230
+
2231
+ - For v1: `model_v1.bin` (introduced in the GlotLID [paper](https://arxiv.org/abs/2310.16248) and used in all experiments).
2232
+ - For v2: `model_v2.bin` (an edited version of v1, featuring more languages, and cleaned of noisy corpora based on the analysis of v1).
2233
+ - For v3: `model_v3.bin` (an edited version of v2, featuring more languages, excluding macrolanguages, further cleaned of noisy corpora and incorrect metadata labels based on the analysis of v2, and supporting "zxx" and "und" series labels).
2234
+
2235
+
2236
+ `model.bin` always refers to the latest version (v3).
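The naming rule above can be sketched as a small helper that maps a version number to the `filename` argument of `hf_hub_download`. The function name `glotlid_filename` is hypothetical, and the set of known versions (1 to 3) is taken only from the list above:

```python
from typing import Optional

KNOWN_VERSIONS = (1, 2, 3)  # versions listed in this README


def glotlid_filename(version: Optional[int] = None) -> str:
    """Return the model file name for a GlotLID version.

    None selects 'model.bin', which the README says always refers to
    the latest version.
    """
    if version is None:
        return "model.bin"
    if version not in KNOWN_VERSIONS:
        raise ValueError(f"unknown GlotLID version: {version!r}")
    return f"model_v{version}.bin"
```

For example, `hf_hub_download(repo_id="cis-lmu/glotlid", filename=glotlid_filename(1))` would fetch the model used in the paper's experiments.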
2237
+
2238
+
2239
+ ## References
2240
+
2241
+ If you use this model, please cite the following paper:
2242
+
2243
+ ```
2244
+ @inproceedings{
2245
+ kargaran2023glotlid,
2246
+ title={{GlotLID}: Language Identification for Low-Resource Languages},
2247
+ author={Kargaran, Amir Hossein and Imani, Ayyoob and Yvon, Fran{\c{c}}ois and Sch{\"u}tze, Hinrich},
2248
+ booktitle={The 2023 Conference on Empirical Methods in Natural Language Processing},
2249
+ year={2023},
2250
+ url={https://openreview.net/forum?id=dl4e3EBz5j}
2251
+ }
2252
+
2253
+ ```
model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a818b6bd42a628ab47d3dfc1578c7ea615c45381f3494c42535e31e8c4cafc9e
3
+ size 1687094687
model_v1.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8624f1d95916209e6f0973757f8ab704570bd7f589d9ec454d76187bb6035c41
3
+ size 1382536178
model_v2.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b186ecd9c78dca5b8ee5bcd29f35e0cfbd6258b1512484efddd0a6329c6be38
3
+ size 1697042258
model_v3.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a818b6bd42a628ab47d3dfc1578c7ea615c45381f3494c42535e31e8c4cafc9e
3
+ size 1687094687
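The four `.bin` entries above are Git LFS pointer files: each records a `sha256` object id and a byte size rather than the model itself. `model.bin` and `model_v3.bin` carry the same `oid` and `size`, which matches the README's statement that `model.bin` refers to v3. A minimal sketch for checking a downloaded file against such a pointer; the helper names are hypothetical and only the Python standard library is used:

```python
import hashlib
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields: Dict[str, str] = {}
    for line in text.splitlines():
        key, _, value = line.strip().partition(" ")
        if key:
            fields[key] = value
    return fields


def pointer_sha256(fields: Dict[str, str]) -> str:
    """Return the hex digest from an 'oid sha256:<hex>' field."""
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    return digest


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a multi-gigabyte model is never held in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()
```

A downloaded `model.bin` would then be considered intact if `sha256_of_file(path)` equals `pointer_sha256(fields)` and the file size equals `int(fields["size"])`.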