Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-20260402-l40sx1-s03
Files
xet
nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s03
/
data
59.9 MB
181 files
Updated 15 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
409 kB
xet
17 days ago
3ead4ce9
part-00001.jsonl.gz
278 kB
xet
17 days ago
b1aa4501
part-00002.jsonl.gz
268 kB
xet
17 days ago
5202b50b
part-00003.jsonl.gz
393 kB
xet
17 days ago
dcc31121
part-00004.jsonl.gz
309 kB
xet
17 days ago
3786d294
part-00005.jsonl.gz
336 kB
xet
17 days ago
f465f00d
part-00006.jsonl.gz
277 kB
xet
17 days ago
fc926ffa
part-00007.jsonl.gz
323 kB
xet
17 days ago
be7a58b1
part-00008.jsonl.gz
347 kB
xet
17 days ago
62252e6d
part-00009.jsonl.gz
289 kB
xet
17 days ago
637b2c5c
part-00010.jsonl.gz
435 kB
xet
17 days ago
d0a379d3
part-00011.jsonl.gz
341 kB
xet
16 days ago
562d2946
part-00012.jsonl.gz
375 kB
xet
16 days ago
c1db8b21
part-00013.jsonl.gz
298 kB
xet
16 days ago
62b3d87c
part-00014.jsonl.gz
357 kB
xet
16 days ago
ae81867e
part-00015.jsonl.gz
384 kB
xet
16 days ago
5c91662c
part-00016.jsonl.gz
285 kB
xet
16 days ago
688cc482
part-00017.jsonl.gz
370 kB
xet
16 days ago
f8d95645
part-00018.jsonl.gz
333 kB
xet
16 days ago
6874fa9d
part-00019.jsonl.gz
203 kB
xet
16 days ago
5f461691
part-00020.jsonl.gz
335 kB
xet
16 days ago
cd668cc1
part-00021.jsonl.gz
359 kB
xet
16 days ago
9db2d717
part-00022.jsonl.gz
307 kB
xet
16 days ago
3a534f2b
part-00023.jsonl.gz
382 kB
xet
16 days ago
7e384a41
part-00024.jsonl.gz
384 kB
xet
16 days ago
9b4743dc
part-00025.jsonl.gz
338 kB
xet
16 days ago
becd8933
part-00026.jsonl.gz
316 kB
xet
16 days ago
8d480b8e
part-00027.jsonl.gz
340 kB
xet
16 days ago
a6348bdf
part-00028.jsonl.gz
422 kB
xet
16 days ago
ad0f80f3
part-00029.jsonl.gz
274 kB
xet
16 days ago
e3ae5853
part-00030.jsonl.gz
284 kB
xet
16 days ago
99a3d945
part-00031.jsonl.gz
360 kB
xet
16 days ago
6365a5d4
part-00032.jsonl.gz
301 kB
xet
16 days ago
cfbd2417
part-00033.jsonl.gz
268 kB
xet
16 days ago
861a1f32
part-00034.jsonl.gz
328 kB
xet
16 days ago
6e6ad3e2
part-00035.jsonl.gz
318 kB
xet
16 days ago
ff4760a7
part-00036.jsonl.gz
372 kB
xet
16 days ago
b4c9d0fa
part-00037.jsonl.gz
279 kB
xet
16 days ago
964e2108
part-00038.jsonl.gz
366 kB
xet
16 days ago
b5956225
part-00039.jsonl.gz
278 kB
xet
16 days ago
b1a23efe
part-00040.jsonl.gz
318 kB
xet
16 days ago
ba2a2da4
part-00041.jsonl.gz
298 kB
xet
16 days ago
3b03c135
part-00042.jsonl.gz
302 kB
xet
16 days ago
0a8e5c3d
part-00043.jsonl.gz
271 kB
xet
16 days ago
1afffedb
part-00044.jsonl.gz
289 kB
xet
16 days ago
19000585
part-00045.jsonl.gz
421 kB
xet
16 days ago
752d63d9
part-00046.jsonl.gz
176 kB
xet
16 days ago
9e46f611
part-00047.jsonl.gz
320 kB
xet
16 days ago
f36a3662
part-00048.jsonl.gz
366 kB
xet
16 days ago
fda9b086
part-00049.jsonl.gz
379 kB
xet
16 days ago
e5a9d195
part-00050.jsonl.gz
428 kB
xet
16 days ago
afa37695
part-00051.jsonl.gz
338 kB
xet
16 days ago
45924f0c
part-00052.jsonl.gz
276 kB
xet
16 days ago
fe8c47c1
part-00053.jsonl.gz
321 kB
xet
16 days ago
8d63be62
part-00054.jsonl.gz
285 kB
xet
16 days ago
39a39939
part-00055.jsonl.gz
338 kB
xet
16 days ago
37de51b2
part-00056.jsonl.gz
291 kB
xet
16 days ago
079b0078
part-00057.jsonl.gz
279 kB
xet
16 days ago
b6a6938a
part-00058.jsonl.gz
335 kB
xet
16 days ago
65043e62
part-00059.jsonl.gz
368 kB
xet
16 days ago
f9af5cf1
part-00060.jsonl.gz
353 kB
xet
16 days ago
84a92173
part-00061.jsonl.gz
236 kB
xet
16 days ago
ea8ffd02
part-00062.jsonl.gz
331 kB
xet
16 days ago
43b2f7b2
part-00063.jsonl.gz
299 kB
xet
16 days ago
1727f6bd
part-00064.jsonl.gz
345 kB
xet
16 days ago
64b3cf9a
part-00065.jsonl.gz
395 kB
xet
16 days ago
7d9e9ece
part-00066.jsonl.gz
293 kB
xet
16 days ago
7936411e
part-00067.jsonl.gz
344 kB
xet
16 days ago
b3e471a2
part-00068.jsonl.gz
299 kB
xet
16 days ago
b0bfe921
part-00069.jsonl.gz
288 kB
xet
16 days ago
39ca3e9b
part-00070.jsonl.gz
296 kB
xet
16 days ago
c9de0ccd
part-00071.jsonl.gz
238 kB
xet
16 days ago
4fd0fe76
part-00072.jsonl.gz
356 kB
xet
16 days ago
b684a6d8
part-00073.jsonl.gz
344 kB
xet
16 days ago
4cea7e4a
part-00074.jsonl.gz
372 kB
xet
16 days ago
afa71b25
part-00075.jsonl.gz
294 kB
xet
16 days ago
d3c81540
part-00076.jsonl.gz
390 kB
xet
16 days ago
5647d457
part-00077.jsonl.gz
419 kB
xet
16 days ago
53eaeb19
part-00078.jsonl.gz
348 kB
xet
16 days ago
54ec2450
part-00079.jsonl.gz
271 kB
xet
16 days ago
ad947261
part-00080.jsonl.gz
342 kB
xet
16 days ago
8360289d
part-00081.jsonl.gz
173 kB
xet
16 days ago
e19962ae
part-00082.jsonl.gz
357 kB
xet
16 days ago
1d37611a
part-00083.jsonl.gz
363 kB
xet
16 days ago
57838f59
part-00084.jsonl.gz
389 kB
xet
16 days ago
45c08f7d
part-00085.jsonl.gz
317 kB
xet
16 days ago
592f7e62
part-00086.jsonl.gz
348 kB
xet
16 days ago
e8179b9b
part-00087.jsonl.gz
298 kB
xet
16 days ago
ec979947
part-00088.jsonl.gz
323 kB
xet
16 days ago
4708718e
part-00089.jsonl.gz
294 kB
xet
16 days ago
378fadbf
part-00090.jsonl.gz
225 kB
xet
16 days ago
269f13ec
part-00091.jsonl.gz
235 kB
xet
16 days ago
ac7b791f
part-00092.jsonl.gz
371 kB
xet
16 days ago
076a174d
part-00093.jsonl.gz
342 kB
xet
16 days ago
83b9b6de
part-00094.jsonl.gz
233 kB
xet
16 days ago
f2bb2337
part-00095.jsonl.gz
269 kB
xet
16 days ago
69a751c2
part-00096.jsonl.gz
410 kB
xet
16 days ago
41ec0885
part-00097.jsonl.gz
369 kB
xet
16 days ago
fa2140ed
part-00098.jsonl.gz
271 kB
xet
16 days ago
2930a1f0
part-00099.jsonl.gz
290 kB
xet
16 days ago
4163bed6
Load more
Use this bucket
Total size
59.9 MB
Files
181
Last updated
Apr 3
Pre-warmed CDN
US
EU
US
EU
Contributors