Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-20260402-l40sx1-s09
Files
xet
nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s09
/
data
78.2 MB
181 files
Updated 13 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
330 kB
xet
15 days ago
67039e75
part-00001.jsonl.gz
371 kB
xet
15 days ago
0b41ae15
part-00002.jsonl.gz
516 kB
xet
14 days ago
b2d99aa2
part-00003.jsonl.gz
336 kB
xet
14 days ago
ebd91ddf
part-00004.jsonl.gz
385 kB
xet
14 days ago
32a686c4
part-00005.jsonl.gz
406 kB
xet
14 days ago
7756a463
part-00006.jsonl.gz
323 kB
xet
14 days ago
0c192c13
part-00007.jsonl.gz
302 kB
xet
14 days ago
2bb8c3b3
part-00008.jsonl.gz
502 kB
xet
14 days ago
825fd31e
part-00009.jsonl.gz
449 kB
xet
14 days ago
335fe9f7
part-00010.jsonl.gz
382 kB
xet
14 days ago
437b9c6d
part-00011.jsonl.gz
370 kB
xet
14 days ago
0b12ea3f
part-00012.jsonl.gz
427 kB
xet
14 days ago
f2827ba3
part-00013.jsonl.gz
525 kB
xet
14 days ago
577d6791
part-00014.jsonl.gz
351 kB
xet
14 days ago
2a2a7b45
part-00015.jsonl.gz
518 kB
xet
14 days ago
03e6c2d6
part-00016.jsonl.gz
380 kB
xet
14 days ago
c3ab05b3
part-00017.jsonl.gz
427 kB
xet
14 days ago
d3b6e42d
part-00018.jsonl.gz
526 kB
xet
14 days ago
22358d83
part-00019.jsonl.gz
416 kB
xet
14 days ago
693e6535
part-00020.jsonl.gz
486 kB
xet
14 days ago
cb37b4e4
part-00021.jsonl.gz
364 kB
xet
14 days ago
c119561d
part-00022.jsonl.gz
330 kB
xet
14 days ago
3b6aff90
part-00023.jsonl.gz
419 kB
xet
14 days ago
cac9d46c
part-00024.jsonl.gz
466 kB
xet
14 days ago
6dbad7bc
part-00025.jsonl.gz
398 kB
xet
14 days ago
b8ba3419
part-00026.jsonl.gz
330 kB
xet
14 days ago
c97c20f4
part-00027.jsonl.gz
439 kB
xet
14 days ago
e7b2ddcc
part-00028.jsonl.gz
458 kB
xet
14 days ago
0bee629a
part-00029.jsonl.gz
389 kB
xet
14 days ago
36f0faf9
part-00030.jsonl.gz
406 kB
xet
14 days ago
44e87aa0
part-00031.jsonl.gz
301 kB
xet
14 days ago
a2347af0
part-00032.jsonl.gz
411 kB
xet
14 days ago
248e383c
part-00033.jsonl.gz
323 kB
xet
14 days ago
b76de23c
part-00034.jsonl.gz
454 kB
xet
14 days ago
6171ce62
part-00035.jsonl.gz
440 kB
xet
14 days ago
91bacf54
part-00036.jsonl.gz
500 kB
xet
14 days ago
5a0e0a5f
part-00037.jsonl.gz
401 kB
xet
14 days ago
9b8b89fa
part-00038.jsonl.gz
530 kB
xet
14 days ago
712392aa
part-00039.jsonl.gz
387 kB
xet
14 days ago
c19cb8e4
part-00040.jsonl.gz
376 kB
xet
14 days ago
0f0492df
part-00041.jsonl.gz
478 kB
xet
14 days ago
56066a7f
part-00042.jsonl.gz
391 kB
xet
14 days ago
25a98f07
part-00043.jsonl.gz
385 kB
xet
14 days ago
a2988f6e
part-00044.jsonl.gz
427 kB
xet
14 days ago
f0c284ef
part-00045.jsonl.gz
377 kB
xet
14 days ago
ed581410
part-00046.jsonl.gz
458 kB
xet
14 days ago
df358057
part-00047.jsonl.gz
411 kB
xet
14 days ago
e79cd565
part-00048.jsonl.gz
353 kB
xet
14 days ago
59cc881e
part-00049.jsonl.gz
428 kB
xet
14 days ago
749ba459
part-00050.jsonl.gz
412 kB
xet
14 days ago
e1327537
part-00051.jsonl.gz
410 kB
xet
14 days ago
fc45e002
part-00052.jsonl.gz
344 kB
xet
14 days ago
c9b448ab
part-00053.jsonl.gz
379 kB
xet
14 days ago
0a444446
part-00054.jsonl.gz
436 kB
xet
14 days ago
c8519402
part-00055.jsonl.gz
379 kB
xet
14 days ago
299d119f
part-00056.jsonl.gz
415 kB
xet
14 days ago
d474ce45
part-00057.jsonl.gz
426 kB
xet
14 days ago
7a115f2d
part-00058.jsonl.gz
465 kB
xet
14 days ago
2691486d
part-00059.jsonl.gz
414 kB
xet
14 days ago
51390a60
part-00060.jsonl.gz
463 kB
xet
14 days ago
d7a4844e
part-00061.jsonl.gz
395 kB
xet
14 days ago
c8f144e4
part-00062.jsonl.gz
509 kB
xet
14 days ago
25d4c41f
part-00063.jsonl.gz
521 kB
xet
14 days ago
68fac75a
part-00064.jsonl.gz
433 kB
xet
14 days ago
244415a0
part-00065.jsonl.gz
429 kB
xet
14 days ago
49a0a5df
part-00066.jsonl.gz
385 kB
xet
14 days ago
71c85cb4
part-00067.jsonl.gz
454 kB
xet
14 days ago
8bc3f185
part-00068.jsonl.gz
452 kB
xet
14 days ago
366ada77
part-00069.jsonl.gz
445 kB
xet
14 days ago
af2df658
part-00070.jsonl.gz
430 kB
xet
14 days ago
46be24c4
part-00071.jsonl.gz
566 kB
xet
14 days ago
36e3bdf5
part-00072.jsonl.gz
454 kB
xet
14 days ago
988b7e11
part-00073.jsonl.gz
409 kB
xet
14 days ago
aaaf0e08
part-00074.jsonl.gz
427 kB
xet
14 days ago
cf97469c
part-00075.jsonl.gz
337 kB
xet
14 days ago
0b0b4442
part-00076.jsonl.gz
542 kB
xet
14 days ago
0067fb97
part-00077.jsonl.gz
358 kB
xet
14 days ago
bf8eb41e
part-00078.jsonl.gz
650 kB
xet
14 days ago
b78c5767
part-00079.jsonl.gz
465 kB
xet
14 days ago
61f2b8f0
part-00080.jsonl.gz
488 kB
xet
14 days ago
5a3e312b
part-00081.jsonl.gz
384 kB
xet
14 days ago
566883d6
part-00082.jsonl.gz
442 kB
xet
14 days ago
67e36c6f
part-00083.jsonl.gz
498 kB
xet
14 days ago
beb2118e
part-00084.jsonl.gz
365 kB
xet
14 days ago
9a549c4c
part-00085.jsonl.gz
516 kB
xet
14 days ago
cbddf305
part-00086.jsonl.gz
526 kB
xet
14 days ago
cc342da5
part-00087.jsonl.gz
455 kB
xet
14 days ago
2890b954
part-00088.jsonl.gz
428 kB
xet
14 days ago
ecc3402b
part-00089.jsonl.gz
471 kB
xet
14 days ago
1909c32b
part-00090.jsonl.gz
454 kB
xet
14 days ago
581b7500
part-00091.jsonl.gz
509 kB
xet
14 days ago
b485fbce
part-00092.jsonl.gz
390 kB
xet
14 days ago
b34a69bb
part-00093.jsonl.gz
423 kB
xet
14 days ago
bdd1c853
part-00094.jsonl.gz
523 kB
xet
14 days ago
0c4d3d57
part-00095.jsonl.gz
474 kB
xet
14 days ago
9d9203ee
part-00096.jsonl.gz
467 kB
xet
13 days ago
313f0447
part-00097.jsonl.gz
414 kB
xet
13 days ago
2dbcaacf
part-00098.jsonl.gz
452 kB
xet
13 days ago
8fd395fa
part-00099.jsonl.gz
398 kB
xet
13 days ago
2821e6ba
Load more
Use this bucket
Total size
78.2 MB
Files
181
Last updated
Apr 4
Pre-warmed CDN
US
EU
US
EU
Contributors