Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-20260402-l40sx1-s07
Files
xet
nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s07
/
data
72.6 MB
181 files
Updated 17 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
476 kB
xet
19 days ago
55cb24f8
part-00001.jsonl.gz
418 kB
xet
19 days ago
c4bf14cd
part-00002.jsonl.gz
435 kB
xet
18 days ago
7a585706
part-00003.jsonl.gz
359 kB
xet
18 days ago
35941f96
part-00004.jsonl.gz
408 kB
xet
18 days ago
848edfb6
part-00005.jsonl.gz
502 kB
xet
18 days ago
9f4e2979
part-00006.jsonl.gz
354 kB
xet
18 days ago
96bdfb16
part-00007.jsonl.gz
406 kB
xet
18 days ago
ac4cdadc
part-00008.jsonl.gz
390 kB
xet
18 days ago
407c9a49
part-00009.jsonl.gz
397 kB
xet
18 days ago
bb94148b
part-00010.jsonl.gz
404 kB
xet
18 days ago
bfdce8a9
part-00011.jsonl.gz
308 kB
xet
18 days ago
ea190383
part-00012.jsonl.gz
375 kB
xet
18 days ago
b4d4a32d
part-00013.jsonl.gz
405 kB
xet
18 days ago
6ff084f9
part-00014.jsonl.gz
313 kB
xet
18 days ago
0250b19c
part-00015.jsonl.gz
361 kB
xet
18 days ago
a14a6fdf
part-00016.jsonl.gz
410 kB
xet
18 days ago
e422e4a4
part-00017.jsonl.gz
308 kB
xet
18 days ago
aa102f88
part-00018.jsonl.gz
361 kB
xet
18 days ago
e9984be3
part-00019.jsonl.gz
372 kB
xet
18 days ago
0f368600
part-00020.jsonl.gz
443 kB
xet
18 days ago
6ca4d095
part-00021.jsonl.gz
454 kB
xet
18 days ago
26cc6540
part-00022.jsonl.gz
384 kB
xet
18 days ago
7c27fb3f
part-00023.jsonl.gz
423 kB
xet
18 days ago
337bad06
part-00024.jsonl.gz
465 kB
xet
18 days ago
e1b2d613
part-00025.jsonl.gz
433 kB
xet
18 days ago
c0a2c625
part-00026.jsonl.gz
328 kB
xet
18 days ago
43d6b024
part-00027.jsonl.gz
430 kB
xet
18 days ago
6846e99b
part-00028.jsonl.gz
369 kB
xet
18 days ago
43ecf365
part-00029.jsonl.gz
394 kB
xet
18 days ago
48c2e2c6
part-00030.jsonl.gz
395 kB
xet
18 days ago
cced8e18
part-00031.jsonl.gz
416 kB
xet
18 days ago
647953cf
part-00032.jsonl.gz
331 kB
xet
18 days ago
2b5b098e
part-00033.jsonl.gz
401 kB
xet
18 days ago
8c7dba2c
part-00034.jsonl.gz
527 kB
xet
18 days ago
0e7572f6
part-00035.jsonl.gz
466 kB
xet
18 days ago
92e79e16
part-00036.jsonl.gz
430 kB
xet
18 days ago
8b962c16
part-00037.jsonl.gz
397 kB
xet
18 days ago
a0016847
part-00038.jsonl.gz
408 kB
xet
18 days ago
723b1761
part-00039.jsonl.gz
397 kB
xet
18 days ago
2b9fcbe7
part-00040.jsonl.gz
416 kB
xet
18 days ago
b7288136
part-00041.jsonl.gz
489 kB
xet
18 days ago
3081150b
part-00042.jsonl.gz
365 kB
xet
18 days ago
aa9e08c9
part-00043.jsonl.gz
456 kB
xet
18 days ago
f2e5b4d5
part-00044.jsonl.gz
416 kB
xet
18 days ago
897b2b30
part-00045.jsonl.gz
432 kB
xet
18 days ago
ceef261f
part-00046.jsonl.gz
404 kB
xet
18 days ago
42fedb65
part-00047.jsonl.gz
415 kB
xet
18 days ago
70e6c1d1
part-00048.jsonl.gz
397 kB
xet
18 days ago
ab575996
part-00049.jsonl.gz
352 kB
xet
18 days ago
0dd582b3
part-00050.jsonl.gz
437 kB
xet
18 days ago
f5715cc3
part-00051.jsonl.gz
364 kB
xet
18 days ago
dddb6023
part-00052.jsonl.gz
453 kB
xet
18 days ago
fb644231
part-00053.jsonl.gz
449 kB
xet
18 days ago
f5d6482e
part-00054.jsonl.gz
427 kB
xet
18 days ago
e95fcd72
part-00055.jsonl.gz
459 kB
xet
18 days ago
7d4a88f2
part-00056.jsonl.gz
388 kB
xet
18 days ago
6960896d
part-00057.jsonl.gz
419 kB
xet
18 days ago
b7ba2790
part-00058.jsonl.gz
338 kB
xet
18 days ago
a0de5049
part-00059.jsonl.gz
378 kB
xet
18 days ago
688e5534
part-00060.jsonl.gz
461 kB
xet
18 days ago
39753638
part-00061.jsonl.gz
406 kB
xet
18 days ago
94019732
part-00062.jsonl.gz
267 kB
xet
18 days ago
5dca1a31
part-00063.jsonl.gz
339 kB
xet
18 days ago
e55412f8
part-00064.jsonl.gz
372 kB
xet
18 days ago
b2a69044
part-00065.jsonl.gz
550 kB
xet
18 days ago
eec83c1a
part-00066.jsonl.gz
377 kB
xet
18 days ago
bbb3674d
part-00067.jsonl.gz
460 kB
xet
18 days ago
356d7840
part-00068.jsonl.gz
443 kB
xet
18 days ago
7d8945ec
part-00069.jsonl.gz
392 kB
xet
18 days ago
e2d5cc1e
part-00070.jsonl.gz
434 kB
xet
18 days ago
c10079de
part-00071.jsonl.gz
401 kB
xet
18 days ago
a840c73e
part-00072.jsonl.gz
396 kB
xet
18 days ago
f492fe2e
part-00073.jsonl.gz
373 kB
xet
18 days ago
04d15bbc
part-00074.jsonl.gz
407 kB
xet
18 days ago
1546547e
part-00075.jsonl.gz
376 kB
xet
18 days ago
521c4576
part-00076.jsonl.gz
362 kB
xet
18 days ago
4c9994f2
part-00077.jsonl.gz
332 kB
xet
18 days ago
d49b5513
part-00078.jsonl.gz
452 kB
xet
18 days ago
1f0afd01
part-00079.jsonl.gz
367 kB
xet
18 days ago
f008a8dc
part-00080.jsonl.gz
399 kB
xet
18 days ago
14f0b632
part-00081.jsonl.gz
392 kB
xet
18 days ago
20611db3
part-00082.jsonl.gz
334 kB
xet
18 days ago
f2e04836
part-00083.jsonl.gz
439 kB
xet
18 days ago
a14c6ba4
part-00084.jsonl.gz
402 kB
xet
18 days ago
7aab6361
part-00085.jsonl.gz
398 kB
xet
18 days ago
ecfb431a
part-00086.jsonl.gz
386 kB
xet
18 days ago
7340629a
part-00087.jsonl.gz
464 kB
xet
18 days ago
420270e2
part-00088.jsonl.gz
421 kB
xet
18 days ago
c95e2a35
part-00089.jsonl.gz
379 kB
xet
18 days ago
858570b3
part-00090.jsonl.gz
349 kB
xet
18 days ago
21d25b30
part-00091.jsonl.gz
497 kB
xet
18 days ago
4e9363b4
part-00092.jsonl.gz
416 kB
xet
18 days ago
28f3efa0
part-00093.jsonl.gz
304 kB
xet
18 days ago
2a99896b
part-00094.jsonl.gz
412 kB
xet
18 days ago
a865fa1f
part-00095.jsonl.gz
387 kB
xet
18 days ago
0f6c8544
part-00096.jsonl.gz
387 kB
xet
18 days ago
0f7d5ec7
part-00097.jsonl.gz
324 kB
xet
18 days ago
ffec6343
part-00098.jsonl.gz
464 kB
xet
18 days ago
633b4149
part-00099.jsonl.gz
324 kB
xet
18 days ago
cc093e35
Load more
Use this bucket
Total size
72.6 MB
Files
181
Last updated
Apr 4
Pre-warmed CDN
US
EU
US
EU
Contributors