Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-20260402-l40sx1-s11
Files
xet
nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s11
/
data
78.4 MB
181 files
Updated 10 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
359 kB
xet
12 days ago
44beca18
part-00001.jsonl.gz
382 kB
xet
12 days ago
bcb901e2
part-00002.jsonl.gz
373 kB
xet
12 days ago
8d05e550
part-00003.jsonl.gz
439 kB
xet
12 days ago
ceb61af4
part-00004.jsonl.gz
403 kB
xet
12 days ago
4b640cbd
part-00005.jsonl.gz
358 kB
xet
12 days ago
8c9b957a
part-00006.jsonl.gz
416 kB
xet
12 days ago
81f977cc
part-00007.jsonl.gz
420 kB
xet
12 days ago
7dc26681
part-00008.jsonl.gz
306 kB
xet
12 days ago
4472f345
part-00009.jsonl.gz
498 kB
xet
12 days ago
b75cf411
part-00010.jsonl.gz
440 kB
xet
12 days ago
f8ad9fb1
part-00011.jsonl.gz
377 kB
xet
12 days ago
e7308225
part-00012.jsonl.gz
387 kB
xet
12 days ago
15e02b3a
part-00013.jsonl.gz
485 kB
xet
12 days ago
d4cfe6cf
part-00014.jsonl.gz
464 kB
xet
12 days ago
fbf001c4
part-00015.jsonl.gz
440 kB
xet
12 days ago
68a787b2
part-00016.jsonl.gz
434 kB
xet
12 days ago
a3c60492
part-00017.jsonl.gz
358 kB
xet
12 days ago
e3e09eaf
part-00018.jsonl.gz
371 kB
xet
12 days ago
39502811
part-00019.jsonl.gz
398 kB
xet
12 days ago
9dd33aaf
part-00020.jsonl.gz
397 kB
xet
12 days ago
6e099165
part-00021.jsonl.gz
389 kB
xet
12 days ago
8accd457
part-00022.jsonl.gz
437 kB
xet
12 days ago
a6988bb3
part-00023.jsonl.gz
354 kB
xet
12 days ago
7d129198
part-00024.jsonl.gz
464 kB
xet
12 days ago
e54a1939
part-00025.jsonl.gz
444 kB
xet
12 days ago
a6a579fa
part-00026.jsonl.gz
296 kB
xet
12 days ago
13929862
part-00027.jsonl.gz
505 kB
xet
12 days ago
dce740c3
part-00028.jsonl.gz
412 kB
xet
12 days ago
bf1ff03b
part-00029.jsonl.gz
464 kB
xet
12 days ago
bed3b9ab
part-00030.jsonl.gz
278 kB
xet
12 days ago
7ded67f4
part-00031.jsonl.gz
502 kB
xet
12 days ago
f2e4992e
part-00032.jsonl.gz
435 kB
xet
12 days ago
fad9087b
part-00033.jsonl.gz
374 kB
xet
12 days ago
027f2783
part-00034.jsonl.gz
515 kB
xet
12 days ago
2360bf82
part-00035.jsonl.gz
358 kB
xet
12 days ago
54a7d3fb
part-00036.jsonl.gz
422 kB
xet
12 days ago
4f00f6f9
part-00037.jsonl.gz
327 kB
xet
12 days ago
2d05417e
part-00038.jsonl.gz
456 kB
xet
12 days ago
38924819
part-00039.jsonl.gz
347 kB
xet
12 days ago
6a53126b
part-00040.jsonl.gz
468 kB
xet
12 days ago
5521cfe9
part-00041.jsonl.gz
471 kB
xet
12 days ago
50085356
part-00042.jsonl.gz
485 kB
xet
12 days ago
8253f8ff
part-00043.jsonl.gz
357 kB
xet
12 days ago
7c0d79ab
part-00044.jsonl.gz
382 kB
xet
12 days ago
bfb1b657
part-00045.jsonl.gz
432 kB
xet
12 days ago
5aea41aa
part-00046.jsonl.gz
462 kB
xet
12 days ago
5845d61f
part-00047.jsonl.gz
328 kB
xet
12 days ago
937f7368
part-00048.jsonl.gz
373 kB
xet
12 days ago
7037854c
part-00049.jsonl.gz
396 kB
xet
12 days ago
f3817e04
part-00050.jsonl.gz
424 kB
xet
12 days ago
49a6c35e
part-00051.jsonl.gz
450 kB
xet
12 days ago
428a244e
part-00052.jsonl.gz
380 kB
xet
12 days ago
e103d0d2
part-00053.jsonl.gz
544 kB
xet
12 days ago
70cbb391
part-00054.jsonl.gz
434 kB
xet
12 days ago
9b40db73
part-00055.jsonl.gz
382 kB
xet
12 days ago
469e0579
part-00056.jsonl.gz
419 kB
xet
12 days ago
d72f6c11
part-00057.jsonl.gz
353 kB
xet
12 days ago
06a91a40
part-00058.jsonl.gz
419 kB
xet
12 days ago
7a0b713d
part-00059.jsonl.gz
300 kB
xet
12 days ago
944e4450
part-00060.jsonl.gz
497 kB
xet
12 days ago
336990fe
part-00061.jsonl.gz
404 kB
xet
12 days ago
6a18dc8c
part-00062.jsonl.gz
496 kB
xet
12 days ago
15531bee
part-00063.jsonl.gz
423 kB
xet
12 days ago
1ba934c4
part-00064.jsonl.gz
367 kB
xet
12 days ago
e7a904b1
part-00065.jsonl.gz
463 kB
xet
12 days ago
b981fb3d
part-00066.jsonl.gz
367 kB
xet
12 days ago
0e1bab13
part-00067.jsonl.gz
420 kB
xet
11 days ago
baac6d0f
part-00068.jsonl.gz
421 kB
xet
11 days ago
57ae77e2
part-00069.jsonl.gz
515 kB
xet
11 days ago
4d2527d0
part-00070.jsonl.gz
539 kB
xet
11 days ago
7b6d58fb
part-00071.jsonl.gz
435 kB
xet
11 days ago
7be46661
part-00072.jsonl.gz
391 kB
xet
11 days ago
20481327
part-00073.jsonl.gz
473 kB
xet
11 days ago
ceb2c077
part-00074.jsonl.gz
370 kB
xet
11 days ago
17458e5e
part-00075.jsonl.gz
448 kB
xet
11 days ago
58c3dd75
part-00076.jsonl.gz
532 kB
xet
11 days ago
9e202025
part-00077.jsonl.gz
558 kB
xet
11 days ago
dabc8ec5
part-00078.jsonl.gz
441 kB
xet
11 days ago
9acc1152
part-00079.jsonl.gz
489 kB
xet
11 days ago
ce254702
part-00080.jsonl.gz
396 kB
xet
11 days ago
b44dd2a0
part-00081.jsonl.gz
383 kB
xet
11 days ago
065f11d4
part-00082.jsonl.gz
462 kB
xet
11 days ago
d3c4640f
part-00083.jsonl.gz
476 kB
xet
11 days ago
c9a7146d
part-00084.jsonl.gz
395 kB
xet
11 days ago
86e396a5
part-00085.jsonl.gz
493 kB
xet
11 days ago
788920b9
part-00086.jsonl.gz
530 kB
xet
11 days ago
09c17ad8
part-00087.jsonl.gz
449 kB
xet
11 days ago
6a946f28
part-00088.jsonl.gz
490 kB
xet
11 days ago
e3bbab2c
part-00089.jsonl.gz
422 kB
xet
11 days ago
fbc12fd9
part-00090.jsonl.gz
483 kB
xet
11 days ago
49fe27fa
part-00091.jsonl.gz
369 kB
xet
11 days ago
a5f39574
part-00092.jsonl.gz
544 kB
xet
11 days ago
2f6231b2
part-00093.jsonl.gz
544 kB
xet
11 days ago
5672472e
part-00094.jsonl.gz
443 kB
xet
11 days ago
c0b25d01
part-00095.jsonl.gz
455 kB
xet
11 days ago
7d94552d
part-00096.jsonl.gz
392 kB
xet
11 days ago
21702c16
part-00097.jsonl.gz
461 kB
xet
11 days ago
5c26eea6
part-00098.jsonl.gz
342 kB
xet
11 days ago
a638656d
part-00099.jsonl.gz
441 kB
xet
11 days ago
cd7e3828
Load more
Use this bucket
Total size
78.4 MB
Files
181
Last updated
Apr 4
Pre-warmed CDN
US
EU
US
EU
Contributors