Collections
Collections including paper arxiv:2311.06242
- Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
  Paper • 2402.14848 • Published • 19
- The Prompt Report: A Systematic Survey of Prompting Techniques
  Paper • 2406.06608 • Published • 68
- CRAG -- Comprehensive RAG Benchmark
  Paper • 2406.04744 • Published • 46
- Transformers meet Neural Algorithmic Reasoners
  Paper • 2406.09308 • Published • 44

- Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
  Paper • 2405.08748 • Published • 23
- Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection
  Paper • 2405.10300 • Published • 31
- Chameleon: Mixed-Modal Early-Fusion Foundation Models
  Paper • 2405.09818 • Published • 133
- OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework
  Paper • 2405.11143 • Published • 41

- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
  Paper • 2311.06242 • Published • 95
- microsoft/Florence-2-large
  Image-Text-to-Text • 0.8B • Updated • 1.28M • 1.79k
- microsoft/Florence-2-base
  Image-Text-to-Text • 0.2B • Updated • 581k • 359
- microsoft/Florence-2-large-ft
  Image-Text-to-Text • 0.8B • Updated • 38.6k • 384

- openbmb/MiniCPM-V-2
  Visual Question Answering • 3B • Updated • 77.1k • 495
- HuggingFaceM4/idefics2-8b-base
  Image-Text-to-Text • 8B • Updated • 1.24k • 28
- HuggingFaceM4/idefics2-8b
  Image-Text-to-Text • 8B • Updated • 144k • 621
- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
  Paper • 2311.06242 • Published • 95