LLM Paperlist
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
• 2406.04692
• Published • 59
CRAG -- Comprehensive RAG Benchmark
Paper
• 2406.04744
• Published • 46
Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Paper
• 2406.04594
• Published • 6
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
Paper
• 2406.04271
• Published • 29
4-bit Shampoo for Memory-Efficient Network Training
Paper
• 2405.18144
• Published • 12
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper
• 2405.19332
• Published • 22
Paper
• 2405.18407
• Published • 48
2BP: 2-Stage Backpropagation
Paper
• 2405.18047
• Published • 26
Yuan 2.0-M32: Mixture of Experts with Attention Router
Paper
• 2405.17976
• Published • 21
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models
Paper
• 2405.18377
• Published • 21
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published • 64
ColPali: Efficient Document Retrieval with Vision Language Models
Paper
• 2407.01449
• Published • 51
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation
Paper
• 2406.19215
• Published • 32
Visual Haystacks: Answering Harder Questions About Sets of Images
Paper
• 2407.13766
• Published • 2