LLM Paperlist - a Kevin16 Collection

Kevin16 's Collections

Inference Acceleration

Video Understanding

LLM Paperlist

updated Jul 30, 2024

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 59
CRAG -- Comprehensive RAG Benchmark

Paper • 2406.04744 • Published Jun 7, 2024 • 46
Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

Paper • 2406.04594 • Published Jun 7, 2024 • 6
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published Jun 6, 2024 • 29
4-bit Shampoo for Memory-Efficient Network Training

Paper • 2405.18144 • Published May 28, 2024 • 12
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 22
Phased Consistency Model

Paper • 2405.18407 • Published May 28, 2024 • 48
2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 26
Yuan 2.0-M32: Mixture of Experts with Attention Router

Paper • 2405.17976 • Published May 28, 2024 • 21
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

Paper • 2405.18377 • Published May 28, 2024 • 21
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 64
ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 51
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Paper • 2406.19215 • Published Jun 27, 2024 • 32
Visual Haystacks: Answering Harder Questions About Sets of Images

Paper • 2407.13766 • Published Jul 18, 2024 • 2