Large Language Model

llm만 하는건 아니고 그냥 AI 논문 리뷰. 레이아웃 조만간 바꿀 예정 ..

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
GPT-1: Improving Language Understanding by Generative Pre-Training
GPT-2: Language Models are Unsupervised Multitask Learners
GPT-3: Language Models are Few-Shot Learners
FLAN: Finetuned Language Models Are Zero-shot Learners
RAG: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
LLaVA: Visual Instruction Tuning
T-RAG: LESSONS FROM THE LLM TRENCHES
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
REALM: Retrieval-Augmented Language Model Pre-Training
LLM Agent; Can Large Language Model Agents Simulate Human Trust Behavior?
Eval; CRAG: Comprehensive RAG Benchmark
Eval; G-EVAL: NLG Evaluation using GPT-4 with Better Human Alignment
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
AI Agent; The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
MoE; Mixtral of Experts
Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains
Magma: A Foundation Model for Multimodal AI Agents