Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains

04 Apr 2025 in Llm

Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains
This paper was published on 2025.03.31.

MoE; Mixtral of Experts

27 Mar 2025 in Llm

Mixtral of Experts

AI Agent; The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

21 Mar 2025 in Llm

The AI Scientist: Towards Fully Automated

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

07 Mar 2025 in Llm

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes (2023.05)

about docker

02 Mar 2025 in Trivia

bla

Eval; G-EVAL: NLG Evaluation using GPT-4 with Better Human Alignment

28 Feb 2025 in Llm

G-EVAL: NLG Evaluation using GPT-4 with Better Human Alignment
2023.05

Environment setup: homebrew, git, pyenv, poetry

27 Feb 2025 in Trivia

Installing homebrew

1. (macOS) Open a terminal and enter the command below

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

2. Enter your macOS password
3. Set the path so that the brew command can be used
```
echo 'eval "$(/opt/homebrew/bin/brew shellenv)"' >> ~/.zprofile
eval "$(/opt/homebrew/bin/brew shellenv)"
```
4. which brew
- Prints the location of the brew executable; substitute it for <location> below
echo 'eval "$(<location> shellenv)"' >> ~/.zprofile
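
The path-setup steps above can be sketched as a small helper that avoids appending the same line twice when the setup is re-run. This is only an illustrative sketch: the function names `brew_shellenv_line` and `add_brew_to_profile` are hypothetical and not part of Homebrew, and it assumes brew is already installed and that `~/.zprofile` is the profile file in use.

```shell
# Hypothetical helper: build the profile line for a given brew location.
brew_shellenv_line() {
  # Single quotes keep "$(" literal; %s is replaced by the brew path.
  printf 'eval "$(%s shellenv)"\n' "$1"
}

# Hypothetical helper: append the line to a profile file only if it is missing.
add_brew_to_profile() {
  brew_path="$1"
  profile="$2"
  line="$(brew_shellenv_line "$brew_path")"
  # -x matches the whole line, -F treats the pattern as a fixed string.
  grep -qxF -- "$line" "$profile" 2>/dev/null || printf '%s\n' "$line" >> "$profile"
}

# Usage (assumes brew is already reachable in the current session):
# add_brew_to_profile "$(command -v brew)" "$HOME/.zprofile"
```

Running the helper a second time leaves the profile unchanged, which is the main reason to prefer it over a bare `echo ... >>`.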

Eval; CRAG: Comprehensive RAG Benchmark

25 Feb 2025 in Llm

CRAG: Comprehensive RAG Benchmark
2024.07

AI Agent; Can Large Language Model Agents Simulate Human Trust Behavior?

21 Feb 2025 in Llm

Can Large Language Model Agents Simulate Human Trust Behavior?

REALM: Retrieval-Augmented Language Model Pre-Training

07 Feb 2025 in Llm

REALM: Retrieval-Augmented Language Model Pre-Training
2020.02