Textbooks Are All You Need

2023. 7. 2. 22:29·🗣️ Natural Language Processing

날짜	모델	모델 크기	데이터셋 크기	HumanEval	MBPP
2021년 7월	Codex-300M [CTJ+21]	300M	100B	13.2%	-
2021년 7월	Codex-12B [CTJ+21]	12B	100B	28.8%	-
2022년 3월	CodeGen-Mono-350M [NPH+23]	350M	577B	12.8%	-
2022년 3월	CodeGen-Mono-16.1B [NPH+23]	16.1B	577B	29.3%	35.3%
2022년 4월	PaLM-Coder [CND+22]	540B	780B	35.9%	47.0%
2022년 9월	CodeGeeX [ZXZ+23]	13B	850B	22.9%	24.4%
2022년 11월	GPT-3.5 [Ope23]	175B	N.A.	47%	-
2022년 12월	SantaCoder [ALK+23]	1.1B	236B	14.0%	35.0%
2023년 3월	GPT-4 [Ope23]	N.A.	N.A.	67%	-
2023년 4월	Replit [Rep23]	2.7B	525B	21.9%	-
2023년 4월	Replit-Finetuned [Rep23]	2.7B	525B	30.5%	-
2023년 5월	CodeGen2-1B [NHX+23]	1B	N.A.	10.3%	-
2023년 5월	CodeGen2-7B [NHX+23]	7B	N.A.	19.1%	-
2023년 5월	StarCoder [LAZ+23]	15.5B	1T	33.6%	52.7%
2023년 5월	StarCoder-Prompted [LAZ+23]	15.5B	1T	40.8%	49.5%
2023년 5월	PaLM 2-S [ADF+23]	N.A.	N.A.	37.6%	50.0%
2023년 5월	CodeT5+ [WLG+23]	2B	52B	24.2%	-
2023년 5월	CodeT5+ [WLG+23]	16B	52B	30.9%	-
2023년 5월	InstructCodeT5+ [WLG+23]	16B	52B	35.0%	-
2023년 6월	WizardCoder [LXZ+23]	16B	1T	57.3%	51.8%
2023년 6월	phi-1	1.3B	7B	50.6%	55.5%

모델	크기	훈련 토큰	점수	HumanEval
CodeGen-Mono-350M	350M	577B	0.19	12.8%
CodeGen-Mono-16.1B	16.1B	577B	0.38	29.3%
Replit	2.7B	525B	0.37	21.9%
StarCoder	15.5B	1T	0.51	33.6%
phi-1-base	1.3B	7B	0.37	29%
phi-1-small	350M	7B	0.45	45%
phi-1	1.3B	7B	0.52	50.6%

τ	문제 개수	phi-1 재학습	StarCoder-Prompted
0.95	Similar 71 (81.7%)	74.6%	57.7%
non-similar 93 (26.9%)	32.3%	29.0%
total 164 (50.6%)	50.6%	41.5%
0.9	Similar 93 (63.4%)	51.6%	48.4%
non-similar 71 (33.8%)	36.6%	32.4%
total 164 (50.6%)	45.1%	41.5%
0.85	Similar 106 (62.3%)	52.8%	47.2%
non-similar 58 (29.3%)	34.5%	31.0%
total 164 (50.6%)	46.3%	41.5%
0.8	Similar 116 (59.5%)	52.6%	45.7%
non-similar 48 (29.2%)	27.1%	31.2%
total 164 (50.6%)	45.1%	41.5%

[Pinecone] llama-index with Pinecone (0)	2023.10.01
The Path to Achieve Ultra-Low Inference Latency With LLaMA 65B on PyTorch/XLA (0)	2023.07.06
LLM Context 확장 불가능은 아니다. (token size 늘리기 정리) (0)	2023.06.28
Text Embedding + t-SNE Visualization (0)	2023.06.22
[Langchain] paper-translator (0)	2023.06.16

'🗣️ Natural Language Processing' 카테고리의 다른 글

다했다

🚩One By 🐢One

다했다

전체

오늘

어제

검색

블로그 메뉴

hELLO· Designed By정상우.v4.9.0

Textbooks Are All You Need