728x90

Model deprecations

 OpenAI์™€ ๊ฐ™์ด vectordb๋ฅผ ๊ตฌ์ถ•ํ•  ์ˆ˜ ์žˆ๊ฒŒ embedding API๋ฅผ ์ง€์›ํ•˜๋Š” ์„œ๋น„์Šค๊ฐ€ ๋งŽ์•„์กŒ๋‹ค. ์ด๋Ÿฌํ•œ platform์€ ์ผ๋ฐ˜์ธ๋“ค์ด ๋ณด์œ ํ•˜๊ธฐ ์–ด๋ ค์šด GPU ์ž์›์„ ํ•ด์†Œํ•ด ์ฃผ๋ฉด์„œ ์ €๋ ดํ•˜๊ฒŒ ์ด์šฉํ•  ์ˆ˜ ์žˆ์œผ๋‚˜ ํฌ๋‚˜ ํฐ ๋‹จ์ ์ด ์žˆ๋‹ค. legacy model์˜ ์ง€์›์ด ์˜์›ํ•˜์ง€ ์•Š๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ์˜ˆ๋ฅผ ๋“ค๋ฉด ๋‚ด๊ฐ€ ๋ชจ์€ ์ž๋ฃŒ๋ฅผ ๋ชจ๋‘ vectorํ™” ์‹œ์ผœ vectordb๋ฅผ ๊ตฌ์ถ•ํ•˜๊ณ  RAG๋‚˜ RetrievalQA๋ฅผ ํ†ตํ•ด ์งˆ์˜๋ฅผ ํ–ˆ๋Š”๋ฐ ์ž˜ ๋‚˜์˜ค๋˜ ๋‹ต๋ณ€์ด ์กฐ๊ธˆ์”ฉ ํ‹€์–ด์งˆ ์ˆ˜ ์žˆ๋‹ค. ๊ทธ ์›์ธ์œผ๋กœ text-embedding-ada-002๋กœ ๊ตฌ์ถ•ํ•ด ๋†“์€ embedding vector ๊ฐ’๋“ค์ด text-embedding-ada-003์—์„œ๋Š” ์œ ํšจํ•˜์ง€ ์•Š์•„ ๊ทธ๋ ‡๋‹ค. ๋”ฐ๋ผ์„œ ์ด๋Ÿฌํ•œ ์ผ์ด ์—†์œผ๋ ค๋ฉด text-embedding-ada-002 ์„œ๋น„์Šค๊ฐ€ ์ข…๋ฃŒ๋˜๊ธฐ ์ „์— v3๋กœ ๋‹ค migration์„ ํ•ด์•ผ ํ•œ๋‹ค. platform์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์€ ํŽธ๋ฆฌํ•˜์ง€๋งŒ ์ด๋ ‡๊ฒŒ ๋ฒ„์ „ ์—…์ด ๋น ๋ฅด๊ฒŒ ์ด๋ฃจ์–ด์ง€๋ฉด ์งˆ์ˆ˜๋ก ์˜คํžˆ๋ ค ๋น„์šฉ๊ณผ ์‹œ๊ฐ„์ด ๋งŽ์ด ์†Œ์š”๋  ์ˆ˜ ์žˆ๋‹ค.

 

https://platform.openai.com/docs/deprecations

embedding model deprecation date

 

 

 ํ•œ๊ธ€ embedding ๊นกํŒจ์˜ text-embedding-ada-002๋Š” ์•„์ง ์ง€์› ์ค‘๋‹จ์ด ์—†์ง€๋งŒ ์ง€์›์ด ์ค‘๋‹จ๋œ๋‹ค๋ฉด ์—ฌํŒŒ๊ฐ€ ํด๊ฒƒ์œผ๋กœ ๋ณด์ธ๋‹ค.

https://github.com/ssisOneTeam/Korean-Embedding-Model-Performance-Benchmark-for-Retriever

 

GitHub - ssisOneTeam/Korean-Embedding-Model-Performance-Benchmark-for-Retriever: Korean Sentence Embedding Model Performance Ben

Korean Sentence Embedding Model Performance Benchmark for RAG - ssisOneTeam/Korean-Embedding-Model-Performance-Benchmark-for-Retriever

github.com

 

 

 ๋Œ€์•ˆ : Local Embedding Model

Local embedding model์ด ์ •๋ง ๋งŽ์ง€๋งŒ ํ•œ๊ตญ์–ด ํŠนํ™”, ํ•œ๊ตญ์–ด ์ „์šฉ ๋ชจ๋ธ์€ ๋งŽ์ด ์—†๋‹ค. Model hosting site Huggingface์—์„œ sentence-transformers ์ง€์› ๋ชจ๋ธ์—์„œ ์ฐพ์•„๋ณด๋ฉด ๋œ๋‹ค. ์ด ๋ชจ๋ธ tag๊ฐ€ ์žˆ๋Š” ๋ชจ๋ธ์€ langchain, chromadb ๋“ฑ ๋‹ค์–‘ํ•œ ๋ชจ๋ธ์—์„œ ๋ชจ๋ธ ๋ช…๋งŒ ์•Œ๋ฉด import, download ๊ฐ€๋Šฅํ•˜๋‹ค. 

https://huggingface.co/models?library=sentence-transformers

 

Models - Hugging Face

 

huggingface.co

 

 ๋‚ด๊ฐ€ ์›ํ•˜๋Š” ๋ชจ๋ธ์€ ์ฃผ๋กœ ํ—ˆ์šฉ token ์ˆ˜๊ฐ€ ๊ธธ์–ด์•ผํ•˜๊ณ  coding๊ณผ ํ•œ๊ตญ์–ด๋ฅผ ์ž˜ํ•˜๋Š” ๋ชจ๋ธ์„ ์ฃผ๋กœ ์ฐพ์•˜๋‹ค. max_length๊ฐ€ ์งง์œผ๋ฉด ํ—ˆ์šฉ ๋ฌธ์žฅ์˜ ๊ธธ์ด๋ฅผ ์ž„์˜๋กœ ์ž˜๋ผ์ค˜์•ผ ํ•ด ์ „์ฒ˜๋ฆฌ์— ์ˆ˜๊ณ ๊ฐ€ ๋งŽ์•„์ง„๋‹ค. ๋ฒˆ๊ฑฐ๋กœ์›€์„ ์ค„์ด๊ธฐ ์œ„ํ•ด ์ ๋‹นํ•˜๊ณ  ํฐ embedding model ์ด์—ฌ์•ผ ํ•ด bge-m3 Embedding Model์„ ์‚ฌ์šฉํ–ˆ๋‹ค. 

 

bge-m3

 ์ค‘๊ตญ๊ณผํ•™๊ธฐ์ˆ ๋Œ€ํ•™๊ต์—์„œ ๋ฐœํ‘œํ•œ ๋ชจ๋ธ๋กœ ๋‹ค๊ตญ์–ด 100๊ฐœ ์ด์ƒ์˜ ์–ธ์–ด๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š” ๋ชจ๋ธ ๋ถ€๋ฌธ์—์„œ SOTA๋กœ ์†Œ๊ฐœํ•˜๊ณ  ์žˆ๋‹ค. LLM๊ณผ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ์—ฌ๋Ÿฌ ๊ฒ€์ฆ ๋ฐฉ์‹์ด ์žˆ๊ณ  ์ง€ํ‘œ๊ฐ€ ์žˆ์ง€๋งŒ ๊ฒฐ๊ณผ๊ฐ€ ๊ฐœ๊ฐœ์ธ๋งˆ๋‹ค ๋Š๋ผ๋Š” ์ฐจ์ด์™€ ์˜ค๋ฅ˜๊ฐ€ ๋‹ค์–‘ํ•ด HumanEval์„ ์ง์ ‘ ์ˆ˜ํ–‰ํ•˜์ง€ ์•Š๋Š” ์ด์ƒ ์–ด๋Š ์ •๋„ ์ข‹์•„ ์กŒ๋Š”์ง€ ์•Œ ์ˆ˜๋Š” ์—†์œผ๋‚˜ local๋กœ ์ œ๊ณตํ•ด ์ฃผ๋Š” ๊ฒƒ๋งŒ์œผ๋กœ๋„ ๊ฐ์‚ฌํ•˜๊ณ  8192์˜ max_length๊นŒ์ง€ ์ปค๋ฒ„ ๊ฐ€๋Šฅํ•ด ์ถ”์ฒœํ•œ๋‹ค.

 

MKQA, Multilingual Knowledge Questions & Answers

 

๋ฐ˜์‘ํ˜•

'๐Ÿ› ๏ธ Tools' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[Gemini] gemini calculate Tokenize in Locally  (0) 2024.07.06
[Ollama] Response Structure Answer  (0) 2024.07.01
[draw.io] sql๋ฌธ ๊ฐ€์ ธ์˜ค๊ธฐ  (0) 2024.06.03
[crewAI] Multi-agent Custormer Support Automation (3)  (0) 2024.05.25
[CrewAI] Key elements of AI agent (2)  (0) 2024.05.21
๋‹คํ–ˆ๋‹ค