[Candle] huggingface Candle
ยท
๐Ÿ› ๏ธ Tools
git clone https://github.com/huggingface/candle.git Candle ์ด๋ž€? Candle์€ ์„ฑ๋Šฅ(GPU ์ง€์›)๊ณผ ์‚ฌ์šฉ ํŽธ์˜์„ฑ์— ์ค‘์ ์„ ๋‘” rust ์šฉ Minimalist ML ํ”„๋ ˆ์ž„์›Œํฌ์ž…๋‹ˆ๋‹ค. whisper, LLaMA2, T5, yolo, Segment Anything์„ ํ•œ๋ฒˆ์— ๋ถˆ๋Ÿฌ์™€ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๊ณ  huggingface์˜ ๋‹ค์–‘ํ•œ ๋ชจ๋ธ์„ ์ง€์›ํ•ฉ๋‹ˆ๋‹ค. Kakaobot์— ์‚ฌ์šฉํ•  Stable Diffusion ์‚ฌ์šฉ ์˜ˆ์‹œ ์„ค์น˜ ๋ฐฉ๋ฒ• 1) Candle์€ Rust๋ฅผ ์‚ฌ์šฉํ•˜๋ฏ€๋กœ rust package ๊ด€๋ฆฌ์ž์ธ Cargo๋ฅผ ์„ค์น˜ํ•ด์ค€๋‹ค. curl https://sh.rustup.rs -sSf | sh # env enroll source "$HOME/.cargo/env" rustc ..
[OpenAI] Whisper - Robust Speech Recognition via Large-Scale Weak Supervision
ยท
๐Ÿ‘พ Deep Learning
https://arxiv.org/abs/2212.04356 Robust Speech Recognition via Large-Scale Weak Supervision We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet. When scaled to 680,000 hours of multilingual and multitask supervision, the resulting models generalize well to standard arxiv.org Robust Speech Recognition via Large-Sca..
[M1] Whisper.cpp Deploy C++ (ALL OS-)
ยท
๐Ÿ‘พ Deep Learning
https://github.com/ggerganov/whisper.cpp GitHub - ggerganov/whisper.cpp: Port of OpenAI's Whisper model in C/C++ Port of OpenAI's Whisper model in C/C++. Contribute to ggerganov/whisper.cpp development by creating an account on GitHub. github.com M1 Install 1 . git clone์œผ๋กœ ์ตœ์‹  ๋ฒ„์ „์œผ๋กœ ์„ค์น˜ํ•  ๊ฒฝ์šฐ M1์—์„œ .o architecture error ๋ฐœ์ƒ์œผ๋กœ [stable version]์„ ๋‹ค์šด๋กœ๋“œ ํ•œ๋‹ค. https://github.com/ggerganov/whisper.cpp/releases/..
[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (4)
ยท
๐Ÿ‘พ Deep Learning
https://bnmy6581.tistory.com/133 --(1) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) bnmy6581.tistory.com https://bnmy6581.tistory.com/134 --(2) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) bnmy6581.tistory.com https://bnmy6581.tistory.com/135--(3) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) bnmy6581.tistor..
[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (3)
ยท
๐Ÿ‘พ Deep Learning
https://bnmy6581.tistory.com/133 --(1) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) bnmy6581.tistory.com https://bnmy6581.tistory.com/134 --(2) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (2) https://bnmy6581.tistory.com/133 --(1) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) bnmy6581.tistory.com https://arxiv...
[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1)
ยท
๐Ÿ‘พ Deep Learning
[Whisper] Kspon Valid --- (2) CER
ยท
๐Ÿ‘พ Deep Learning
Robust Speech Recognition via Large-Scale Weak Supervision *large model์€ 2023.1 large-v2์™€ ๋™์ผํ•˜๊ฒŒ ๋ฐ”๋€œ KsponSpeech ๋ฐ์ดํ„ฐ๋Š” ์งง์€ ๋ฐœํ™”์˜ audio๋ฅผ ์ฃผ๋กœ ๊ตฌ์„ฑ๋˜์–ด์žˆ๋‹ค. Whisper๋Š” 99๊ฐœ์˜ ํ† ํฐ์œผ๋กœ ์ฒ˜์Œ ๋ฐœํ™”์— ๋Œ€ํ•œ ์–ธ์–ด ์˜ˆ์ธก(language identification)์„ ์ˆ˜ํ–‰ํ•œ๋‹ค. ํ•˜์ง€๋งŒ ๋„ˆ๋ฌด ์งง์€ ๋ฐœํ™” ๊ฐ™์€ ๊ฒฝ์šฐ whisper๊ฐ€ ๋‹ค๋ฅธ ์–ธ์–ด๋กœ ์˜ˆ์ธกํ•ด translate ์ž์ฒด๊ฐ€ ํ‹€๋ ค๋ฒ„๋ ค CER์ด ์ฆ๊ฐ€ํ•˜๋Š” ๊ฒƒ์„ ๋ณผ ์ˆ˜ ์žˆ๋‹ค. language Configure์„ korean์œผ๋กœ ์„ค์ •ํ•˜๋ฉด language identification์„ ์ˆ˜ํ–‰ํ•˜์ง€ ์•Š๊ณ  ๋ฐ”๋กœ transcript๋กœ ์˜ˆ์ธกํ•ด ๋” ์ข‹์€ ์„ฑ๊ณผ๊ฐ€ ๋‚ฌ๋‹ค. model size๋Š” ์˜ˆ..
[Whisper] (1) - Abstract & Introduction
ยท
๐Ÿ‘พ Deep Learning
https://github.com/openai/whisper GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision Robust Speech Recognition via Large-Scale Weak Supervision - GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision github.com Paper Review Abstract & Introduction 680,000 ์‹œ๊ฐ„์˜ ๋‹ค๊ตญ์–ด ํ•™์Šต์„ ์ง„ํ–‰ ์‹œ fine-tuning ์—†์ด zero-shot transfer benchmark ์ˆ˜์ค€์˜ ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋‹ค. ..
๋‹คํ–ˆ๋‹ค
'Whisper' ํƒœ๊ทธ์˜ ๊ธ€ ๋ชฉ๋ก