[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (3)
·
👾 Deep Learning
https://bnmy6581.tistory.com/133 --(1) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) https://bnmy6581.tistory.com/134 --(2) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (2) https://arxiv...
[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (2)
·
👾 Deep Learning
https://bnmy6581.tistory.com/133 --(1) [Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1) https://arxiv.org/abs/2109.07740 Scaling Laws for Neural Machine Translation: We present an empirical study of scaling properties of encoder-decoder Transformer models used in neural machine translation (NMT). We show that cross-entropy loss as a function of model..
[Whisper] Robust Speech Recognition via Large-Scale Weak Supervision - (1)
·
👾 Deep Learning
[leetcode-2] Add Two Numbers
·
🐢 One step
LeetCode-2 Add Two Numbers : Add Two Numbers in Linked List
note : You may assume the two numbers do not contain any leading zero, except the number 0 itself.
myAnswer

    :::python
    # Definition for singly-linked list.
    class ListNode:
        def __init__(self, val=0, next=None):
            self.val = val
            self.next = next

    class Solution:
        def addTwoNumbers(self, l1: Optional[ListNode], l2: Optional[ListNode]) -> Optional[Lis..
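The answer in the excerpt above is cut off, so the following is only a minimal sketch of one common way to solve the problem, not the post's own solution. It assumes, as in the LeetCode problem statement, that each list stores the digits in reverse order (ones digit first). The function name `add_two_numbers` and the helpers `from_digits` and `to_digits` are hypothetical and exist only for this illustration.

```python
from typing import List, Optional


class ListNode:
    def __init__(self, val: int = 0, next: "Optional[ListNode]" = None):
        self.val = val
        self.next = next


def add_two_numbers(l1: Optional[ListNode], l2: Optional[ListNode]) -> Optional[ListNode]:
    """Add two numbers stored as reversed digit lists (assumed layout)."""
    dummy = ListNode()  # placeholder head so appending needs no special case
    tail = dummy
    carry = 0
    # Keep going while either list has digits left or a carry is pending.
    while l1 or l2 or carry:
        total = carry
        if l1:
            total += l1.val
            l1 = l1.next
        if l2:
            total += l2.val
            l2 = l2.next
        carry, digit = divmod(total, 10)
        tail.next = ListNode(digit)
        tail = tail.next
    return dummy.next


def from_digits(digits: List[int]) -> Optional[ListNode]:
    """Hypothetical helper: build a linked list from a Python list."""
    head = None
    for d in reversed(digits):
        head = ListNode(d, head)
    return head


def to_digits(node: Optional[ListNode]) -> List[int]:
    """Hypothetical helper: read a linked list back into a Python list."""
    out = []
    while node:
        out.append(node.val)
        node = node.next
    return out
```

For example, 342 + 465 is passed as `[2, 4, 3]` and `[5, 6, 4]`, and `to_digits(add_two_numbers(from_digits([2, 4, 3]), from_digits([5, 6, 4])))` gives `[7, 0, 8]`, that is, 807.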
[leetcode-206] Reverse Linked List
·
🐢 One step
LeetCode-206 Reverse Linked List : Reverse Linked List
note : Put the linked list in reverse order

    :::python
    # Definition for singly-linked list.
    # class ListNode:
    #     def __init__(self, val=0, next=None):
    #         self.val = val
    #         self.next = next
    class Solution:
        def reverseList(self, head: Optional[ListNode]) -> Optional[ListNode]:
            def reverse(node: ListNode, prev: ListNode = None):
                if not node:
                    return prev
                next, node.next = no..
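The excerpt above uses a recursive helper and is truncated. As a point of comparison, here is a minimal sketch of the iterative way to reverse a singly linked list. It is a different technique from the one in the post, and the names `reverse_list`, `from_values`, and `to_values` are hypothetical, introduced only for this illustration.

```python
from typing import List, Optional


class ListNode:
    def __init__(self, val: int = 0, next: "Optional[ListNode]" = None):
        self.val = val
        self.next = next


def reverse_list(head: Optional[ListNode]) -> Optional[ListNode]:
    """Reverse a singly linked list in place and return the new head."""
    prev = None
    node = head
    while node:
        following = node.next  # remember the rest of the list
        node.next = prev       # point the current node backwards
        prev = node            # the reversed part now starts here
        node = following       # move on to the next original node
    return prev


def from_values(values: List[int]) -> Optional[ListNode]:
    """Hypothetical helper: build a linked list from a Python list."""
    head = None
    for v in reversed(values):
        head = ListNode(v, head)
    return head


def to_values(node: Optional[ListNode]) -> List[int]:
    """Hypothetical helper: read a linked list back into a Python list."""
    out = []
    while node:
        out.append(node.val)
        node = node.next
    return out
```

The loop uses constant extra memory, whereas a recursive version uses one stack frame per node, which can matter for long lists in Python.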
[Whisper] Kspon Valid --- (2) CER
·
👾 Deep Learning
Robust Speech Recognition via Large-Scale Weak Supervision *As of 2023.1, the large model is identical to large-v2. The KsponSpeech data consists mainly of audio of short utterances. Whisper first performs language identification on an utterance using 99 language tokens. For very short utterances, however, Whisper can predict a different language, so the decoded text itself comes out wrong and CER increases. Setting the language configuration to korean skips language identification and predicts the transcript directly, which gave better results. Model size ..
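For readers unfamiliar with CER (character error rate), the following is a minimal sketch of how it is commonly computed: the character-level edit distance between reference and hypothesis, divided by the reference length. This is not the evaluation script used in the post. The function names are hypothetical, the sample strings are made up, and normalization choices such as whether spaces are removed before scoring are assumptions left to the caller.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference characters to reach an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # One missing character out of five.
    print(cer("안녕하세요", "안녕하세"))  # 0.2
    # Made-up case: a one-character Korean reference decoded as English.
    print(cer("네", "yes"))  # 3.0
```

The second call shows why a wrong language guess is so costly on short utterances: every output character counts as an error while the reference length in the denominator is tiny, so CER can exceed 1.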
다했다
B's