음성 인식에서 음소 클러스터 수의 효과 | sam

학술논문

음성 인식에서 음소 클러스터 수의 효과

이용수 6

영문명: The Effect of the Number of Phoneme Clusters on Speech Recognition
발행기관: 한국전자통신학회
저자명: 이창영(Chang-Young Lee)
간행물 정보: 『한국전자통신학회 논문지』제9권 제11호, 1221~1226쪽, 전체 6쪽
주제분류: 공학 > 전자/정보통신공학
파일형태: PDF
발행일자: 2014.11.30

이용권 구매하기

이용가능 이용불가

sam무제한 이용권 으로 학술논문 이용이 가능합니다.
이 학술논문 정보는 (주)교보문고와 각 발행기관 사이에 저작물 이용 계약이 체결된 것으로, 교보문고를 통해 제공되고 있습니다. 1:1 문의

국문 초록

본 논문에서는 음성 인식의 효율을 높이기 위하여 음소 클러스터 개수의 효과에 대해 연구하였다. 이를 위하여 음소 클러스터 개수를 바꾸어 가면서 수정된 k-평균 군집 알고리듬을 사용하여 코우드북을 작성하였다. 그런 다음, 퍼지 벡터 양자화와 은닉 마코브 모델을 사용하여 음성인식 테스트를 수행하였다. 실험 결과 두 개의 영역이 구분되어 나타났다. 음소 클러스터 개수가 클 때 인식 성능은 대체로 그와 무관하지만, 개수가 작을때에는 그 감소와 더불어 인식 오류율이 비선형적으로 증가하는 것으로 나타났다. 수치 해석적 계산으로부터, 이 비선형 영역은 멱승함수에 의해 모델링 될 수 있었다. 또한 300개의 고립단어 인식의 경우에, 166개의 음소클러스터가 최적의 수임을 보일 수 있었다. 이는 음소당 3개 정도의 변화에 해당하는 값이다.

영문 초록

In an effort to improve the efficiency of the speech recognition, we investigate the effect of the number of phoneme clusters. For this purpose, codebooks of varied number of phoneme clusters are prepared by modified k-means clustering algorithm. The subsequent processing is fuzzy vector quantization (FVQ) and hidden Markov model (HMM) for speech recognition test. The result shows that there are two distinct regimes. For large number of phoneme clusters, the recognition performance is roughly independent of it. For small number of phoneme clusters, however, the recognition error rate increases nonlinearly as it is decreased. From numerical calculation, it is found that this nonlinear regime might be modeled by a power law function. The result also shows that about 166 phoneme clusters would be the optimal number for recognition of 300 isolated words. This amounts to roughly 3 variations per phoneme

키워드

speech recognition number of phoneme clusters fuzzy vector quantization hidden Markov model 음성 인식 음소 클러스터 수 퍼지 벡터 양자화 은닉 마코브 모델

국문 초록

영문 초록

목차

키워드

해당간행물 수록 논문

참고문헌

최근 이용한 논문

APA

MLA