56 말소리와음성과학제 권제 3 호 (2009),. [7], [8].,, ( ) (), () (48.52±5.50yr), 64 ( 32, 44, 88), 309( 82, 6, ) 473

Similar documents
Lumbar spine

4 CD Construct Special Model VI 2 nd Order Model VI 2 Note: Hands-on 1, 2 RC 1 RLC mass-spring-damper 2 2 ζ ω n (rad/sec) 2 ( ζ < 1), 1 (ζ = 1), ( ) 1

012임수진

08김현휘_ok.hwp

(JBE Vol. 21, No. 1, January 2016) (Regular Paper) 21 1, (JBE Vol. 21, No. 1, January 2016) ISSN 228


DBPIA-NURIMEDIA

Rheu-suppl hwp

230 한국교육학연구 제20권 제3호 I. 서 론 청소년의 언어가 거칠어지고 있다. 개ㅅㄲ, ㅆㅂ놈(년), 미친ㅆㄲ, 닥쳐, 엠창, 뒤져 등과 같은 말은 주위에서 쉽게 들을 수 있다. 말과 글이 점차 된소리나 거센소리로 바뀌고, 외 국어 남용과 사이버 문화의 익명성 등

Slide 1

Abstract Background : Most hospitalized children will experience physical pain as well as psychological distress. Painful procedure can increase anxie

<31372DB9DABAB4C8A32E687770>


12이문규

DBPIA-NURIMEDIA

03이경미(237~248)ok

Software Requirrment Analysis를 위한 정보 검색 기술의 응용


THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE. vol. 29, no. 10, Oct ,,. 0.5 %.., cm mm FR4 (ε r =4.4)

2009;21(1): (1777) 49 (1800 ),.,,.,, ( ) ( ) 1782., ( ). ( ) 1,... 2,3,4,5.,,, ( ), ( ),. 6,,, ( ), ( ),....,.. (, ) (, )

878 Yu Kim, Dongjae Kim 지막 용량수준까지도 멈춤 규칙이 만족되지 않아 시행이 종료되지 않는 경우에는 MTD의 추정이 불가 능하다는 단점이 있다. 최근 이 SM방법의 단점을 보완하기 위해 O Quigley 등 (1990)이 제안한 CRM(Continu

Analyses the Contents of Points per a Game and the Difference among Weight Categories after the Revision of Greco-Roman Style Wrestling Rules Han-bong

Analysis of objective and error source of ski technical championship Jin Su Seok 1, Seoung ki Kang 1 *, Jae Hyung Lee 1, & Won Il Son 2 1 yong in Univ

<313120C0AFC0FCC0DA5FBECBB0EDB8AEC1F2C0BB5FC0CCBFEBC7D15FB1E8C0BAC5C25FBCF6C1A42E687770>

DBPIA-NURIMEDIA

서론 34 2

대한한의학원전학회지24권6호-전체최종.hwp

8-VSB (Vestigial Sideband Modulation)., (Carrier Phase Offset, CPO) (Timing Frequency Offset),. VSB, 8-PAM(pulse amplitude modulation,, ) DC 1.25V, [2

Journal of Educational Innovation Research 2018, Vol. 28, No. 4, pp DOI: 3 * The Effect of H

인문사회과학기술융합학회

영어교육연구제 22 권 4 호 2010 년겨울 대학생들의영어모음발음과지각 ( ) Yang, Byunggon. (2010). College students production and perception of English vowels. English Language Te

지능정보연구제 16 권제 1 호 2010 년 3 월 (pp.71~92),.,.,., Support Vector Machines,,., KOSPI200.,. * 지능정보연구제 16 권제 1 호 2010 년 3 월

16_이주용_155~163.hwp


09È«¼®¿µ 5~152s

Microsoft Word - 1-차우창.doc

전립선암발생률추정과관련요인분석 : The Korean Cancer Prevention Study-II (KCPS-II)

달생산이 초산모 분만시간에 미치는 영향 Ⅰ. 서 론 Ⅱ. 연구대상 및 방법 達 은 23) 의 丹 溪 에 최초로 기 재된 처방으로, 에 복용하면 한 다하여 난산의 예방과 및, 등에 널리 활용되어 왔다. 達 은 이 毒 하고 는 甘 苦 하여 氣, 氣 寬,, 結 의 효능이 있

[ 영어영문학 ] 제 55 권 4 호 (2010) ( ) ( ) ( ) 1) Kyuchul Yoon, Ji-Yeon Oh & Sang-Cheol Ahn. Teaching English prosody through English poems with clon

<352EC7E3C5C2BFB55FB1B3C5EBB5A5C0CCC5CD5FC0DABFACB0FAC7D0B4EBC7D02E687770>

DBPIA-NURIMEDIA

???? 1

04조남훈

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Sep.; 30(9),

03-ÀÌÁ¦Çö

Microsoft Word doc

(Exposure) Exposure (Exposure Assesment) EMF Unknown to mechanism Health Effect (Effect) Unknown to mechanism Behavior pattern (Micro- Environment) Re

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Feb.; 28(2),

서강대학교 기초과학연구소대학중점연구소 심포지엄기초과학연구소


슬라이드 제목 없음

#Ȳ¿ë¼®

cha4_ocw.hwp

<5B D B3E220C1A634B1C720C1A632C8A320B3EDB9AEC1F628C3D6C1BE292E687770>

분석결과 Special Edition 녹색건물의 가치산정 및 탄소배출 평가 이슈 서 민간분야의 적극적인 참여 방안의 마련이 필요하다. 또한 우리나라는 녹색건축의 경제성에 대한 검증에 대 한 연구가 미흡한 실정이다. 반면, 미국, 영국, 호주 등은 민간 주도로 녹색건축물


example code are examined in this stage The low pressure pressurizer reactor trip module of the Plant Protection System was programmed as subject for

Kor. J. Aesthet. Cosmetol., 라이프스타일은 개인 생활에 있어 심리적 문화적 사회적 모든 측면의 생활방식과 차이 전체를 말한다. 이러한 라이프스 타일은 사람의 내재된 가치관이나 욕구, 행동 변화를 파악하여 소비행동과 심리를 추측할 수 있고, 개인의

., (, 2000;, 1993;,,, 1994), () 65, 4 51, (,, ). 33, 4 30, 23 3 (, ) () () 25, (),,,, (,,, 2015b). 1 5,

45-51 ¹Ú¼ø¸¸

Can032.hwp

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Dec.; 26(12),

???? 1

(5차 편집).hwp

14.531~539(08-037).fm


<353420B1C7B9CCB6F52DC1F5B0ADC7F6BDC7C0BB20C0CCBFEBC7D120BEC6B5BFB1B3C0B0C7C1B7CEB1D7B7A52E687770>

특집-5

< C6AFC1FD28B1C7C7F5C1DF292E687770>

The characteristic analysis of winners and losers in curling: Focused on shot type, shot accuracy, blank end and average score SungGeon Park 1 & Soowo


untitled

Æ÷Àå½Ã¼³94š

DBPIA-NURIMEDIA

<C7D1B9CEC1B7BEEEB9AEC7D03631C1FD28C3D6C1BE292E687770>

6.24-9년 6월

DBPIA-NURIMEDIA

Journal of Educational Innovation Research 2019, Vol. 29, No. 1, pp DOI: : * Research Subject

대한한의학원전학회지23권3호(최종)_ hwp

hwp

Æ÷Àå82š


<C3D6C1BEBFCFBCBA2DBDC4C7B0C0AFC5EBC7D0C8B8C1F D31C8A3292E687770>

B-05 Hierarchical Bayesian Model을 이용한 GCMs 의 최적 Multi-Model Ensemble 모형 구축

정보기술응용학회 발표

유한차분법을 이용한 다중 기초자산 주가연계증권 가격결정

요약문 1 요 약 문 1. 과 제 명 : 소음노출 저감을 위한 작업환경관리 및 측정방안 연구 2. 연구기간 : ~ 연 구 자 : 연구책임자 장 재 길 (연구위원) 공동연구자 정 광 재 (연구원) 4. 연구목적 및 필요성

264 축되어 있으나, 과거의 경우 결측치가 있거나 폐기물 발생 량 집계방법이 용적기준에서 중량기준으로 변경되어 자료 를 활용하는데 제한이 있었다. 또한 1995년부터 쓰레기 종 량제가 도입되어 생활폐기물 발생량이 이를 기점으로 크 게 줄어들었다. 그러므로 1996년부

레이아웃 1

<C7D1B1B9B1B3C0B0B0B3B9DFBFF85FC7D1B1B9B1B3C0B05F3430B1C733C8A35FC5EBC7D5BABB28C3D6C1BE292DC7A5C1F6C6F7C7D42E687770>

박선영무선충전-내지

부문별 에너지원 수요의 변동특성 및 공통변동에 미치는 거시적 요인들의 영향력 분석

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE. vol. 29, no. 6, Jun Rate). STAP(Space-Time Adaptive Processing)., -

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE. vol. 27, no. 8, Aug [3]. ±90,.,,,, 5,,., 0.01, 0.016, 99 %... 선형간섭

Chapter4.hwp

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Jul.; 27(7),

DBPIA-NURIMEDIA

<32382DC3BBB0A2C0E5BED6C0DA2E687770>

Jkcs022(89-113).hwp

¼º¿øÁø Ãâ·Â-1

Transcription:

55 말소리와음성과학제 권제 3 호 (2009) pp. 55~63 Automated Speech Analysis Applied to Sasang Constitution Classification ABSTRACT This paper introduces an automatic voice classification system for the diagnosis of individual constitution based on Sasang Constitutional Medicine (SCM) in Traditional Korean Medicine (TKM). For the developing of this algorithm, we used the voices of 473 speakers and extracted a total of 44 speech features from the speech data consisting of five sustained vowels and one sentence. The classification system, based on a rule-based algorithm that is derived from a non parametric statistical method, presents binary negative decisions. In conclusion, 55.7% of the speech data were diagnosed by this system, of which 72.8% were correct negative decisions. Keywords: non-parametric, quantitative, Sasang, constitution, SCM, TKM. (,,, ),,, [].,,, [2],[3]. 4() ) doskian@kiom.re.kr 2) jhyoo@kiom.re.kr 3) haejung064@kiom.re.kr 4) ssmed@kiom.re.kr. (: 0028438) : 2009 7 5 : 2009 8 6 : 2009 9 7.,. Chiu 4 (zero-crossing), (peak) (valley),, 40-60 () 70% [4]. FD(Fractal Dimension) DTW (Dynamic Time Warping) (Qi-vacuity), (Ying-vacuity) 85% [5]. () (). ().. ()... ().., (). [6].,.

56 말소리와음성과학제 권제 3 호 (2009),. [7], [8].,,. 00. 2. 2. 2007 2008 7 0 ( ) 3. 5 60 4 (), 20 60 (). 0 80 (48.52±5.50yr), 64 ( 32, 44, 88), 309( 82, 6, ) 473. %. 30dB. Sennheiser e-835s 4-5cm. 5 (,,,, ) 2-2 (. ). PCM signed 6bits, mono (sampling rate) 44,00Hz. 2.2 <> 44. 5 (a, e, i, o, u).. 44 Figure. 44 Speech Features for Constitution Classification 2.2. F 0, Intensity, Formant MDVP (F 0, fundamental frequency) (intensity), (formant), MDVP(Multi-Dimensional Voice Program). Praat (script) [9]-[0] 20msec F 0,, 2 (F, F 2), -3dB (bandwidth), (amplitude),. Praat., Matlab. To Pitch (ac)... 0.00 75 3 yes 0.03 0.45 0.0 0.35 0.4 500 To Intensity... 00 0.005! To Formant (burg)... 0 5 5000 0.025 50! To Formant (burg)... 0 5 5500 0.025 50 deltat = 0.020 dur = ('offsettime'-'onsettime') / deltat iter = 'dur:0'+ i = 'onsettime' while i < 'offsettime' db = Get value at time... i Cubic f = Get value at time... i Hertz Linear w = Get bandwidth at time... i Hertz Linear f2 = Get value at time... 2 i Hertz Linear w2 = Get bandwidth at time... 2 i Hertz Linear mimamp = Get minimum... i i+0.02 Parabolic maxamp = Get maximum... i i+0.02 Parabolic fileappend 'wfilename$' 'i:3''tab$''f0:''tab$''db:''tab$' 'f:''tab$''w:''tab$''f2:''tab$''w2:''tab$''mimamp:5

음성을이용한사상체질분류알고리즘 57 endwhile ''tab$''maxamp:5''newline$' i = 'i'+deltat F 0 5 (segmentation). 500msec 750msec (onset) (offset) (candidates) 20%~80%. 5. /u/ F 0 Praat F 0. F 0 (Pearson correlation coefficient), F 0 0, 50, 90th (percentile),. MDVP []. F 0 / (jitter) (Jita, Jitt, RAP, PPQ) (shimmer) (ShdB, Shim, APQ).. MDVP Table. Formulas for jitter and shimmer related parameters used in MDVP n 2 STD å( F i - F) n - i= N Jita å - T 0 T N - i= - i- 0i RAP N-2 T + T + T 0i+ 0i 0i- å - T0i N - 2 i= 3 N- åt0i N 00 PPQ APQ 2.2.2 MFCC i= 0 ShdB Shim (Mel-scale) (cepstrum) MFCC(Mel-Frequency Cepstral Coefficients) [2]. MFCC. MFCC HTK(Hidden Markov Toolkit), 2 MFCC 65 MFCC. MFCC HTK <2>. 2. MFCC HTK (HCopy Command) Table 2. HTK Coding Parameters for the Extraction of MFCC OPTION VALUE SOURCEKIND WAVEFORM SOURCEFORMAT WAV SORCERATE 227 TARGETKIND MFCC_0 TARGETRATE 00000.0 SAVECOMPRESSED F SAVEWITHCRC F WINDOWSIZE 400000.0 USEHAMMING T PREEMCOEF 0.97 NUMCHANS 30 CEPLIFTER 22 NUMCEPS 2 ENORMALISE F 2.2.3 <3> 44. 3. Table 3. The Definition of Speech Features xf0 average fundamental frequency xt0 period of the average glottal period xstd standard deviation of F 0 xjita absolute jitter xjitt jitter percent xrap relative average perturbation xppq pitch period perturbation quotient xshdb shimmer in db xshim shimmer percent xapq amplitude perturbation quotient xf st formant xbw st 3dB bandwidth xf2 2nd Formant xbw2 2nd 3dB Bandwidth xmfcc~2 ~2th MFCC xc0 energy CORR correlation between F 0 and intensity P0 0th percentile of F 0 P50 50th percentile of F 0 P90 90th percentile of F 0 PHL (P90-P50)/(P50-P0) I0 0th percentile of intensity I50 50th percentile of intensity I90 90th percentile of intensity IHL (I90-I50)/(I50-I0) x : 5(a, e, i, o, u)

58 말소리와음성과학제 권제 3 호 (2009) 3. 3.. 44 train set. umin, lmax. umin ui, lmax li. umin, ui, lmax, li 4. 2. 4 2 Figure 2. 4 Conditional Variables and 2 Logical Rules 3.2,,, BMI 44,,. 4,,, BMI(Body Mass Index). 0.3 9(eSHDB, eshim, uf0, if2, P0, P50, emfcc8, imfcc8, omfcc8, umfcc8), 2(uF0, P50) BMI 0.3. 2(aF0, at0, ef0, et0, if0, of0, ot0, uf0, P0, P50, P90, amfcc9, amfcc2, emfcc6, emfcc, emfcc2, imfcc6, imfcc9, omfcc9, umfcc7, umfcc9, umfcc2), BMI 5(eF0, P0, P50, P90, amfcc2) 0.3. 0.3 6 5, BMI 4 <3> sub-grule., BMI sub-grule grule. grule li ui grule. <2> 4. If (X > umin) then NOT ui () If (X < lmax) then NOT li (2), 4 umin (upper threshold), lmax (lower threshold). (X), X umin ui.. lmax li.. 44 2, 288 4 44 4 ( grule ),. 4 (outlier). 3. sub-grule Figure 3. sub-grule Matrix 3.3.,,.. 44,, 0

음성을이용한사상체질분류알고리즘 59 2 44 grule. (), (2) ui li -. grule. (normalized vector) (W). train set grule 3 (S). S (3) (4) (W). train set sub-grule grule (W). S s se s sy s te (3) s se :, s sy :, s te : a sse s sy s te s se b sse s sy s te s sy s te c sse s sy s te W a b c a b c (4) 3.4 grule. 4. Figure 4. Constitutional Score and Decision Rule,,, 0 grule. ui li (W).,.. <4>,, (N_SF), 44 0% 4.. 4. 4. 5 0-fold (cross validation)., 0 9 train set, grule test set. Test set 0 test set, 0-fold... train set 4 0% 0th 90th. 5 0-fold. 5 0-fold, 54.7%, 74.0%. 56.3% 72.%. /.

60 말소리와음성과학제 권제 3 호 (2009) 4. Table 4. Algorithm Result of the Male Patients 5. Table 5. Algorithm Result of the Female Patients 4.2 44. 5 0-fold test set grule 20 <6>. F 0,.. 6. Table 6. Rank of Speech Features for Classification Rank P50[Hz] * af[hz] ** 2 amfcc2 amfcc8 3 of0[hz] * omfcc2 4 ot0[ms] * uf[hz] ** 5 P90[Hz] * of2[hz] ** 6 af[hz] ** umfcc 7 if0[hz] * emfcc4 8 ut0[ms] * amfcc6 9 it0[ms] * emfcc6 0 at0[ms] * imfcc0 omfcc2 omfcc5 2 IHL et0[ms] * 3 irap[%] I50[dB] 4 uf2[hz] ** emfcc9 5 ef0[hz] * if0[hz] * 6 af0[hz] * ef0[hz] * 7 et0[ms] * uc0 8 ubw[hz] ** amfcc9 9 uf0[hz] * I0[dB] 20 obw[hz] ** emfcc0 * : F 0, ** : Formant 5..,,,.....,,,. 50% 70%. /.. F 0,. (testosterone) [3]-[4]

음성을이용한사상체질분류알고리즘 6... [] WHO. (2007). WHO International Standard Terminologies on Traditional Medicine in The Western Pacific Region", http://www.who.int [2] Park, S. H., Kim, M. G., Lee, S. J., Kim, J. Y. & Chae, H. (2009). Temperament and character profiles of sasang typology in an adult clinical sample, Evid. Based Complement. Altern. Med., Advance Access published on April 20; doi:0.093/ecam/nep034 [3] Park, S. C. & Kim, D. J. (2004). Implementation of the automatic pulse-power diagnostic system and the discrimination algorithm of four constitutions, Journal of IEEK SC, Vol. 4, No. 2, pp. 53-60 (, (2004)., 4 SC 2, pp. 53-60, March) [4] Chiu, C. C., Chang, H. H. & Yang, C. H. (2000). "Objective auscultation for traditional Chinese medical diagnosis using novel acoustic parameters", Comput. Methods Programs Biomed, Vol. 62, pp. 99-07. [5] Chiu, C. C., Yang, M. T. & Lin, C. S. (2002). Using fractal dimension analysis on objective auscultation of traditional Chinese medical diagnosis, J. Med.Biol. Eng., Vol. 22, pp. 29-225. [6] Kim, D. R. (999). The Principle of Life Ppreservation in Oriental Medicine, Chungdam. ( (999).,.) [7] Cho, D. U., (2006). Sasang Constitution Classification by Speech Signal Processing, Journal of KICS, Vol. 3, No. 5C, pp. 548-555, May. ( (2006)., 3 5C, pp. 548-555.) [8] Moon, S. J., Tak, J. H. & Hwang, H. J. (2005). A phonetic study of Sasang Constitution, Malsori, Vol. 55, pp. -4. (,, (2005). :, 55, pp. -4.) [9] Boersma, P. (993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound, Proceeding Institute of Phonetic Sciences Vol. 7, pp. 97-0. [0] Yang, B. G. (2003). Speech Analysis Using Praat Script, Mansu ( (2003).,.) [] Ko, D. H. & Jeong, O. R. (200). The Use of Speech and Language Analyzer, Hankukmunhwasa (, (200).,.) [2] Dellar, J. R., Hansen, J. H. L. & Proakis, J. G. (999). Discrete- Time Processing of Speech Signals, Wiley-IEEE Press, pp. 380-385. [3] Dabbs, J. M. & Mallinger, A. (999). High testosterone levels predict low voice pitch among men, Personality and Individual Differences, Vol. 27, pp. 80-804. [4] King, A., Ashby. J. & Nelson, C. (200). "Effects of testosterone replacement on a male professional singer", Journal of Voice, Vol. 5, No. 4, pp. 553-557. (Kang, Jaehwan) 483 Tel: 042-868-930 Fax: 042-868-9480 Email: doskian@kiom.re.kr :, (Yoo, Jonghyang) 483 Tel: 042-868-959 Fax: 042-868-9480 Email: jhyoo@kiom.re.kr : (Lee, Haejung) 483 Tel: 042-868-9320 Fax: 042-868-9480 Email: haejung064@kiom.re.kr :, (Kim, Jongyeol) 483 Tel: 042-868-9489 Fax: 042-868-9480 Email: ssmed@kiom.re.kr :