Presented at Annual CKM Session (Toronto, March 16, 2017) Hyoungbae Lee Korean Studies Librarian Princeton University
Based on AutoIt, a freeware automation language for Microsoft Windows o Easy to learn BASIC-like syntax o Simulate keystrokes and mouse movements o Manipulate windows and processes o Interact with all standard windows controls o Scripts can be compiled into standalone executables
Oct. 2015 - Mar. 2016 : Developing and testing (demo version) Mar. 2016 (CEAL Meeting at Seattle) : Demonstration (demo version) Apr. 2016 - Dec. 2016 : Implementation of new functions, testing and debugging, result analysis, dictionary-building, discussion via listserv Jan. 2017 - Mar. 2017 : Wrap-up and preparation Mar. 2017 (CEAL Meeting at Toronto) : Demonstration (release version)
Demo version tested with examples in ALA-LC Korean Romanization Table (December, 2015) o The ALA-LC Table is basically rules and exceptions o Rules Write automation scripts o Exceptions Build dictionaries Release version tested with random book titles (April-December, 2016) o 1,600 titles (4,000 words) o Error rate: less than 5% o Error rate excluding capitalization: less than 3%
Hyoungbae Lee (Princeton, Chair) Hyunjoo Eom (UBC) Eunseung Oh (Stanford) Jee-Young Park (Chicago)
Ctrl + Shift + Ki romanizes Personal names 이세상 ( 李世祥, 1671-1719) Yi Se-sang 박마리아 Pak Maria 김삿갓 Kim Satkat 선우휘 Sŏnu Hwi 김조광수 Kim Cho Kwang-su 신립, 류성룡, 이하늬, 김초롱 1, 심마르시아스 Sin Rip, Yu Sŏng-nyong, Yi Hanŭi, Kim Ch'o-rong, Sim Marŭsiasŭ 1 K-Romanizer cannot tell whether 초롱 is of Hancha origin or a native Korean name, thus inserting a hyphen by default. On the other hand, 늬 in 하늬 cannot be of Hancha origin, and K-Romanizer regards it as a non-hancha name by removing a hyphen.
Ctrl + Alt + Ki romanizes general words and phrases 이세상 ( this world ) I sesang 오늘의사건사고 Onu l uĭ sako n sago 한미경제협력 Han-Mi kyo ngje hyo mnyo k 토론토 CEAL 연례회의 T'oront'o CEAL yo llye hoeuĭ 2017 년 3 월 16 일 2017-yo n 3-wo l 16-il ǂa 한국신문방송말글변천사 : ǂb 독립신문 2 100 돌기념 ǂa Han'guk sinmun pangsong malgu l pyo nch'o nsa : ǂb tongnip sinmun 100-tol kinyo m 2 독립신문 has to be manually capitalized as Tongnip sinmun.
Recognizes geographical names ( 國, 道, 市, 區, 郡 ) 서울특별시 Soŭl T'u kpyo lsi 서울시종로구 Soŭl-si Chongno-gu 전라북도고창군 Cho lla-bukto Koch'ang-gun 대한민국, 중국, 일본 Taehan Min'guk, Chungguk, Ilbon ǂa 실크로드문명기행 : ǂb 일본나라 3 에서이탈리아로마까지 ǂa Silk'u rodu munmyo ng kihaeng : ǂb Ilbon nara eso It'allia Roma kkaji 3 실크로드 and 나라 has to be manually capitalized as Silk'u Rodu and Nara respectively.
Recognizes names of Korean kings ( 宗, 祖, 王, 君 ) 선조와광해군 So njo wa Kwanghae Kun 고구려광개토대왕 Koguryo Kwanggaet'o Taewang 의친왕과영친왕 Uĭ Ch'inwang kwa Yo ng Ch'inwang ǂa 숙종대왕민중전 4 덕행록 : ǂb 숙종대왕어제 ǂa Sukchong Taewang min chungjo n to khaengnok : ǂb Sukchong Taewang o je 4 민중전, the queen of King Sukchong, has to be manually capitalized as Min Chungjŏn.
Romanizes Hancha or Hancha-Hangul mixture 李世祥 Yi Se-sang 高句麗廣開土大王 Koguryo Kwanggaet'o Taewang 全羅北道高敞郡 Cho lla-bukto Koch'ang-gun 합천 : 陝川 or 陜川? 陝川 = So mch'o n / 陜川 = Hapch'o n ǂa 高麗時代의硏究 : ǂb 特히圖讖思想의發展을中心으로 ǂa Koryo sidae uĭ yo n'gu : ǂb t'u khi toch'am sasang uĭ palcho n u l chungsim u ro
ㄱ부터ㅎ까지 Kiyo k put'o hiu t kkaji 조선, 리더, 제주도 Choso n, lido, Chejudo 5 Ctrl + Shift + Alt + Ki will not capitalize the first letter 문국현 조명현外지음 문국현 조명현 oe chiu m Shift + Alt + Ki capitalizes the first letters of all words 동아시아도서관협의회한국학자료위원회 Tong Asia Tosoğwan Hyo buĭhoe Han'gukhak Charyo Wiwo nhoe 5 A warning message will pop up for these words due to ambiguity: 朝鮮 =Choso n / 造船 =choso n, 리더 (Leader)= lido / 리더 (reader)= rido, 濟州島 = Chejudo / 濟州道 = Cheju-do. Users have to modify them as necessary.
Entering special characters for cataloging Ctrl + Alt + Oi enters a lower-case ŏ Shift + Alt + Oi enters an upper-case Ŏ Ctrl + Alt + Ui enters a lower-case ŭ Shift + Alt + Ui enters an upper-case Ŭ Ctrl + Alt + \i enters a double dagger ǂ
Ctrl + Shift + Alt + Ci adds Hangul reading to Hancha texts. 春雨細不滴, 夜中微有聲, 雪盡南溪漲, 草芽多少生 / 鄭夢周 春雨細不滴, 夜中微有聲, 雪盡南溪漲, 草芽多少生 / 鄭夢周 ( 춘우세부적, 야중미유성, 설진남계창, 초아다소생 / 정몽주 ) 6 不恨吾生晩只恨吾有眼無復見斯人危途悌空潸 / 李安訥 不恨吾生晩只恨吾有眼無復見斯人危途悌空潸 / 李安訥 ( 불한오생만지한오유안무복견사인위도제공산 / 이안눌 ) 6 A warning message will pop up for 復 and 見, as these characters have more than one reading. Users have to modify them as necessary: 復 should be manually changed into 부 in this poem. On the other hand, 不 has two readings as 부 or 불 depending on the following sound, and the sound change will be automatically applied by K-Romanizer.
One Big CAVEAT! Since word division has to be first done manually, it is strongly advsied that users get familiarized with the ALA-LC Korean Romanization rules before using K-Romanizer. http://www.loc.gov/catdir/cpso/romanization/korean.pdf Download Link available at: http://www.eastasianlib.org/ckm/subcommittees.html Download a new version of K-Romanizer at least every 3 months for updated dictionaries. Each version will expire after 90 days from the last update.