(JBE Vol. 23, No. 2, March 2018) (Special Paper) 23 2, (JBE Vol. 23, No. 2, March 2018) ISSN

Similar documents
2 : (JEM) QTBT (Yong-Uk Yoon et al.: A Fast Decision Method of Quadtree plus Binary Tree (QTBT) Depth in JEM) (Special Paper) 22 5, (JBE Vol. 2

09권오설_ok.hwp

(JBE Vol. 23, No. 5, September 2018) (Regular Paper) 23 5, (JBE Vol. 23, No. 5, September 2018) ISSN

2 : 3 (Myeongah Cho et al.: Three-Dimensional Rotation Angle Preprocessing and Weighted Blending for Fast Panoramic Image Method) (Special Paper) 23 2

(JBE Vol. 23, No. 2, March 2018) (Special Paper) 23 2, (JBE Vol. 23, No. 2, March 2018) ISSN

(JBE Vol. 21, No. 1, January 2016) (Regular Paper) 21 1, (JBE Vol. 21, No. 1, January 2016) ISSN 228

À±½Â¿í Ãâ·Â

(JBE Vol. 22, No. 2, March 2017) (Regular Paper) 22 2, (JBE Vol. 22, No. 2, March 2017) ISSN

(JBE Vol. 23, No. 6, November 2018) (Special Paper) 23 6, (JBE Vol. 23, No. 6, November 2018) ISSN 2

2 : (Seungsoo Lee et al.: Generating a Reflectance Image from a Low-Light Image Using Convolutional Neural Network) (Regular Paper) 24 4, (JBE

08김현휘_ok.hwp

DBPIA-NURIMEDIA

<30312DC1A4BAB8C5EBBDC5C7E0C1A4B9D7C1A4C3A52DC1A4BFB5C3B62E687770>

1 : 360 VR (Da-yoon Nam et al.: Color and Illumination Compensation Algorithm for 360 VR Panorama Image) (Special Paper) 24 1, (JBE Vol. 24, No

°í¼®ÁÖ Ãâ·Â

(JBE Vol. 23, No. 1, January 2018) (Special Paper) 23 1, (JBE Vol. 23, No. 1, January 2018) ISSN 2287-

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Nov.; 26(11),

High Resolution Disparity Map Generation Using TOF Depth Camera In this paper, we propose a high-resolution disparity map generation method using a lo

(JBE Vol. 24, No. 2, March 2019) (Special Paper) 24 2, (JBE Vol. 24, No. 2, March 2019) ISSN

(JBE Vol. 23, No. 1, January 2018). (VR),. IT (Facebook) (Oculus) VR Gear IT [1].,.,,,,..,,.. ( ) 3,,..,,. [2].,,,.,,. HMD,. HMD,,. TV.....,,,,, 3 3,,

DBPIA-NURIMEDIA

1 : (Sunmin Lee et al.: Design and Implementation of Indoor Location Recognition System based on Fingerprint and Random Forest)., [1][2]. GPS(Global P

(JBE Vol. 23, No. 5, September 2018) (Regular Paper) 23 5, (JBE Vol. 23, No. 5, September 2018) ISSN

4 : (Hyo-Jin Cho et al.: Audio High-Band Coding based on Autoencoder with Side Information) (Special Paper) 24 3, (JBE Vol. 24, No. 3, May 2019

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Feb.; 29(2), IS

디지털포렌식학회 논문양식

학습영역의 Taxonomy에 기초한 CD-ROM Title의 효과분석

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Mar.; 25(3),

<4D F736F F D20B1E2C8B9BDC3B8AEC1EE2DC0E5C7F5>

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE. vol. 29, no. 10, Oct ,,. 0.5 %.., cm mm FR4 (ε r =4.4)

10 이지훈KICS hwp

2 : (Juhyeok Mun et al.: Visual Object Tracking by Using Multiple Random Walkers) (Special Paper) 21 6, (JBE Vol. 21, No. 6, November 2016) ht

<35335FBCDBC7D1C1A42DB8E2B8AEBDBAC5CDC0C720C0FCB1E2C0FB20C6AFBCBA20BAD0BCAE2E687770>

02손예진_ok.hwp

4 : WebRTC P2P DASH (Ju Ho Seo et al.: A transport-history-based peer selection algorithm for P2P-assisted DASH systems based on WebRTC) (Special Pape

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE. vol. 29, no. 6, Jun Rate). STAP(Space-Time Adaptive Processing)., -

<30362E20C6EDC1FD2DB0EDBFB5B4EBB4D420BCF6C1A42E687770>

05( ) CPLV12-04.hwp

03-ÀÌÁ¦Çö

(JBE Vol. 23, No. 1, January 2018) (Regular Paper) 23 1, (JBE Vol. 23, No. 1, January 2018) ISSN 2287

3 : (Won Jang et al.: Musical Instrument Conversion based Music Ensemble Application Development for Smartphone) (Special Paper) 22 2, (JBE Vol

04 최진규.hwp

Journal of Educational Innovation Research 2017, Vol. 27, No. 3, pp DOI: (NCS) Method of Con

인문사회과학기술융합학회

45-51 ¹Ú¼ø¸¸

DBPIA-NURIMEDIA

Æ÷Àå½Ã¼³94š

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Jan.; 26(1),

DBPIA-NURIMEDIA

(JBE Vol. 24, No. 1, January 2019) (Regular Paper) 24 1, (JBE Vol. 24, No. 1, January 2019) ISSN 2287

<313120C0AFC0FCC0DA5FBECBB0EDB8AEC1F2C0BB5FC0CCBFEBC7D15FB1E8C0BAC5C25FBCF6C1A42E687770>

1 : UHD (Heekwang Kim et al.: Segment Scheduling Scheme for Efficient Bandwidth Utilization of UHD Contents Streaming in Wireless Environment) (Specia

<32382DC3BBB0A2C0E5BED6C0DA2E687770>

07.045~051(D04_신상욱).fm

(JBE Vol. 22, No. 5, September 2017) (Special Paper) 22 5, (JBE Vol. 22, No. 5, September 2017) ISSN

그림 2. 5G 연구 단체 현황 앞으로 다가올 미래에는 고품질 멀 티미디어 서비스의 본격화, IoT 서 비스 확산 등의 변화로 인해 기하 급수적인 무선 데이터 트래픽 발생 및 스마트 기기가 폭발적으로 증대 할 것으로 예상된다 앞으로 다가올 미래에는 고품질 멀티미디어 서

,, RFID,. ITU-R [7], IoT (Internet of Thing), (ultra reliable) (low latency). IoT ( ) , [1]., [8] 10 IoT.,. Ofcom [10] IoT/M2M, (utilities),,

04 김영규.hwp

<30312DC1A4BAB8C5EBBDC5C7E0C1A4B9D7C1A4C3A528B1E8C1BEB9E8292E687770>

07변성우_ok.hwp

example code are examined in this stage The low pressure pressurizer reactor trip module of the Plant Protection System was programmed as subject for

3. 클라우드 컴퓨팅 상호 운용성 기반의 서비스 평가 방법론 개발.hwp

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Jun.; 27(6),

À¯Çõ Ãâ·Â

14.531~539(08-037).fm

06_ÀÌÀçÈÆ¿Ü0926

½Éº´È¿ Ãâ·Â

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Dec.; 27(12),

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Sep.; 30(9),

2 : (Jaeyoung Kim et al.: A Statistical Approach for Improving the Embedding Capacity of Block Matching based Image Steganography) (Regular Paper) 22

Journal of Educational Innovation Research 2018, Vol. 28, No. 4, pp DOI: A Study on Organizi

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Jul.; 27(7),

(JBE Vol. 21, No. 3, May 2016) HE-AAC v2. DAB+ 120ms..,. DRM+(Digital Radio Mondiale plus) [3] xhe-aac (extended HE-AAC). DRM+ DAB HE-AAC v2 xhe-aac..

DBPIA-NURIMEDIA

<3031B0ADB9CEB1B82E687770>

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Dec.; 26(12),

8-VSB (Vestigial Sideband Modulation)., (Carrier Phase Offset, CPO) (Timing Frequency Offset),. VSB, 8-PAM(pulse amplitude modulation,, ) DC 1.25V, [2

5 : HEVC GOP R-lambda (Dae-Eun Kim et al.: R-lambda Model based Rate Control for GOP Parallel Coding in A Real-Time HEVC Software Encoder) (Special Pa

3 : 3D (Seunggi Kim et. al.: 3D Depth Estimation by a Single Camera) (Regular Paper) 24 2, (JBE Vol. 24, No. 2, March 2019)

원고스타일 정의

<31362DB1E8C7FDBFF82DC0FABFB9BBEA20B5B6B8B3BFB5C8ADC0C720B1B8C0FC20B8B6C4C9C6C32E687770>

<31325FB1E8B0E6BCBA2E687770>

DBPIA-NURIMEDIA

Voice Portal using Oracle 9i AS Wireless

<353420B1C7B9CCB6F52DC1F5B0ADC7F6BDC7C0BB20C0CCBFEBC7D120BEC6B5BFB1B3C0B0C7C1B7CEB1D7B7A52E687770>

Microsoft Word - 1-차우창.doc

20(53?)_???_O2O(Online to Offline)??? ???? ??.hwp

2 : MMT QoS (Bokyun Jo et al. : Adaptive QoS Study for Video Streaming Service In MMT Protocol). MPEG-2 TS (Moving Picture Experts Group-2 Transport S

03-서연옥.hwp

04_이근원_21~27.hwp

6.24-9년 6월

DBPIA-NURIMEDIA

THE JOURNAL OF KOREAN INSTITUTE OF ELECTROMAGNETIC ENGINEERING AND SCIENCE Mar.; 28(3),

(JBE Vol. 24, No. 1, January 2019) (Regular Paper) 24 1, (JBE Vol. 24, No. 1, January 2019) ISSN 2287

정보기술응용학회 발표

<30345F D F FC0CCB5BFC8F15FB5B5B7CEC5CDB3CEC0C720B0BBB1B8BACE20B0E6B0FCBCB3B0E8B0A120C5CDB3CE20B3BBBACEC1B6B8ED2E687770>

09오충원(613~623)

12È«±â¼±¿Ü339~370

09È«¼®¿µ 5~152s

CONTENTS Volume 테마 즐겨찾기 빅데이터의 현주소 진일보하는 공개 기술, 빅데이터 새 시대를 열다 12 테마 활동 빅데이터 플랫폼 기술의 현황 빅데이터, 하둡 품고 병렬처리 가속화 16 테마 더하기 국내 빅데이터 산 학 연 관

Transcription:

(Special Paper) 23 2, 2018 3 (JBE Vol. 23, No. 2, March 2018) https://doi.org/10.5909/jbe.2018.23.2.246 ISSN 2287-9137 (Online) ISSN 1226-7953 (Print) CNN a), a), a) CNN-Based Hand Gesture Recognition for Wearable Applications Hyeon-Chul Moon a), Anna Yang a), and Jae-Gon Kim a) NUI(Natural User Interface). MPEG IoT(Internet of Things) IoMT(Internet of Media Things). IoMT.,. IoMT (use case) CNN(Convolutional Neural Network). (depth), CNN,. 95%. Abstract Hand gestures are attracting attention as a NUI (Natural User Interface) of wearable devices such as smart glasses. Recently, to support efficient media consumption in IoT (Internet of Things) and wearable environments, the standardization of IoMT (Internet of Media Things) is in the progress in MPEG. In IoMT, it is assumed that hand gesture detection and recognition are performed on a separate device, and thus provides an interoperable interface between these modules. Meanwhile, deep learning based hand gesture recognition techniques have been recently actively studied to improve the recognition performance. In this paper, we propose a method of hand gesture recognition based on CNN (Convolutional Neural Network) for various applications such as media consumption in wearable devices which is one of the use cases of IoMT. The proposed method detects hand contour from stereo images acquisitioned by smart glasses using depth information and color information, constructs data sets to learn CNN, and then recognizes gestures from input hand contour images. Experimental results show that the proposed method achieves the average 95% hand gesture recognition rate. Keyword : MPEG-IoMT, Hand Gesture, Hand Contour, CNN, Gesture Recognition a) (Korea Aerospace University, School of Electronics and Information Engineering) Corresponding Author : (Jae-Gon Kim) E-mail: jgkim@kau.ac.kr Tel: +82-2-300-0414 ORCID: http://orcid.org/0000-0003-3686-4786 [10077958]. 2017. This work was supported by National Standards Technology Promotion Program of Korean Agency for Technology and Standards (KATS) grant funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea) (10077958). Parts of this work have been published in the 2017 Fall Conference of the Korean Institute of Broadcasting and Media Engineers. Manuscript received January 12, 2018; Revised February 5, 2018; Accepted February 14, 2018. Copyright 2016 Korean Institute of Broadcast and Media Engineers. All rights reserved. This is an Open-Access article distributed under the terms of the Creative Commons BY-NC-ND (http://creativecommons.org/licenses/by-nc-nd/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited and not altered.

2 : CNN (Hyeon-Chul Moon et al.: CNN-Based Hand Gesture Recognition for Wearable Applications). NUI (Natural User Interface). MPEG IoT(Internet of Thing) IoMT(Internet of Media Things), (use case). NUI [1][2][3]. [4][5], (Deep Learning) [5][6]. MPEG IoMT CNN(Convolutional Neural Network). MPEG IoMT, API(Application Programming Interface) [5]. CNN.. CNN [6].. 2 IoMT (use case), 3 CNN. 4, 5.. IoMT 1 [2][3]. NUI [8]. IoMT 1 (User), MThings(Media Things), (Processing Unit: PU),. 1. Fig 1. A scenario hand gesture based wearable applications 1. (Detection Module). [1].,,.. IoMT PU PU API

. XML API, CNN. III. CNN 1.. 2, (RGB ) (depth). (30 ~ 50cm),. (morphology). 2. 3 [1][2][3]. 2. Fig 2. Procedure of hand contour detection 3. Fig 3. An example of the detected hand contour 2. CNN CNN. CNN, (feature) (classification). CNN (con- volution), (Pooling), (Fully-Connected: FC) [9]. 4 CNN. C1, C2 Convolution 5 5 1(stride= 1),. S1, S2 Pooling Max Pooling 2 2,. Max Pooling 5, 2 2. F1, F2 Fully-Connected,. 6 Fully- Connected, P1, P2, P3. 6, 7, P1~P10. Softmax

2 : CNN (Hyeon-Chul Moon et al.: CNN-Based Hand Gesture Recognition for Wearable Applications) 4. CNN Fig 4. Proposed CNN structure,( 1 ) 9 0.. (gradient descent), (momentum). (loss function),. [10]. 5. Max Pooling Fig 5. Example of Max Pooling 6. Fully-Connected Fig 6. Classification process of fully-connected layer 3. CNN CNN (weight),. (1), (2) t gradient, learning rate,, t. (1), gradient., gradient. (2), t+1..

4. Tensorflow, Theano, MXnet, Keras, [11]. S/W R. R MXnetR, Deepnet, H20,., MXnet, Deepnet. H20,.. IV. 8. 7 10 5 6,000, 3 1,000.,., 1 MxnetR, 2 3 MxnetR. 1. Table 1. Recognition accuracy comparisons among deep learning frameworks Framework Accuracy (%) MxnetR 95 Deepnet 94.7 H20 94.4 2., 5, 3., 7,000. 0.3%,. 2. Table 2. Recognition accuracy comparisons according to the size of data set and the application of momentum Methods Accuracy (%) Data set = 2,800 Momentum: not applied Data set = 5,600 Momentum: not applied Data set = 7,000 Momentum: not applied Data set = 7,000 Momentum: applied 77.2 88.4 94.7 95 7. Fig 7. A set of hand gestures used in the experiments 1., 1,000. 3 6. 3 Three 98.3%, Rice 91.7%. 0.5 ~ 6.6%, 10 95%.

2 : CNN (Hyeon-Chul Moon et al.: CNN-Based Hand Gesture Recognition for Wearable Applications) 3. Table 3. Recognition accuracy of each gesture label 1 (One) 2 (Two) 3 (Three) 4 (Four) 5 (Five) 93.1 96.1 98.3 95.6 95.2, CNN. 95%, MPEG IoMT. Accuracy (%) 6 (Okay) 7 (Promise) 8 (Rice) 9 (Scissor) 10 (Victory) 94.2 96.1 91.7 93.7 94.6 (References) Average accuracy = 95 % 4 CNN, CNN [12].,. [12] 93.8%, 1.2%. 4. Table 4. Comparison of recognition accuracy with the existing method Method Accuracy (%) Existing method [12] 93.8 Proposed method 95 V.. CNN., CNN., [1] A. Yang, S. Chun, H. Ko, J. G. Kim, Hand gesture description for wearable applications in M-IoTW, ISO/IEC JTC1/SC29/WG11 M38526, Geneva, Swiss, May. 2016. [2] A. Yang, S. Chun, and J-G. Kim, Detection and recognition of hand gesture for wearable applications in IoMTW, In Proc. ICACT 2017, pp. 598 601, Feb. 2017. [3] A. Yang, S. Chun, and J.-G. Kim, Detection and Recognition of Hand Gesture for Wearable Applications in IoMTW, ICACT Trans. Advanced Communications Technology (TACT), vol. 6, no. 5, pp. 1046-1053, Sep. 2017. [4] S. Mitra and T. Acharya, Gesture recognition: A survey, IEEE Trans. Syst., Man, Cybern. C, vol. 37, no. 3, pp. 311 324, 2007. [5] S. Byun, S. Lee, G. Kim, S. Han, Gesture recognition with wearable device based on deep learning, Broadcasting and Media Magazine, Vol.22, No.1, pp. 58-66, Jan.2017 [6] H. Moon, A. Yang, S. Chun, and J-G. Kim, CNN-based hand gesture recognition for wearable applications, The Korean Institute of Broadcase and Media Engineers Conference, Seoul, Korea, pp. 58-59, 2017. [7] Y. LeCun, K. Koray, and F. Clément, Convolutional networks and applications in vision, In Proc. ISCAS 2010. [8] M. Mitrea, Working draft 2.0 of ISO/IEC 23093-1 IoMT Architecture, ISO/IEC JTC1/SC29/WG11 N17094, Torino, July 2017. [9] A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagent classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, 2012. [10] I. Sutskever, J. Martens, G. Dahl, and G. Hinton, On the importance of initialization and momentum in deep learning, In Proc, Int, Conf. Machine Learning (ICML), pp 1139-1147, Feb, 2013. [11] Y. Lee, P. Moon, A comparison and Analysis of deep learning framework, J. Korea Institute of Electron. Communi. Science (KIECS), vol 12, no.1, pp. 115-122, Feb, 2017. [12] M. Han et al., Visual hand gesture recognition with convolution neural network, In Proc. Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2016.

- 2018 2 : - 2018 3 ~ : - ORCID : http://orcid.org/0000-0002-1672-2345 - :,, - 2014 7 : - 2017 2 : - 2018 3 ~ : - ORCID : https://orcid.org/0000-0003-4957-9589 - :, IoT, - 1990 2 : - 1992 2 : KAIST - 1992 2 : KAIST - 1992 3 ~ 2007 2 : (ETRI) / - 2001 9 ~ 2002 7 : - 2015 12 ~ 2016 1 : UC San Diego, Visiting Scholar - 2007 9 ~ : - ORCID : http://orcid.org/0000-0003-3686-4786 - :,,, UHD/