2017 2017 Data Industry White Paper
2017 1
4 1 2 3 4 5 6 7 8 9 10 Interview
1 DBMS 4 DBMS * 128 2017
DBMS Database Management System DB DBMS DBMS NoSQL Non-Structured Query Language DBMS NoSQL 4 4 Relational, Mainframe INGEST Documents, Emails Social Media, Web Logs, Machine Devices, Cloud Managed Information Object Secure Searchable Understandable Profiled Traceable LINEAGE ENTITLEMENTS INDEX CATALOG The Data Lake Architecture http www hsolutions fi services 129
OLAP http www karvyanalytics com Expertise Analytics Factory aspx 130 2017
4 SW DB DB DB DB 4 DB 131
2 IT 1 * PPC 132 2017
IT 4 IT 2 ETL 133
SNS XML IoT Rationalization Data enrichment Consolidation Enrichment Rationalization Single Versio of Truth 134 2017
4 3 20 SQL SQL ROW API 135
API CSV TSV Delimiter JSON TCP CSV TSV Delimiter JSON SNS SNS API Change Data Capture CDC DBMS Redo Log DML DDL TeraStream for Hadoop RSC 1 Row Count CPU Property 136 2017
TeraStream BASS BDI 4 DBMS DBMS Redo Log Archive log SQL DBMS Hex DBMS HDFS Insert Update Delete HIVE 0.14 CRUD 4 HW SW 137
RDB VOC 10G SQL UI GUI GUI 1 2 3 4 5 6 7 8 IOT 138 2017
5 4 Categorize Metadata Management Discovery 139
3 DBMS 18 19 IT 4 3D IoT 4 DBMS 1 DBMS DBMS DBMS DBMS key value * 140 2017
Relational Database Management System RDBMS 4 RDBMS NoSQL DBMS DBMS NoSQL DBMS NoSQL SQL NoSQL DBMS NoSQL DBMS DBMS RDBMS IT DBMS DBMS 2 DBMS 2013 7 12c Oracle 12c 2017 4 Oracle Exadata Cloud Machine On-Premise 141
IBM 2016 7 IBM DB2 V11.1 9.7 11.1 2017 SQL SQL Server 2016 SQL SQL SQL SAP HANA 2 HANA 2 DB SAP Google Cloud Launcher Martetplace SAP HANA express edition PaaS SAP 2015 3 Tibero 6 DB Tibero 6 DBMS DBMS T-Up 2016 10 TmaxCloud Day IaaS PaaS KT AWS Tibero Tibero Zeta Edition 142 2017
2013 DBMS SUNDB 4 Near-zero latency DBM DataBase Manager Goldilocks DBMS G-Cloud DBMS 2016 Cubrid 10.0 MVCC Multi-Version Concurrency Control Snapshot Isolation SQL Function PostgreSQL ORDBMS SQL MVCC AWS Aurora PostgreSQL DB MariaDB MySQL 90 DBMS 2017 DB 10.2 DB 10.2 Window Function Syntax Information Schema EXPLAIN Variables Script DB 1.0 MariaDB ColumnStore 1.0 2016 12 DB OLTP SQL DB DBMS IT DBMS 3 DBMS DBMS IT DBMS DBMS 143
DBMS Advocacy B2B HR IaaS IBM IBM Red Hat Certified Cloud and Service Provider Red Hat OpenStack Platform Red Hat Ceph Storage IBM Red Hat Cloud Access 2017 2 IBM IBM Cloud Red Hat Enterprise Linux IBM IBM Cloud Data Centers SQL 6 16 Premium Assurance 2008 SQL 2008 2008 2008 R2 SQL 2008 2008 R2 144 2017
SAP 17 Google Cloud Next 17 4 SAP Google Cloud Platform SAP HANA SAP Google Cloud Launcher Marketplace SAP HANA SAP HANA express edition AWS SAP CSP Cloud Service Provider DBMS DBMS Tool DBMS DBMS 2016 Altibase 6.5 2016 SUNDB 3.0 Maxscale SUNDB DBMS OLTP RDBMS DBMS DBMS DBMS 4 18 19 145
IT 4 4 3D IoT DBMS OS RDBMS DBMS DBMS DBMS CRM Customer Relationship Management DBMS IT DBMS OLTP RDBMS DBMS DBMS DBMS DB DB DB DB 146 2017
4 4 1 Data Lake BI Business Intelligence Pentaho CTO James Dixon 2014 BI * 147
2 CSV RCFile Record Columnar File ORC Optimized Row Columnar GZip GNU zip LZO Lempel-Ziv-Oberhumer Python Pig SQL GUI IoT CQL Continuous Query Language orchestration authentication authorization audit BI DW 148 2017
3 4 on-premise TeraData DellEMC Apache Hadoop Apache Spark Apache NiFi Data Lake Kylo 2.0 EMC Pivotal VM VMware EMC FBDL Federation Business Data Lake NoSQL MPP FBDL VM VM SQL SQL-on- Hadoop HAWQ HD PivotalHD SAS Tableau Cloudera Hortonworks EMC V EMC VBlock EMC EMC ISILON 149
CSP Cloud Service Provider Azure AWS U-SQL R Net Visual studio Eclipse U-SQL Spark Hive Storm GUI AWS AWS Cloud Formation Amazon Dynamo DB Elastic search AWS DW RDBMS AnyMiner AnyMiner 150 2017
IoT 4 Inverted index UI 4 2014 cleansing 151
152 2017
5 4 1 BI Business Intelligence Advanced Analytics BI Self-Service * 153
Spark R Python BDaaS Big-Data-As-A Service BI KPIs OLAP Cube IT 2 BI BI BI MATRIX Suite Octagon EOS ERS WISE OLAP D3 R SAP BO IBM Cognos 154 2017
OLAP Qlikview Tableau Splunk 4 BigO Azure Cortana Intelligence Suite BI BI R Python Anaconda Jupyter Pandas Scikit-learn 155
3 BDA Map Reduce Spark HDFS SQL SQLon-Hadoop Apache Tajo Cloudera Impala MongoDB NoSQL Key-Value Documents Graph Database SAP HANA DBMS In-Memory DBMS IMDBMS DW IT TensorFlow DMTK Distributed Machine Learning Toolkit Baidu Andrew Ng Warp-CTC Connectionist Temporal Classification 4 IDC 156 2017
BDaaS BigML https bigml com 4 Predictive analytics 157
6 OLAP BI BI 1 Business Intelligence BI BI DBMS BI OLAP BI BI OLAP R Python Spark * 158 2017
4 IDC IDC 159
2 Wise- OLAP MATRIX DATAPLANET Octagon EOS ERS BI SAP BO Microstrategy MSTR TableauSoftware Tableau QlikTech QlikView TIBCO Spotfire R Shiny ggplot2 WISE OLAP WISE Advisor WISE Visual WISE Campaign WISE DQ 3 MATRIX i-big DATAPLANET R D3 Octagon EOS ERS Octagon BI Platform R GIS Octagon Advantage Octagon Visualization ankus MapReduce R 160 2017
4 1 2 Analytics 3 R OLAP 161
4 OLAP 5 162 2017
4 4.3 30.2 2018 77 4 2019 98 * 4 * 2015 2016 163
7 4 1 Master Data ERP SCM MES MMS BI EDW ID Business Code Reference Code * 164 2017
4 2 Master Data Management Solution MDM MDM DQ DI ERP MES IT MDM MDM ERP MDM 3 MDM MDM ERP Data Management Platform MDM MDM SAP IBM StiboSystems 4-7 1 165
MDM UI ERP MDM ERP MDM DQ DI MDM 2000 MDM MDM MDM ERP ERP MDM ERP ERP ERP MDM DQ DI 166 2017
UI UI 4 4 MDM MasterStream DQM Global MDM Configuration Setting BI ToBeWAY Enterprise MDM 360 ToBeWAY Operational MDM MES SCM WMS MMS MDM ERP MDM ToBeWAY Standard Reference Code Multi-domain MDM 2017 Magic Quadrant for MDM DQM ETL MDM Consolidation MDM IDD UI UI UI Product 360 Heiler PIM Product Information Management Multi-domain MDM 167
SAP Master Data Governance MDG ERP ERP MDM ERP Central Management ABAP SAP ERP MDG ERP MDG ERP MDM Audit Risk MDM MDM MDM MasterStream Multi domain MDM Global MDM Multi domain ToBeWAY Enterprise MDM v 8 5 Multi domain MDM ToBeWAY Operational MDM Operational ToBeWAY Standard Reference Code Reference EnterWorks MDM EnterWorks Enable v 8 1 Multi domain IBM InfoSphere MDM v11 5 InfoSphere MDM Reference Data Hub Multi domain Reference 168 2017
MDM MDM MDM 4 Informatica MDM v 10 2 Multi domain Informatica Supplier 360 v 10 1 Customer 360 Cloud Customer 360 for Salesforce Domain Specific Cloud Product 360 v 8 0 5 2012 Heiler PIM Domain Specific Magnitude Software MDM Kalido MDM v 9 1 SP3 Magnitude One Multi domain Cloud Product Hub Supplier Hub Site Hub vr12 2 Domain Specific ORACLE ERP Customer Hub vip 2016 Product Cloud Service Customer Cloud Service vr11 Domain Specific Cloud Data Relationship Management vr11 1 Hyperion Multi domain Orchestra Networks MDM EBX5 v 5 7 Multi domain Riversand MDM MDMCenter v 7 8 Multi domain Master Data Governance MDG v 9 0 Multi domain SAP ERP SAP NetWeaver MDM 7 1 SP17 Multi domain Hybris Product Content Management PCM v 6 2 Domain Specific Stibo Systems MDM Stibo Enterprise Platform STEP Trailblazer v 8 0 Multi domain TIBCO Software TIBCO MDM v 9 0 TIBCO Cloud MDM v 9 0 Multi domain Cloud Reltio MDM Reltio Cloud Analytics Cloud 169
5 4 10 MDM MDM Domain- Specific MDM MDM Multi-Domain MDM MDM MDM PIM CIM 2016 2 Magic Quadrant 2017 Magic Quadrant for MDM MDM MDM Multi-Vector MDM MDM 5 MDM MDM MDM MDM MDM MDM MDM MDM GSI(Global Single Instance) ERP IT ERP 170 2017
MDM MDM MDM MDM MDM MDM MDM 4 171
8 1 * 172 2017
4 2 QualityStream DATAWARE DQ# WISE DQ DQMiner Informatica Data Quality IBM InfoSphere Information Server SAS SAS Data Quality 173
WISE DQ preanalysis Outlier DQMiner 80 QualityStream CDC DATAWARE DQ DATAWARE DQ DATAWARE DQ DATAWARE DA DATAWARE Meta 174 2017
Informatica Informatica DataQuality IDQ 4 IBM InfoSphere QualityStage SAS SAS Data Quality Informatica DataQuality IDQ 10 IT IoT Machine Learning IBM InfoSphere Information Server IBM IoT SAS SAS Data Quality ETL Extract transform and load ELT Extract load and transform RDB IoT 175
Big data Preprocessing 4 3 1 Magic Quadrant IBM Watson 10 IoT SNS 4 176 2017
4 2~3 177
9 1 IoT APT Advanced Persistent Threat 100 APT * 178 2017
DRM 4 2 IT IT 179
3 DRM 3 DRM PC 180 2017
LOB Line Of Business 4 Data Lifecycle Data Protection Data Repositories PC DB 4-9 2 181
Data Lifecycle 4 9 IT IoT 182 2017
4 PPDM Privacy Preservation in Data Mining 2000 183
10 1 TensorFlow CNTK Spark MLlib R Caffe Veles Azure ML Studio AML SAS Matlab Watson * 184 2017
4 2 R Spark MLlib CNTK TensorFlow Caffe Veles Decision Tree Logistic Regression SVM k NN CF HMM Bayesian Network CNN RNN 4-10-3 185
R Spark MLlib CNTK CNT Computational Network Toolkit Azure Azure CNTK GPU Cortana TensorFlow GPU TPU Caffe CNN Veles 3 Azure ML Studio AML Amazon Machine Learning TensorFLow API 186 2017
4 Azure ML Studio Microsoft Azure ML Studio Azure ML Studio API 3 AML AML S3 Redshift RDS MX API 4 IBM Watson IBM 20 API API IBM API 187
IBM API IBM Azure IoT Paxata 5 188 2017
189 4
Interview 04 2016 RDB RDB IT 17 13 2017 2017
2018 2019 RDB DB RDB RDB OLAP 2018 RDB DB
295 6
328 2017
329
330 2017
2017 2017 Data Industry White Paper 75 9 42 7 Tel. 02-3708-5300 Fax. 02-318-5040 www.kdata.or.kr 9 772465 766005 ISSN 2465-7662