Lee Tae Youngnexr.com/upload/legacy.pdf · 2020-04-16 · Based sampling Probability Distribution...

Preview:

Citation preview

1

Lee Tae Young

2

3

4

5

Big Data Data

Refinery

Data

Delivery Web

Services

Interface

6

Command Line Interface

Core Spring Batch

Cluster Integration

Visual Rules Integration

Event Management Integration

Cluster

Registry

Artifact

Repository

Visual Rules Batch Platform

7

branch

Main

Office

Batch Job

Processing

Mainframe

?

? ?

?

8

9

Web

HDFS

MR

HBASE

User

RHIVE HIVE

Nutch

Spring framework batch

JBOSS (WAS)

PIG

10

11

S

12

S

13

1 2

S

14

1 2

15

16

C

17

F

18

F

19

20

~라도

~가

21

N

22

23

24

25

26

27

Distribution

Based sampling

Probability

Distribution calc.

Sampler

Local Disk

Visualization

Reporting

Flow Designer

HDFS

Hadoop/Hive/Hbase

M/R R UDP

RServe TCP/IP

M/R R UDP

RServe TCP/IP

M/R R UDP

RServe TCP/IP

Export-R

R

rJava

R-Hive

Bridge

Hive

Client

R Studio

Ad-hoc analysis

Cleansing

Console

28

29

30

31

32

33

34

35

36

branch

Main

Office

Batch Job

Processing

Mainframe

?

? ?

?

Recommended