17
Orange Canvas A Visual Programming Environment for Data Mining Justin Sun Boston DataCon September 14, 2014

Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Embed Size (px)

Citation preview

Page 1: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Orange CanvasA Visual Programming Environment for Data

MiningJustin Sun

Boston DataConSeptember 14, 2014

Page 2: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

OverviewWhy Use Orange?Classification Tree ExampleProject HistoryArchitectureWidgetsDemoResources

Page 3: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Why Use Orange?Free and open sourceNo programming needed

Visual programmingInteractive

Easy to Use – Encourages ExperimentationData VisualizationsMachine Learning Algorithms

Add-ons forBioinformaticsNetwork AnalysisText Analytics

Page 4: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Classification Tree Scheme

Page 5: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

History1996 – University of Ljubljana and Jožef

Stefan Institute started development of ML*, a machine learning framework in C++.

1997 – Python integration layer2003 – GUI based on PyQt2013 – Orange Canvas 2.7 released – Major

GUI redesign.

Source: http://en.wikipedia.org/wiki/Orange_%28software%29

Page 6: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

High-level Architecture

Algorithms written in C++

Python integration layer (Python 2.7)

Orange Canvas – Visual programming

Page 7: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

InstallationDownload full package installer from

http://orange.biolab.si/Run installer

Requires Python 2.7Includes NumPy, SciPy, PyQt, other required

librariesAfter installing, double-click on the Orange

Canvas icon

Page 8: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 9: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Scheme

Widgets

Page 10: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 11: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 12: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 13: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 14: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources
Page 15: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

DemoClassification exampleEvaluation

Page 16: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

ResourcesOrange Website: http://orange.biolab.si/

Tutorials: http://www.biolab.si/janez/kyoto/

Interactive Network Analysis with Orange http://www.jstatsoft.org/v53/i06

Orange Whitepaper with scripting examples http://www.celta.paris-sorbonne.fr/anasem/papers/miscelanea/InteractiveDataMining.pdf

Page 17: Justin Sun Boston DataCon September 14, 2014. Overview Why Use Orange? Classification Tree Example Project History Architecture Widgets Demo Resources

Thank You!