Open Source development for students. Why should I work on free software? http://www.flickr.com/photos/inaz/454059437 By Inaz

Open Source development for students.isabel-drost.de/hadoop/slides/christoph.pdf · Mar., 10th 2010: Hadoop* Get Together in Berlin – Bob Schulze (eCircle/ Munich): Database and

Open Source development for students.

Why should I work on free software?

http://www.flickr.com/photos/inaz/454059437 By Inaz

Isabel Drost

Nighttime: Co-Founder Apache Mahout.

Organizer of Berlin Hadoop Get Together.

Member ComDev PMC.

Daytime: Software developer.

Hello...

HPI students.

Agenda

• The Apache Software Foundation.

• Apache Mahout.

• Reasons and ways to get started.

• Invitation.

What?

Apache Software Foundation

Community over code.

Meritocracy.

Open communication.

NOT:

Github, Google Code, sourceforge.

How?

Behind the scenes.

Community development

GSoC

Mentoring

University relations

How?

Open source collaboration tools are good for you.

Mahout

A sub-project of Lucene

January 3, 2006 by Matt Callow, http://www.flickr.com/photos/blackcustard/81680010

News aggregation

Today: Read newspapers, blogs, Twitter, RSS feeds.

Wish: Aggregate sources and track emerging topics.

September 10, 2008 by Alex Barth, http://www.flickr.com/photos/a-barth/2846621384

Go to cinema

Today: IMDb, zitty, movie review pages, Twitter, blogs, ask friends.

Wish: Reviews, sentiment detection, recommendations.

March 22, 2008 by Crystian Cruz, http://www.flickr.com/photos/crystiancruz/2353895708

Machine learning – what's that?

Image by John Leech, from: The Comic History of Rome by Gilbert Abbott A Beckett. Bradbury, Evans & Co, London, 1850s

Archimedes taking a Warm Bath

Archimedes' model of nature

June 25, 2008 by chase-me, http://www.flickr.com/photos/sasy/2609508999

An SVM's model of nature

The challenge

• Large amounts of data.

• Structured and unstructured data.

• Diverse tasks.

Mission

Provide scalable data mining algorithms.

• Commercially friendly license.

• Scalable to large amounts of data.

• Well documented.

• Healthy community.

• Targeted to developers.

What does Mahout have to offer?

Discover groups of items

• Group items by similarity.

• Examples:
  – Group news articles by topic.
  – Find developers with similar interests.

Discover groups of similar items

• Canopy.

• k-Means.

• Fuzzy k-Means.

• Dirichlet based.

• Others upcoming.
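
To make the clustering idea concrete, here is a minimal single-machine sketch of plain k-Means in Python. It is not Mahout code and does not use Hadoop; the function names (`kmeans`, `assign`, `mean`) and the sample points are illustrative assumptions only.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def assign(points, centroids):
    """Put every point into the cluster of its nearest centroid."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: squared_distance(p, centroids[i]))
        clusters[nearest].append(p)
    return clusters


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-Means: start from k random points, then alternate
    between assigning points and recomputing centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iterations):
        clusters = assign(points, centroids)
        # An empty cluster keeps its previous centroid.
        updated = [mean(c) if c else centroids[i]
                   for i, c in enumerate(clusters)]
        if updated == centroids:
            break  # converged
        centroids = updated
    return centroids, assign(points, centroids)


# Hypothetical 2-D sample data: two visually separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5),
          (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, k=2)
```

The other algorithms on this slide (Canopy, Fuzzy k-Means, Dirichlet based) differ in how points are assigned to groups; the sketch covers only the basic k-Means loop.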

Discover groups of similar items

Identify dominant topics

• Given a dataset of texts, identify main topics.

• Examples:
  – Dominant topics in set of mails.
  – Identify news message categories.

Algorithms: Parallel LDA

Assign items to defined categories.

• Given pre-defined categories, assign items to them.

• Examples:
  – Spam mail classification.
  – Discovery of images depicting humans.

By freezelight, http://www.flickr.com/photos/63056612@N00/155554663/

Assign items to defined categories.

• Naïve Bayes.

• Complementary Naïve Bayes.

• Random forests.

• Others upcoming.
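
As an illustration of the first algorithm on this list, the following is a small multinomial Naïve Bayes text classifier in Python with add-one smoothing. It is a teaching sketch, not Mahout's implementation; the function names `train` and `classify`, the whitespace tokenization, and the tiny spam/ham training set are assumptions made for the example.

```python
import math
from collections import Counter, defaultdict


def train(labeled_docs):
    """labeled_docs: iterable of (label, text) pairs.
    Returns per-label document counts, per-label word counts,
    and the overall vocabulary."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for label, text in labeled_docs:
        words = text.lower().split()
        doc_counts[label] += 1
        word_counts[label].update(words)
        vocabulary.update(words)
    return doc_counts, word_counts, vocabulary


def classify(model, text):
    """Return the label with the highest log-probability for text."""
    doc_counts, word_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, float("-inf")
    for label in doc_counts:
        # Log prior: fraction of training documents with this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocabulary:
                continue  # ignore words never seen during training
            count = word_counts[label][word]
            # Add-one (Laplace) smoothing avoids log(0).
            score += math.log((count + 1) / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical training data for the spam example mentioned earlier.
training = [
    ("spam", "win money now"),
    ("spam", "cheap money offer"),
    ("ham", "meeting at noon"),
    ("ham", "project meeting notes"),
]
model = train(training)
label = classify(model, "money offer now")
```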

Assign items to defined categories

• Examples based on “standard” datasets:

• 20 Newsgroups: http://cwiki.apache.org/MAHOUT/twentynewsgroups.html

• Wikipedia: http://cwiki.apache.org/MAHOUT/wikipediabayesexample.html

Recommendation mining.

• Recommend items to users.

• Examples:
  – Find books related to the book I am buying.
  – Find movies I might like.
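
One simple way to produce such recommendations is item co-occurrence: suggest items that often appear together with items the user already has. The Python sketch below shows that idea only; it is not the Taste API, and the names `cooccurrence` and `recommend` as well as the sample preferences are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence(preferences):
    """preferences: dict mapping user -> set of items.
    Counts how often each pair of items shares a user."""
    counts = defaultdict(lambda: defaultdict(int))
    for items in preferences.values():
        for a, b in combinations(sorted(items), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return counts


def recommend(preferences, user, top_n=3):
    """Rank items the user does not have yet by how often they
    co-occur with items the user already has."""
    counts = cooccurrence(preferences)
    owned = preferences[user]
    scores = defaultdict(int)
    for item in owned:
        for other, n in counts[item].items():
            if other not in owned:
                scores[other] += n
    # Highest score first; ties are broken alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked[:top_n]]


# Hypothetical data: which books each user bought.
preferences = {
    "alice": {"A", "B"},
    "bob": {"A", "B", "C"},
    "carol": {"B", "C", "D"},
    "dave": {"A"},
}
suggestions = recommend(preferences, "alice")
```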

Recommending places

Recommending people

Recommendation mining.

• Integrated Taste.

• Mature Java library.

• Java-based, web service / HTTP bindings.

• Batch mode based on EC2 and Hadoop.

Frequent pattern mining

• Given groups of items, find commonly co-occurring items.

• Examples:
  – In shopping carts find items bought together.
  – In query logs find queries issued in one session.
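
The shopping-cart example can be sketched in a few lines of Python by counting item pairs and keeping those that reach a minimum support. This is plain pair counting for illustration, not any particular Mahout algorithm; `frequent_pairs`, `min_support`, and the sample carts are assumed names and data.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs that occur together in at least
    min_support transactions, with their counts."""
    counts = Counter()
    for transaction in transactions:
        # Deduplicate and sort so each pair has one canonical form.
        for pair in combinations(sorted(set(transaction)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical shopping carts.
carts = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["beer", "bread"],
    ["butter", "milk"],
]
pairs = frequent_pairs(carts, min_support=2)
```

Counting all pairs grows quadratically with cart size, which is one reason large datasets call for scalable implementations.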

By crypto, http://www.flickr.com/photos/crypto/3201254932/sizes/l/

By libraryman, http://www.flickr.com/photos/libraryman/78337046/sizes/l/

By quinnanya, http://www.flickr.com/photos/quinnanya/2806883231/

Upcoming

• More algorithms.

• Optimization of existing implementations.

• More examples.

• Release 0.3

Jumpstart your project with proven code.

January 8, 2008 by dreizehn28, http://www.flickr.com/photos/1328/2176949559

Discuss ideas and problems online.

November 16, 2005 [phil h], http://www.flickr.com/photos/hi-phi/64055296

Become part of the community.

[email protected]@lucene.apache.org

Interest in solving hard problems.

Being part of a lively community.

Engineering best practices.

Bug reports, patches, features.

Documentation, code, examples.

Image by: Patrick McEvoy

June 7/8th: Berlin Buzzwords 2010

Store, Search, Scale

Lucene, Sphinx, Hadoop, Business Intelligence, NoSQL, HBase, Scalability, Cloud Computing, Distributed computing, Solr, CouchDB, MongoDB

Isabel Drost, Jan Lehnardt, newthinking store, Simon Willnauer

Mar., 10th 2010: Hadoop* Get Together in Berlin

– Bob Schulze (eCircle/ Munich): Database and Table Design Tips with HBase

– Dragan Milosevic (zanox/ Berlin): Product Search and Reporting powered by Hadoop

– Chris Male (JTeam/ Amsterdam): Spatial Search

http://upcoming.yahoo.com/event/5280014/

* UIMA, HBase, Lucene, Solr, Katta, Mahout, CouchDB, Pig, Hive, Cassandra, Cascading, JAQL, ... talks welcome as well.

Why?

Why should I waste my time doing stuff for free?

Work on what you want...

when you want.

http://www.flickr.com/photos/abnelgonzalez/2058764760/

Share and discuss with peers.

Discuss ideas and problems online.

November 16, 2005 [phil h], http://www.flickr.com/photos/hi-phi/64055296

Learn from the best.

http://www.flickr.com/photos/mg315/381296439/

Soft Skills.

http://www.flickr.com/photos/ajawin/3587215356/

Make work visible and re-usable.

http://www.flickr.com/photos/telstar/2916051841/

Get started

Turn users into developers.

GSoC

ComDev
