Upload
chloe-burke
View
228
Download
0
Tags:
Embed Size (px)
Citation preview
Big Data, Future of Computing,
Parting Thoughts
Slobodan VuceticAssociate Professor
Department of Computer and Information SciencesTemple University
Slides and pictures borrowed from:rio.ecs.umass.edu/~lgao/ece697_11/01Overview.ppt http://www.nsf.gov/attachments/124212/public/BIG-Data-Webinar-Honavar-Final-May8with508.pdfGoogle Images
Name(Symbol) Value
kilobyte (kB) 103
megabyte (MB) 106
gigabyte (GB) 109
terabyte (TB) 1012
petabyte (PB) 1015
exabyte (EB) 1018
zettabyte (ZB) 1021
yottabyte (YB) 1024
Big Data
4
Reasons for the Emergence of Large Data Sets:Better technology
• Storage & disks– Cheaper– More volume– Physically smaller– More efficient
Large data sets are affordable
5
Reasons for the Emergence of Large Data Sets:Better networking
• High speed Internet• Cellular phones• Wireless LAN
More data consumers
More data producers
6
Reasons for the Emergence of Large Data Sets:Better IT
• More processes are automatic– E-commerce and V-commerce– Online and telephone banking– Online and telephone customer service – E-learning– Chats, news, blogs– Online journals– Digital libraries
• More enterprises are computerized– Companies– Banks– Governmental institutions– Universities
More data is available in digital form
7
Reasons for the Emergence of Large Data Sets:Growing needs
• Science– Astronomy– Earth and environmental studies– Meteorology– Genetics
• Business– Billing– Mining customer data
More incentive to construct large data sets
Intelligence Emails Web sites Phone calls
Search Web pages Images Audio & Video
8
Big Data – Opportunities
• Big Data presents unprecedented opportunities to– Accelerate scientific discovery and innovation– Lead to new fields of inquiry that would not otherwise be
possible– Improve decision making– Understand human and social processes– Promote economic growth– Improve health and quality of life
• Sky surveys• 120 GB/week, 6.5 TB/year
Astronomy
Genomics
Remote Sensing
• Air, Land, Ocean• 100s GBs /day
Drug Discovery
• 2M of compounds• > 100M interactions
• 25K genes, 3B base pairs• 8B humans• thousands of organisms
Participatory Sensing
Big Data – Science
11
Typical router:• 42 bytes/second
• 3.5 Gigabytes/day
Internet Traffic
• 8 Billion pages• 10kB/Page• 8 TB of indexed text
WebSocial Networks
Mobile Apps
Big Data – Internet
Big Data – Intelligent Transportation SystemsThe future lies in integration, mining and analytics of BIG DATA
From the sky or space From the ground From the vehicles
13
Big Data and CIS– Specific Challenges
• Data management, collection and storage– New data storage, I/O, architectures– Efficient archiving, storing, indexing, retrieving, and recovery– Privacy and security– Cloud computing– Languages, tools, methodologies and programming environments
• Data Analytics– Data analytics under processing, memory, storage, energy constraints – Scalable and interactive data visualization– Extraction and integration of knowledge from massive, complex, multi-
modal, or dynamic data– New algorithms, languages, data structures for data analytics
Big Data and Future of Computing
• Google web search and Google news search– “big data”– “big data computer science”– “big data jobs”– “big data future computing”– “big data cloud computing”
Parting thoughts• Take-home messages from CIS 1001
– CIS is a growing field entering its golden age• Physical world will be increasingly driven by computers and
information technology• Increased importance of virtual and augmented reality world
– There is no free lunch• Work hard
– Get good GPA– Broaden your skills and perspective
• Make smart choices– Get internships– Do undergraduate research– Open a startup
• Use resources available to you– Temple advising and help desks– Professors, TAs, Colleagues– Family and friends– Web
Parting thoughts• Web is the biggest knowledge source
– It contains all known wisdom, it is there for taking, use it for your advantage – It can be a great time sink, there are many dead ends
• Some Pointers: – http://tech.mit.edu/V132/N34/education.html (free high quality courses!!!)– https://students.cis.uab.edu/wiki/index.php/Main_Page (survival guide with many
good pointers)– http://
www.topuniversities.com/student-survival/student-life/getting-through-your-first-year-uni (student survival guide)
– http://csugsac.eng.utah.edu/survival_guide/professors.html (student survival guide)– https://sites.google.com/site/princetoncsmajors/jobs/interviewing (interviewing)– http://oedb.org/fast-track-careers-computer-science (careers in CS)– http://
emmaus.patch.com/articles/five-things-to-know-about-the-future-of-computer-science (CS future)
– http://www.cs.umd.edu/~oleary/gradstudy/gradstudy.pdf (graduate school)
Parting thoughts
• Please fill out the class evaluations (e-SFF evaluations)
• Last homework: exit survey => You need to fill both to get the grade in this course!!