Is the Sky Falling? New Technology, Changing Media, and the Future

Is the Sky Falling?New Technology, Changing Media, and

the Future of Surveys

Mick P. CouperSurvey Research Center, University of Michigan, and

Joint Program in Survey Methodology, University of Maryland

A talk featuring Mick’s metaphors and no gratuitous graphics

“To everything there is a season, and a time to every purpose under the heaven … a time

to be born, a time to die, a time to plant, and a time to pluck up that which is planted….”

(Ecclesiastes 3:1)

“A time to tweet, a time to blog, a time to survey, a time to experiment, a time to

interview, a time to observe…”?

Is There a Future for Surveys?

With the rise of Big Data, who needs surveys anymore?

With the rise of opt-in panels, Google Consumer Surveys, Mechanical Turk, surveys on Facebook, etc., who needs probability surveys anymore?

With the rise of do-it-yourself (DIY) online survey tools, who needs survey professionals anymore?

Overview of Talk

Review three technology-driven trends with implications for surveys

• Big data• Non-probability samples (especially online panels)• Mobile data collection

Offer some observations on what this means for survey research and survey researchers

Big Data

Following Groves (2011), the term organic data may be a better descriptor

Three characteristics of organic data:• Volume• Velocity• Variability

Three related types of organic data• Administrative data – provided by persons or organizations for

regulatory or other government activities • Transaction data – generated as an automatic byproduct of

transaction and activities (e.g., credit card data, traffic flowdata)

• Social media data – created by people with the express purpose of sharing with (at least some) others

Big Data Is Exciting

Some people see big data as replacing surveys

• It’s (mostly) free, it’s everywhere, it’s big• E.g., Savage and Burrows (2007, p. 891): “…where

data on whole populations are routinely gathered as a by-product of institutional transactions, the sample survey seems a very poor instrument.”

Some even see big data as replacing science:

• 2008 article in Wired Magazine: “The End of Theory: The Data Deluge Makes the Scientific Method Obsolete.”

Some Limitations of Big Data

Single variable, few covariates

Bias through self-selection and self-presentation

Volatility or lack of stability

Privacy issues

Access issues

Opportunity for mischief

Size is not everything (bigger is not necessarily better)

File drawer problem

Single Variable, Limited Covariates

Surveys are much more than a single variable

Limited demographic variables provided or imputed may be wrong• Only about 1/3 of Facebook users provide

demographic information• Demographic information not available for 30-40%

of Google Consumer Survey respondents• What is available (derived from cookies) may be

wrong, e.g., gender matches about 75% of time

Knowing changing fuel prices is not the same as knowing what people do in response to such changing prices

Match of Reported and Inferred Gender

Source: Keeter and Christian (2012)

No demographics available for about 30-40% of Google consumer survey respondents

Accuracy of Information from DoubleClickCookie

For example, what Google thinks I am on one of my browsers and devices:

Check your own profile at:

• https://www.google.com/settings/ads/onweb/

Two sources of bias:• Selection bias• Self-presentation (measurement) bias

Selection bias: “haves” versus “have-nots”• Not everyone uses social media!• Need to distinguish between producers and users of users of

social media – about 13% of US online population actively tweets

• Not everyone uses loyalty cards or credit cards, or makes purchases online

Measurement bias• Impression management is a key element of social media• The average Facebook user has 229 “friends”

Volatility or Lack of Stability

What will Facebook look like 5 or 10 years from now? Will it even exist?• Anyone remember MySpace? Second Life?• Who’s on Google+?

Twitter has only been around since 2006, and grew 5000% in 5 years• Twitter today is very different from Twitter 5 years ago

Google.com was registered in 1997 – making it a mere teenager

Social media may be good for measuring short term trends, but surveys may be better for longer-run measurement

The Growth of Facebook Users and Articles

Source: Wilson, Gosling, and Graham (2012)

Privacy Issues

The more people become aware of what is being done with their data, the more they may opt-out or limit sharing

• E.g., choose to pay cash for certain transactions (alcohol, condoms, etc.)

• E.g., use fake identities or aliases online

EU legislation on cookies

Growth of “do not track” options – now the default in IE 10

Privacy options are changing on social media

Access Issues

Social media and transaction data are usually proprietary

• Only available to insiders, or at a cost• Exception: Twitter

A key strength of surveys is public access to data, permitting replication and reanalysis

Opportunity for Mischief

Three factors increase the likelihood of mischief with social media relative to other media (e.g., call-in polls)

• Relative anonymity of the Internet• Virtually costless• Automated systems can be written to generate

content

83 million Facebook accounts (8.7% of all accounts) are estimated to be fake

Source: http://www.ubermotive.com/?p=68

Bigger ≢ Better

Exhibit A:

• Very large sample (n=10 million) from commercial databases

• Response rate comparable to many telephone polls• 2.3 million surveys returned• Correctly predicted last 5 elections

But wrong!

This was the Literary Digest Poll of 1936

1948 U.S. Election

Famous “Dewey Defeats Truman “ headline illustrates failure of polls in 1948

Failure of quota samples led to rise of probability-based methods

2012 U.S. Election Polls

Source: FiveThirtyEight Blog in NYT

The File Drawer Effect

“For any given research area, one cannot tell how many studies have been conducted but never reported. The extreme view of the ‘file drawer problem’ is that journals are filled with the 5% of the studies that show Type I errors, while the file drawers are filled with the 95% of the studies that show nonsignificant results”(Rosenthal 1979)

Macroeconomic Conditions and Problem Drinking as Captured by Google Searches

Source: Frijters et al. (2013)

Twitter Flu Trends 2009-2010

Source: Paul and Drezde (2011)

Twitter Flu Trends 2007-2011

Source: Murphy (2013)

Google Flu Trends 2011-2013

Source: Butler (2013), in Nature

Salvia Trends Compared to NSDUH

Source: https://blogs.rti.org/surveypost/2012/01/04/can-surveillance-of-tweets-and-google-searches-substitute-survey-research-2/

Obesity-Related Tweets and McDonalds Restaurants

Ghosh and Guha (2013) report a “strong correlation” between the two

Any alternative explanation come to mind?

Stupid Data Mining Tricks

Source: Leinweber (2007, original paper from 1995)

Is the Sky Falling? New Technology, Changing Media, and the Future

Documents

The Sky is Falling - Peter Smagorinsky · · 2017-09-07The Sky is Falling A seventh grade unit ... Another option is The Wizard of Oz and Wicked. ... Comparing the point of view

The Sky Is NOT Falling! The Sky Is NOT Falling! Combat Resilience Special Operations Psychology - Force Multiplication for the 21 st Century Special Operations

Chicken Licken The –y word family. sky Chicken Licken thinks the sky is falling!

Sky is Falling - Roth and Singer

Lester Del Rey - The Sky is Falling

Gene Patenting after In re Kubin: The Sky is Not Falling The

Inside: Is the Sky Falling on Dental Education?

The Sky is Falling: Chemical Characterization and

Sky Is Falling.pdf · The sky Is Falling Written by Katie Knight Illustrated by Joe Boddy . The Sky Is Falling Written by Katie Knight Illustrated by Joe Boddy . A nut falls from

Asterix and the Falling Sky

Know your facts Double and triple check them The sky probably isn’t falling “The sky is falling!! The sky is falling”

VOCABULARY WORDS. dear Oh, dear! The sky is falling!

Sky falling vocab

The Sky Is Not Falling

Death Falling from the Sky

Is The Sky Falling? Segmented Risk Identification Questions

Is the Sky Falling for Airline Profits in the - Climate Advisers

Campbell: The Sky is Falling (Again)

Flying and Falling – Book from the heart to the sky and back · 2016. 11. 28. · Jan Rudzinskyj FLYING AND FALLING Book from the heart to the sky and back. 3 Contents Acknowledgements

When the Sky is Falling: Network-Scale Mitigation of Reflection/Amplification DDoS Attacks