34
Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology Senior Research Fellow, Center for Survey Research

Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Organic data: Sensors, Social Networks, and “Big Data”

Philip S. BrennerAssistant Professor, Department of SociologySenior Research Fellow, Center for Survey Research

Page 2: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

Page 3: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Organic data: an example

Google street view

Page 4: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Organic data: an example

“using Google Street View and an archive of 1990s videotapes … doctoral student Jackelyn Hwang and Harvard sociologist Robert Sampson used Google Street View to take a virtual walking tour of Chicago. As they went, they looked for details like home renovations or new construction that indicate gentrification is underway, or litter and graffiti, which indicate it’s not.”

Page 5: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Organic v. design data

Design Organic

purpose pre-specified post-hoc

data-to-noise ratio very high very low

pathway to information analysis algorithms

sampling probabilistically selected self-selected

Page 6: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

• Location tracking/GPS

• Wearable sensors/monitors

• The “internet of things”

• Social networking services (Facebook, Twitter, Instagram)

• Transaction data (Amazon, Target, Google)

Page 7: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Wearable monitors: Location

Page 8: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Wearable monitors: Location

• Geotagging: linking a piece of data (tweet, photo, facebook post, etc.) to a specific location (latitude, longitude)

• Geofencing: creating an area used to measure the location/behavior of an individual; location flagged when individual arrives, departs fenced area

Page 9: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Geotagging

Page 10: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Geofencing

Page 11: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Geofencing

Page 12: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Geofencing

Page 13: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Geofencing

Page 14: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

• Location tracking/GPS

• Wearable sensors/monitors

• The “internet of things”

• Social networking services (Facebook, Twitter, Instagram)

• Transaction data (Amazon, Target, Google)

Page 15: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Wearable monitors: Physical condition

accelerometers actigraphs

pulse monitors oximeters

Page 16: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Wearable monitors: Activity

Page 17: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Plug-in sensors: Environmental attributes

• Gas sensor/detector (oxygen, CO2)

• Humidity

• Thermometer

• Light sensors

• Barometer

• Proximity sensors

• Network (WiFi) sensor

Page 18: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Exercise 9(1)

• Consider using location or another sensor in your research project. How would you do it? What would you measure and why?

• Would it provide a better measure than a direct survey question? Why or why not?

• Do you think it would change the way the survey respondent/study participant will behave? Why or why not?

Page 19: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

• Location tracking/GPS

• Wearable sensors/monitors

• The “internet of things”

• Social networking services (Facebook, Twitter, Instagram)

• Transaction data (Amazon, Target, Google)

Page 20: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Internet of things

Page 21: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Internet of things

• Wired devices to monitor energy use, purchases, viewing patterns (ostensibly) to improve our lives

• Can be used by social scientists

• survey questions about energy use or smartmeter, thermostat, or appliance use data

• questions about healthy eating or contents of refrigerator

• questions about hours watching TV or data from smart TV

Page 22: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

• Location tracking/GPS

• Wearable sensors/monitors

• The “internet of things”

• Social networking services (Facebook, Twitter, Instagram)

• Transaction data (Amazon, Target, Google)

Page 23: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Social networks: Facebook

“So I give you a survey you fill it out, which is very artificial, whereas ethnography, as soon as you walk into the room, you change that room, because you are a foreign presence. There’s a scientist in the room. People get self-conscious. They don’t act naturally.”

In comparison, Facebook data is not influenced by the presence of a social science researcher. “It has no artificial construct, you are not bringing people to the lab,” Nelson said. “So you are recording social interaction in real time as it occurs completely naturally.”

Page 24: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Social networks: Facebook

Wilson, Gosling, and Graham 2012

Page 25: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Social networks: Twitter

McCormick et al. 2013

Using Twitter messages to analyze planned non-voting

Use Twitter API (application program interface) to search for attribute of interest

Compare non-voters on other information (bio, location, past tweets)

Page 26: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

What is organic data?

• Unlike survey data (design data), organic data is transmitted or collected for purposes other than generating quantitative descriptors of a population

• Location tracking/GPS

• Wearable sensors/monitors

• The “internet of things”

• Social networking services (Facebook, Twitter, Instagram)

• Transaction data (Amazon, Target, Google)

Page 27: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Transaction data

• often treated synonymously with “big data”

• data “munging” to extract valuable information from the messiness

• used by the corporate world for focusing advertising, marketing

• also used by social scientists

• role in survey research?

Page 28: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Transaction data: an innocuous example

Page 29: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Transaction data: a not so innocuous example

Page 30: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Transaction data

• transaction: searching google for flu symptoms, medical information about symptoms, etc.

Page 31: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Transaction data

• transaction: searching google for particular racist epithets

• research associated these searches with high rates of black mortality

(Chae et al. 2015)

Page 32: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Design approach v. “big data” approach

• Design • draw probability sample • gain access to organic data source • code organic data • generate estimates • make inferences

• Big data • take population • use data munging and powerful algorithms • generate estimates • inferences?

Page 33: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

“Big data hubris”

“Instead of focusing on a ‘big data revolution,’ perhaps it is time we were focused on an ‘all data revolution.’”

Page 34: Organic data: Sensors, Social Networks, and “Big Data”Organic data: Sensors, Social Networks, and “Big Data” Philip S. Brenner Assistant Professor, Department of Sociology

Exercise 9(2)

• Consider using social network data (not limited to Facebook and Twitter; Instagram, Flickr, and others are options too) in your research project. How would you do it? What would you measure and why?

• Would it provide a better measure than a direct survey question? Why or why not?

• Do you agree with the quotation from lecture that Facebook updates are “natural” and therefore contain less error than other forms of data collection? Why or why not?