
Moving From Small Science To Big Science


Page 1: Moving From Small Science To Big Science
Page 2: Moving From Small Science To Big Science

We will look at two case studies: marine mammal science and psychiatric genetics.

Page 3: Moving From Small Science To Big Science

Subjects: 41 interviewees, including principal investigators, junior researchers, and technicians.

The purpose of the projects the researchers are involved in is to track the individual mammals they study.

Place: 13 different laboratories in the U.S. and Europe. Period of their projects: about 40 years, starting in the 1970s.

These researchers are scientists rather than social scientists, but their experience of organizing the data is closer to social science.

At first, sharing the data seemed helpful. For example, a certain school of dolphins (200-500 dolphins) stays in one area, so only researchers working near that area could study them. Once people gathered and shared their information, researchers living in other areas could study the dolphins as well.

Page 4: Moving From Small Science To Big Science

Leah Tull: Well, honestly, I’m very protective about it… I guess it rather bugs me that I have to do the work and everyone always asks me for a CD… it’s our scientific study.

Others also say that it is hard to organize their own data while considering how others will systematize and standardize theirs. How much, where, and with whom researchers should distribute and share the information remains an unsolved question.

In the past, a project was run by a small scientific group. People could get information through informal gatherings based on common attendance at a university or through shared contacts. Now, some people are working to build much larger databases, such as a project named SPLASH.

SPLASH involves over 300 scientists from 50 research groups working in various areas of the Pacific Ocean (Calambokidis et al., 2007).

Page 5: Moving From Small Science To Big Science

Background: scientists use photographs to identify individual mammals.

Problem 1: most pictures taken before 2003 are in the form of slides, black-and-white negatives, or black-and-white prints. Since 2003, many scientists have switched to digital photography and have used different, idiosyncratic systems to manage their digital catalogs.

Problem 2: the amount and the range of the data are too broad, because the purpose of collecting the data is to track the mammals rather than to organize the data. It means

Page 6: Moving From Small Science To Big Science

Psychiatric genetics

Subjects: around 50 researchers; the number of participating institutions expanded from 4 to 11. Each laboratory has one to five researchers.

Period: about 20 years

The project the researchers are involved in is the BP (bipolar disorder) project.

The BP project was selected for GAIN (Genetic Association Information Network). GAIN is not a funded group; it is a network that encourages researchers to organize and share their data, which helps others and, in return, the researchers themselves.

Page 7: Moving From Small Science To Big Science

Background: researchers have collected blood, genetic data, and phenotypic data on thousands of subjects.

Problem 1: the amount and the range of the data are too broad. For example, the largest item is the interview data, which runs to over 100 pages. Each interview took 4-6 hours and includes approximately 2600 variables. Additionally, each record includes a trained clinician’s analysis, family history, medical records, and other information. Each subject has multiple best estimates from at least two clinicians plus the interviewer and an editor.

Problem 2: the data were encoded in three different versions of the interview instrument.

1) The first data set was collected in an Oracle database.

2) The second data set was organized in a Paradox database.

3) The third data set was managed in a proprietary database, using laptops and PCs.

Problem 3: the three data systems are not compatible.

Problem 4: the diagnoses were made under different systems. The earliest diagnoses use a combined DSM-IIIR/RDC system, while the latest subjects were diagnosed under DSM-IV.

Problem 5: variables in the three versions are confusing. All three versions were converted from their original storage into SAS files, but their variable names are not consistent. For example, one variable is “I1120” in the first set, “Number_of_manic_episodes” in the second set, and “V756” in the third set. Organizing these data requires both professional domain knowledge and information-organizing skills.
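One common way to handle inconsistent names like these is an explicit crosswalk table. The sketch below is a minimal illustration, not the BP project's actual method. The canonical name `manic_episode_count`, the set labels `set1` to `set3`, the extra variable `V001`, and the values are assumptions; only the three source variable names come from the example above.

```python
# Hypothetical crosswalk: canonical name -> name used in each data set.
# Only the three source names come from the text; everything else is assumed.
CROSSWALK = {
    "manic_episode_count": {
        "set1": "I1120",
        "set2": "Number_of_manic_episodes",
        "set3": "V756",
    },
}


def harmonize(record, source_set):
    """Return a copy of `record` with source-specific names replaced by canonical ones."""
    # Build a reverse lookup for this data set: source name -> canonical name.
    reverse = {
        names[source_set]: canonical
        for canonical, names in CROSSWALK.items()
        if source_set in names
    }
    # Names without a crosswalk entry are kept unchanged.
    return {reverse.get(name, name): value for name, value in record.items()}


# Invented values, for illustration only.
rows = [
    harmonize({"I1120": 3}, "set1"),
    harmonize({"Number_of_manic_episodes": 5}, "set2"),
    harmonize({"V756": 2, "V001": "x"}, "set3"),
]
```

After harmonizing, all three rows carry the same key for the same variable, so they can be combined. Building the crosswalk itself still requires someone who knows what each coded name means.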

Page 8: Moving From Small Science To Big Science

As we can see from the two cases, moving from a small scientific project to big science is hard. The researchers in the SPLASH and BP collaborations are trained for their scientific tasks, but not for organizing information. Training in organizing information would help.

In SPLASH, the new system combines three versions of systems and was not designed to expand further. If it is expanded, it will run into incompatibility problems.

In BP, even though there were fewer researchers than in SPLASH, there were problems. The researchers had difficulties with computer programming. For example, they had a hard time implementing EAV (entity-attribute-value) with so many different variables.
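An EAV layout stores one row per subject, variable, and value instead of one column per variable, which is one way to cope with thousands of variables. The sketch below is a minimal illustration under assumptions: the subject ID `S001`, the variable names, and the values are invented for the example and do not come from the BP data.

```python
def to_eav(subject_id, wide_record):
    """Turn one wide record (one key per variable) into EAV triples."""
    # Missing values (None) are skipped, so sparse interviews produce few rows.
    return [
        (subject_id, attribute, value)
        for attribute, value in wide_record.items()
        if value is not None
    ]


def from_eav(triples):
    """Rebuild wide records, keyed by subject, from EAV triples."""
    wide = {}
    for subject_id, attribute, value in triples:
        wide.setdefault(subject_id, {})[attribute] = value
    return wide


# Hypothetical subject and variables, for illustration only.
record = {"manic_episode_count": 3, "age_at_onset": None, "diagnosis_system": "DSM-IV"}
triples = to_eav("S001", record)
restored = from_eav(triples)
```

The trade-off is that the wide form has to be rebuilt before most analyses, which is one reason an EAV design can be hard to implement for researchers without programming training.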

Furthermore, SAS does not allow an ampersand in variable names. So “Total Manic & Depressive Episodes” in Paradox became “total_manic_depressive episodes” in SAS.
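A label-to-name conversion of this kind can be sketched as below. This is an assumption about how such a rule might look, not the project's actual conversion. The function name `to_sas_name` is hypothetical, and the rules follow SAS's default variable naming (letters, digits, and underscores only, not starting with a digit, at most 32 characters). Unlike the converted name quoted above, this sketch also replaces spaces with underscores.

```python
import re


def to_sas_name(label):
    """Convert a free-text label into a name valid under default SAS naming rules."""
    # Replace every run of characters other than letters and digits with one underscore.
    name = re.sub(r"[^A-Za-z0-9]+", "_", label).strip("_").lower()
    # A name must not start with a digit (this also covers an empty result).
    if not name or name[0].isdigit():
        name = "_" + name
    # Default SAS variable names are limited to 32 characters.
    return name[:32]


converted = to_sas_name("Total Manic & Depressive Episodes")
```

Because long labels are truncated, two different labels can end up with the same name, so a real conversion would also need a collision check.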

Page 9: Moving From Small Science To Big Science

Style of social interaction in the project

SPLASH didn’t “try to force them to do it one way” (Jacob Tipton). The BP project was always very decentralized. Both the SPLASH and BP projects have non-dogmatic leaders.

This flexible and decentralized form of leadership is common among scientific and creative teams (Mumford, Scott, Gaddis, & Strange, 2002) and is not inherently problematic. Science relies on the freedom of scientists to innovate (Bush, 1945; Gordon, Marquis, & Anderson, 1962), although some recent work suggests that these patterns are changing in the face of calls for increased accountability and relevance in scientific work (Demeritt, 2000; Harman, 2003).

The question is to what extent data management practices should be dictated, and to what extent individual scientists should be allowed to ignore or skirt issues of compatibility and data availability.

Page 10: Moving From Small Science To Big Science

Derek de Solla Price (1963) identified some of these issues four decades ago in his work that helped to develop the field of scientometrics.

More recently, scholars in computer science have addressed issues of scalability (Simmhan, Plale, & Gannon, 2005; Zheng, Venters, & Cornford, 2007).

Any number of papers discussing the implementation of Grid-enabled projects have identified scalability as one of the key issues developers have had to deal with (Pakhira, Fowler, Sastry, & Perring, 2005; Shimojo, Kalia, Nakano, & Vashishta, 2001).

Only recently have researchers begun to pay attention to how small scientific projects negotiate the changes required as they move towards becoming large, collaborative scientific projects (Calson & Anderson, 2006; Walsh & Maloney, 2007). Scientists attempt to sustain these collaborations over time (Bos et al., 2007).