14
The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure ISSI 2013 Vienna, 16 July 2013 Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras

The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Embed Size (px)

Citation preview

Page 1: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

The Distribution of Referencesin Scientific Papers:

an Analysis of the IMRaD Structure

ISSI 2013

Vienna, 16 July 2013

Marc Bertin, Iana Atanassova, Vincent Lariviere, Yves Gingras

Page 2: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Problem

Scientific papers usually follow a specific rhetorical structure: the IMRaD structure (Introduction, Method, Result and Discussion).

Questions:Questions:

� What relationships exist between cited references and the structure of the text?

� How does the IMRaD structure affect the distribution of references in scientific papers?

Page 3: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Method

� Corpus: 7 peer-reviewed academic journals:� PLoS series (ONE, Biology, Computational Biology,

Genetics, Medicine, Neglected Tropical Diseases, Pathogens)

XML using Journal Article Tag Suite (JATS)� XML using Journal Article Tag Suite (JATS)

� More than 47,000 scientific articles

� Identify the section structure of the articles

� Identify cited references in the text

� Study the distribution of references according to the text progression and structure.

Page 4: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Sections Identification

• Section titles can vary according to the article.

• e.g. "Method", "Methods", "Method and Model"Model"

• Section titles were analyzed in order to match each section with one of the section types in the IMRaD structure.

Page 5: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Sentence Level Processing

� We use sentences as basic units to model text progression

� Sentence segmentation allows us to work with text elements that are smaller than paragraphsparagraphs

� Analysis of the punctuation of the text following a set of typographic rules

� For each sentence, we count the number of references it contains and obtain their distribution along the text.

Page 6: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Corpus

Page 7: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Cited References

� Cited references are present as separate elements in the XML structure

� Special cases needing specific processing: reference ranges

Page 8: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

ResultsResults

Page 9: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

PLoS ONE &

PLoS Computational Biology

Page 10: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

PloS Genetics, PLoS

Pathogens & PLoS Biology

Page 11: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

PLoS Medicine & PLoS

Neglected Tropical Diseases

Page 12: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

IMRaD Structure

Page 13: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Conclusion

� We have obtained the distribution of cited references in scientific papers.

� We have shown that this distribution seems quite stable and maybe even seems quite stable and maybe even invariant if we take into account the changes that occur in some journals in the positions of the different sections in the text of the articles.

Page 14: The Distribution of References in Scientific Papers: an Analysis of the IMRaD Structure - ISSI-2013

Thank you!Thank you!