20
Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Embed Size (px)

Citation preview

Page 1: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Subject Analysis:The INIS Thesaurus

Olga Vakula12-16 October 2015

Page 2: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Introduction to Subject Analysis

Piece of literature – INIS record:1. Bibliographic description2. Subject description Bibliographic description

Subject analysis

2

Page 3: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Introduction to Subject Analysis

Steps of Subject Analysis

subject classification (categorization) abstracting subject indexing

List of Subject Categories and Scope Descriptionshttp://nkp.iaea.org/INISSubjectCategories/

Subject Specialist

- understanding of the subject content- familiarity with the Thesaurus and indexing rules

3

Page 4: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

INIS Thesaurus„A thesaurus is a terminological control device used in translating from the natural language of documents, indexers or users into a more constrained system language. It is a controlled and dynamic vocabulary of semantically and generically related terms which covers a specific domain of knowledge“This definition has been adopted by UNESCO„Guidelines for the establishment and development of monolingual thesauri“, UNESCO, SC/W/255, Paris, September 1973

The INIS Thesaurus contains the controlled terminology for indexing all information within the subject scopes of the International Nuclear Information System (INIS).The terminology is intended for use in subject descriptions for input or retrieval of information in INIS, as well as in other suitable systems.

4

Page 5: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

INIS Thesaurus: structure

Descriptors – well defined and unambiguous Structure of the terminology is based on the

interrelationships between terms:

Preferential(preferred choice in cases of semantic ambiguity or when less precision is required)Symbols:UF used for/USESF seen for/SEE… OR…

Hierarchical(broader and narrower terms)Symbols:BT/NT

Affinitive(related terms)Symbols:RT

5

Page 6: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Interrelationships: examplesphotoemission spectroscopy2015-06-03

USE photoelectron spectroscopy

QUANTUM OPTICS2015-02-24

A field of research where interactions with light and matter are studied on the basis of quantum mechanical properties of light.

BT1 optics RT lasers RT quantum electronics RT quantum mechanics

ABUNDANCE1992-03-09

SF concentration SF concentration (analytical) SF concentration dependence NT1 element abundance RT chemical composition RT concentration ratio RT isotope ratio RT ore composition

THREE-DIMENSIONAL LATTICES2015-06-22

*BT1 crystal lattices NT1 cubic lattices NT2 bcc lattices NT2 fcc lattices NT1 hexagonal lattices NT2 hcp lattices NT1 monoclinic lattices NT1 orthorhombic lattices NT1 pentagonal lattices NT1 tetragonal lattices NT1 triclinic lattices NT1 trigonal lattices

PARTICLE ELECTRIC POLARIZABILITY2015-01-29

UF polarizability (particle electric) *BT1 particle polarizability RT electric dipole moments

aircraft fuels2000-04-12

SEE gasoline SEE jet engine fuels 6

Page 7: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

INIS Thesaurus: structure

The descriptor is placed in its correct semantic context by its wordblock - forbidden, broader, narrower and related terms

For a few descriptors where there could still exist a possibility of ambiguity (homonyms) - scope note

Descriptors 30 897 descriptors with scope notes 22 182 valid terms and 8 715 forbidden terms

7

Page 8: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Examples1. "The conductivity of the solution was measured at different temperatures" 2. "The solution of the differential equation was derived by means of computer calculations "

SOLUTIONS1999-10-11

For chemical solutions only. For mathematics see the word block of MATHEMATICAL SOLUTIONS. *BT1 homogeneous mixtures NT1 aqueous solutions NT1 fuel solutions NT1 hypertonic solutions NT1 isotonic solutions NT1 leachates NT1 process solutions NT1 solid solutions RT brines RT buffers RT dilution RT dissolution RT organic solvents RT saturation RT sols RT solubility RT solutes RT solvents RT supersaturation

8

Page 9: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Examples

PRODUCTIONLimited to industrial production; see also PARTICLE PRODUCTION. UF output RT availability RT capacity RT computer-aided manufacturing RT fabrication RT gross domestic product RT gross national product RT isotope production RT manufacturing RT planning RT productivity

9

Page 10: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Indexing rules Document Interpretation Rule“Choose such information items for indexing as you would yourself expect to find in the piece of literature if you were the user searching for that information.”(Important: avoid over-indexing)

The Specificity Rule (I)“Always use the most specific appropriate descriptor.”

KINETICS NT1 radionuclide kinetics NT1 reaction kinetics NT2 biochemical reaction kinetics NT3 cpb (competitive protein binding) NT2 chemical reaction kinetics NT3 combustion kinetics NT2 nuclear reaction kinetics NT1 reactor kineticsRT collisionsRT deck effectRT dynamicsRT gasesRT mechanicsRT motionRT statistical mechanicsRT translocation

10

Page 11: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Indexing rules The Specificity Rule (II)”Do not assign a descriptor and one of its broader terms to the same item.”

NANOMATERIALS2014-10-28

Materials containing particles where, for most of the particles, one or more external dimensions are in the size range 1 nm – 100 nm.(See also NANOSTRUCTURES.) BT1 materials NT1 nanocomposites RT dendrimers RT metamaterials RT nanoparticles

Find implicit information:

"The reaction is enhanced by the presence of platinum" contains the concept CATALYSIS"The long-half-life carbon isotope" refers to CARBON 14"A computer code was developed" - for PROGRAMMING

11

Page 12: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Data FlaggingDATAFor data flagging always use a more specific term. UF measured values SF recorded information SF tables SF values BT1 information NT1 data compilation NT1 numerical data NT2 compiled data NT2 evaluated data NT2 experimental data NT2 financial data NT2 statistical data NT2 theoretical data RT cinda (Computer Index of Nuclear Data) RT comparative evaluations RT data base management RT data covariances RT data processing RT information needs RT redundancy 12

Page 13: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Proposed terms The Proposal Rule“If no suitable descriptor exists in the Thesaurus for the retrieval of a useful concept, propose a new one, containing the following”:

Proposed term Proposed word block of the term (in particular proposed BTs) Potential forbidden terms pointing to this proposed descriptor Scope note when appropriate Explanation and justification for the proposal One or more sample records (search ICS for this term)

Google based Advisory Group: https://groups.google.com/forum/#!forum/inisthesaurus

13

Page 14: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Proposed terms: mistaken proposals

Abbreviations TLD dosimeters →THERMOLUMINESCENT DOSEMETERS

Existing forbidden terms proposed Energy deposition → use ENERGY ABSORPTION

Concept could be represented by existing term Tissue engineering → GENETIC ENGINEERING

English language related: plural/singular, spelling and typos (!) Developing country → DEVELOPING COUTRIES Fungus → FUNGI Colour → COLOR Comercial bildings → COMMERCIAL BUILDINGS

14

Page 15: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Indexing Procedures: summary

1. Carefully read the title and abstract and scan the full text (grasp the concept)

2. Identify the concept about which the document contains useful information (pretend you are a user, not an indexer)

3. "Translate" the concepts into descriptors4. Check each descriptor to make sure that:

- the descriptors represent as precisely as possible the major concept; - the definition matches the use; - the selected descriptor is the most specific appropriate choice

5. Avoid over-indexing (number of descriptors: 8-12)

15

Page 16: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

INIS Thesaurus: access

English version integrated with FIBRE and CAI for indexing

Multilingual Thesaurus (Arabic, Chinese, English, French, German, Japanese, Russian, Spanish)

Integrated with INIS Collection Search http://inis.iaea.org/search/

Interactive Multilingual INIS Thesaurus with navigation capabilities http://nkp.iaea.org/Thesaurus/

16

Page 17: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Single-, bi- and multilingual Thesauri

Single language Thesauri in PDF

Bilingual Thesauri in PDFMultilingual dictionary

17

Page 18: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Thesaurus update

English language - monthlyAvailable in pdf and XML (Member’s Area of INIS web-site)

Mono-, bi- and multilingual thesauri - twice a year: January; July

We encourage you to take this information into consideration so, that we could always stay up-to-date with all the changes and provide the latest and most relevant product to our users.

18

Page 19: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Monolingual and bi-lingual Thesauri update

19

Page 20: Subject Analysis: The INIS Thesaurus Olga Vakula 12-16 October 2015

Thank you!

20