International Atomic Energy Agency (IAEA)
International Nuclear Information System (INIS)

DIGITISATION

INIS Training Seminar, 7-11 October 2013, Vienna, Austria
Thomas Kalapurackal, INIS Unit

INIS NCL Production

WHAT IS DIGITISATION?
DIGITISING IS NOT PHOTOCOPYING!

The process of converting paper documents, microfilm, microfiche, etc. into electronic image files.

DIGITISATION

HARD COPY/SOFT COPY DOCUMENTS
FULL TEXT SEARCHABLE ELECTRONIC DOCUMENTS

WHY DIGITISATION?

It came as a most wonderful and welcome tool in the hands of libraries, museums, archives, societies, publishers, and others for preserving billions of paper/analogue documents in digital format.

Retaining the original look, with a view to future relevance

Protection from loss or danger

Effective, efficient and purposeful use

Knowledge transfer to the next generation

IN THE PAST

Paper filing and capturing documents on film were common preservation methods.

TIMES HAVE CHANGED

Information must now be stored on media where storage is safe and retrieval is quick.

DIGITISATION

THE AGE OF DIGITISATION HAS BEGUN!

HISTORY OF DIGITISATION

The first image scanner developed for use with a computer was a drum scanner. It was built in 1957 at the US National Bureau of Standards by a team led by Russell A. Kirsch.

HISTORY OF DIGITISATION

The first image ever scanned on this machine was a 5 cm square photograph of Kirsch's then-three-month-old son, Walden.

HISTORY OF DIGITISATION

In 1975 Ray Kurzweil developed the first CCD flatbed scanner.

Kurzweil is also known as a pioneer of text-to-speech technology.

DOCUMENT SCANNER

A flatbed scanner is usually composed of a glass pane (or platen), a bright light underneath it (often xenon or cold cathode fluorescent) that illuminates the pane, and, in CCD scanning, a moving optical array.

DOCUMENT SCANNER

Leading flatbed scanners on the market are from:

FUJITSU
KODAK
HP (HEWLETT PACKARD)
CANON
XEROX
EPSON
PANASONIC
and many more

DIGITISATION

THE VATICAN RECENTLY DIGITISED ITS WHOLE LIBRARY COLLECTION!

AT INIS

INIS has two colour scanning stations at present:

FUJITSU (Model fi-5750C) (72 pages per minute, A4 size)

KODAK (Model i1440) (75 pages per minute, A4 size)
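As a rough planning aid, the two rated speeds above can be turned into a time estimate for a batch of pages. This is a minimal sketch: the function names are illustrative, and it assumes the rated speed is sustained for the whole batch, which ignores document preparation, feeder jams and quality control.

```python
# Rated speeds quoted on the slide, in A4 pages per minute.
SCANNERS = {"fi-5750C": 72, "i1440": 75}


def minutes_to_scan(pages, pages_per_minute):
    """Best-case scanning time for one scanner, in minutes."""
    if pages < 0 or pages_per_minute <= 0:
        raise ValueError("pages must be >= 0 and pages_per_minute > 0")
    return pages / pages_per_minute


def combined_minutes(pages, rates):
    """Best-case time when the batch is split across scanners running in parallel."""
    return pages / sum(rates)


# 720 pages on the fi-5750C alone, then 1470 pages split across both stations.
single = minutes_to_scan(720, SCANNERS["fi-5750C"])
both = combined_minutes(1470, SCANNERS.values())
print(f"one station: {single:.1f} min, both stations: {both:.1f} min")
```

In practice the estimate is a lower bound; real throughput depends mostly on how much manual preparation each document needs.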

AT INIS

Since its creation in 1970, INIS has collected and disseminated the NCL reports received from Member States and international organisations.

In 1997 INIS replaced the microfiche-based production system with an imaging system to process and disseminate all NCL documents in electronic format.

DOCUMENT PREPARATION

Page size (A5 to A0)
Color or B/W?
Single or double side?
Quality of document
Can we feed the document?
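Page size matters because, at a given resolution, it fixes the pixel dimensions of the scanned image. The sketch below assumes ISO 216 A-series dimensions in portrait orientation and defaults to 300 dpi, the resolution listed in the settings. The table and function names are illustrative.

```python
# ISO 216 A-series page sizes in millimetres (width, height), portrait.
A_SERIES_MM = {
    "A0": (841, 1189),
    "A1": (594, 841),
    "A2": (420, 594),
    "A3": (297, 420),
    "A4": (210, 297),
    "A5": (148, 210),
}

MM_PER_INCH = 25.4


def pixel_size(page, dpi=300):
    """Return (width, height) in pixels for a page scanned at the given dpi."""
    width_mm, height_mm = A_SERIES_MM[page]
    return (
        round(width_mm / MM_PER_INCH * dpi),
        round(height_mm / MM_PER_INCH * dpi),
    )


for name in ("A5", "A4", "A0"):
    print(name, pixel_size(name))
```

An A4 page at 300 dpi comes out at 2480 x 3508 pixels. Each step up in the A series doubles the page area, so the pixel count roughly doubles as well.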

SETTINGS

Resolution (300 dpi)
Single Side/Double side
Format (TIFF/PDF/JPG etc.)
Page size (A5 to A0)
Orientation (Portrait/Landscape)
Source (Feeder/Flatbed)
Brightness
Contrast
Noise Removal
Deskew?

QUALITY CONTROL & IMAGE ENHANCEMENT

Deskew?
Black Border?
Noisy?
Image Positioning
Crop?
Clean-up?
Rotate?
Convert color?
Page missing?
Insert Page?
Re-size?
Delete, Split, Copy?

Image Enhancement
Skewed?

Image Enhancement
Noisy?

Image Enhancement
Image Positioning

Image Enhancement
Black Border?
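The idea behind black-border removal can be sketched on a tiny image held as a list of pixel rows. This is a simplified illustration, not the method used by the scanner software or by PixEdit: it assumes the border is perfectly black (value 0) and perfectly rectangular. Real scans need a tolerance for grey or ragged edges.

```python
def crop_black_border(image, black=0):
    """Crop away outer rows and columns that contain only `black` pixels.

    `image` is a list of equal-length rows of pixel values.
    Returns a new, cropped list of rows (empty if the image is all black).
    """
    # Rows and columns that contain at least one non-black pixel.
    rows = [y for y, row in enumerate(image) if any(p != black for p in row)]
    if not rows:
        return []
    cols = [
        x for x in range(len(image[0]))
        if any(row[x] != black for row in image)
    ]
    top, bottom = rows[0], rows[-1]
    left, right = cols[0], cols[-1]
    return [row[left:right + 1] for row in image[top:bottom + 1]]


# 1 = paper, 0 = black. The single black pixel in the middle is page content.
scan = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
page = crop_black_border(scan)
```

Only the outer frame is removed. Black pixels inside the page, such as text, are left alone because their rows and columns also contain paper.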

VRS AUTO-BRIGHTNESS

Without VRS

= Not readable!
= Repeat scanning for better result
= OCR will not be perfect

With VRS AUTO-BRIGHTNESS

= 100% readable!
= Get the best image!
= OCR will be perfect!
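The slides do not describe how VRS chooses its brightness setting, so the sketch below substitutes Otsu's method, a standard automatic thresholding technique, to show why an automatically chosen threshold can rescue a scan that a fixed setting ruins. The pixel values and function names are made up for illustration.

```python
def otsu_threshold(pixels):
    """Return the grey level t (0-255) that best separates pixels <= t from pixels > t."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_score = 0, -1.0
    weight_dark, sum_dark = 0, 0
    for t in range(256):
        weight_dark += hist[t]
        sum_dark += t * hist[t]
        if weight_dark == 0:
            continue
        weight_light = total - weight_dark
        if weight_light == 0:
            break
        mean_dark = sum_dark / weight_dark
        mean_light = (sum_all - sum_dark) / weight_light
        # Between-class variance (up to a constant factor).
        score = weight_dark * weight_light * (mean_dark - mean_light) ** 2
        if score > best_score:
            best_score, best_t = score, t
    return best_t


def binarize(pixels, threshold):
    """Map each grey pixel to 0 (black) if <= threshold, else 255 (white)."""
    return [0 if p <= threshold else 255 for p in pixels]


# A dark scan: 20 text pixels at grey level 30 on a dim background at level 90.
dark_scan = [30] * 20 + [90] * 80

fixed = binarize(dark_scan, 128)        # manual mid-grey setting
t = otsu_threshold(dark_scan)
automatic = binarize(dark_scan, t)      # threshold chosen from the image itself
```

With the fixed setting every pixel turns black and the text is lost. The automatic threshold falls between the text and the background, so only the 20 text pixels stay black.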

VRS EDGE THRESHOLDING

Without VRS (document stack)

Manual settings may not always give perfect results!
This one was highlighted with an orange marker and is not readable!

Perfect scanning
No missing text
Better than the original

Original
Without VRS, OCR result: leil7nologji
With VRS, OCR result: technology

-> VRS also gives excellent results in OCR

Important Features in PixEdit

(Version 7.11.18)

WHEN SHOULD I SCAN HARD COPY DOCUMENTS?

The NCL document is not available in electronic format.

I have a scanner.

Refer to NCL Guidelines Section 4.1, Scanning to PDF

SCANNING DOCUMENTS: INIS STANDARD

300 dots per inch (DPI)
400 DPI for small characters
B/W: TIFF CCITT Group 4 or JBIG2
Color images: JPEG
PLEASE DO NOT SCAN B/W PAGES in 24-bit color depth
First priority: scanning quality
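The reason for the warning about 24-bit color becomes clear from the raw image sizes. The sketch below encodes one reading of the standard above as a small helper and compares uncompressed sizes for an A4 page. The function names, the A4 dimensions and the assumption that the 400 DPI rule applies to both B/W and color pages are illustrative, not taken from the NCL Guidelines.

```python
import math

MM_PER_INCH = 25.4


def inis_scan_settings(is_color, small_characters=False):
    """Pick scan settings following the slide (hypothetical helper)."""
    dpi = 400 if small_characters else 300
    if is_color:
        return {"dpi": dpi, "bits_per_pixel": 24, "format": "JPEG"}
    # The slide also allows JBIG2 for B/W pages.
    return {"dpi": dpi, "bits_per_pixel": 1, "format": "TIFF CCITT Group 4"}


def uncompressed_bytes(width_mm, height_mm, dpi, bits_per_pixel):
    """Raw image size before compression, with each row padded to whole bytes."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    bytes_per_row = math.ceil(width_px * bits_per_pixel / 8)
    return bytes_per_row * height_px


bw = inis_scan_settings(is_color=False)
bw_size = uncompressed_bytes(210, 297, bw["dpi"], bw["bits_per_pixel"])
color_size = uncompressed_bytes(210, 297, 300, 24)
print(f"A4 at 300 DPI: 1-bit {bw_size} bytes, 24-bit {color_size} bytes")
```

Before compression, a 24-bit scan of an A4 page is 24 times larger than a 1-bit scan of the same page, and the extra data adds nothing to a black-and-white document.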

FIRST SCANNING TEST

Send an e-mail with a small test document to INIS.
Your test document will be analyzed, and INIS will tell you if you can continue to submit your NCL full text electronically.
If the scanning quality is not good enough, INIS will help you find the best settings for your scanner.

FUTURE

People live longer now.

Media too!

THANK YOU!