27
NLP in Archival Processing Donald Mennerich, NYU Libraries

NLP in Archival Processing

  • Upload
    others

  • View
    7

  • Download
    0

Embed Size (px)

Citation preview

Page 1: NLP in Archival Processing

NLP in Archival ProcessingDonald Mennerich, NYU Libraries

Page 2: NLP in Archival Processing

Scale

Page 3: NLP in Archival Processing
Page 4: NLP in Archival Processing
Page 5: NLP in Archival Processing

Forensics

Page 6: NLP in Archival Processing

<!-- plugin_process -->

<pronomPuid>x-fmt/391</pronomPuid>

<pronomFormatName>Exchangeable Image File Format (Compressed)</pronomFormatName>

<pronomSignatureName>EXIF Compressed Image 2.2</pronomSignatureName>

<pronomMimeType>image/jpeg</pronomMimeType>

<pronomMatchType>signature</pronomMatchType>

<pronomSignatureFileVersion>formats-v70.xml</pronomSignatureFileVersion>

<pronomContainerFileVersion>20130501.xml</pronomContainerFileVersion>

<fidoVersion>1.3.1</fidoVersion>

<identificationUuid>49050200-e308-4060-886a-14a8efd82078</identificationUuid>

<scanStatus>PASSED</scanStatus>

<clamAVVersion>ClamAV 0.98.1</clamAVVersion>

<virusScanUuid>6dea8d54-107a-43d0-a1fe-beb9c6bc4a21</virusScanUuid>

Page 7: NLP in Archival Processing
Page 8: NLP in Archival Processing
Page 9: NLP in Archival Processing
Page 10: NLP in Archival Processing

Scale

Page 11: NLP in Archival Processing
Page 12: NLP in Archival Processing
Page 13: NLP in Archival Processing
Page 14: NLP in Archival Processing
Page 15: NLP in Archival Processing
Page 16: NLP in Archival Processing
Page 17: NLP in Archival Processing

improvements

• Better infrastructure, distributed processing, machine Learning• Topic modeling, cluster analysis, document similarity• Visualizations• Integration with discovery, dissemination and access systems, Linked

open data

Page 18: NLP in Archival Processing

Beyond the obsolete…

Page 19: NLP in Archival Processing
Page 20: NLP in Archival Processing
Page 21: NLP in Archival Processing
Page 22: NLP in Archival Processing
Page 23: NLP in Archival Processing

NLP

Named entity extractionTopic modelingClusteringClassificationCollaborative filteringLanguage detection

Page 24: NLP in Archival Processing
Page 25: NLP in Archival Processing
Page 26: NLP in Archival Processing
Page 27: NLP in Archival Processing

Thanks.