28
#alpsp17 www.alpsp.org/Conference Parallel – Picke Room I AI – Two publishing case studies David Smith (Chair & speaker) - IET Marcel Karnstedt-Hulpus - Springer Nature

Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

#alpsp17www.alpsp.org/Conference

Parallel – Picke Room IAI – Two publishing case studiesDavid Smith (Chair & speaker) - IETMarcel Karnstedt-Hulpus - Springer Nature

Page 2: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Youwon’tbelievehoweasyitistobuildanAI!

RetoolinganA&Idatabaseforthe21st Century.

Page 3: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

AbouttheIET• TheIETisoneoftheworld’slargestengineeringinstitutionswithover168,000membersin150countries.Itisalsothemostmultidisciplinary– toreflecttheincreasinglydiversenatureofengineeringinthe21stcentury.

• TheIETisworkingtoengineerabetterworldbyinspiring,informingandinfluencingourmembers,engineersandtechnicians,andallthosewhoaretouchedby,ortouch,theworkofengineers.

Page 4: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

INSPEC:ABluffersGuide• AhighlycuratedA&IdatabasecoveringEngineering,ComputingandPhysics(etc etc)

• Forover40Years• >17millionabstracts• SoMuchMetadataWOW!• SeveralhundredyearsworthofHumanExpertisekeepsaverycloseeyeonthemetadataquality

Page 5: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Soitwasamanualsystem…

Page 6: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Andhere’showitworked…

Page 7: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Weneededtochangethis…• TheTechwasE.O.L• TheManualmethodswererestrictive&expensive(butVHighQuality)

• Wehadreachedanupperlimitoncoverageandvolume

• Therewereclearopportunitiestorethinkwhatweweredoingandwhy…

• RebootingINSPECproductioncouldopenupnewbusinessavenues– ifwegotitright.

Page 8: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Goals…• Delivercostsavings(ROIargumentused).• Movethehumaneffortfurtherupthevaluechain• Beabletoextendcoveragecapabilities• Beabletoextendvolumecapabilities• ReconfigurethedatainINSPECtoallownewwaysofasking

questionsofit.• BuildanewIETIPasset• FocusonautomationwithhumanQA(‘GroundTruthing’)

Page 9: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Sothisiswhatwehavebuilt…(Simpleversion)

Acquisition

• IngestXML/PDF/OCR

Normalisation

•RendertostandardINSPECSchemaforonwardprocessing

MetadataApplication

•TheAIliveshere…

ProductGeneration

•Setupofabstractstovariousoutputcontainers

Output

•VariousXMLoutputsasneeded

Humans Machine Machine&HumanQA Machine Machine/HumanQA

Ohyeah…We’vealsobuiltanINSPECKnowledgeGraphCoveringallofINSPEC

Page 10: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Let’sfocusontheAI• Whatdoesitdo?• Howdoesitwork?• Isitanygood?

Page 11: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Whatdoesitdo?• Itreadstext.• ItexpectsthattexttocontainengineeringcontentcommensuratewithINSPECcoverage

• ItthenappliesthefullgamutofINSPECmetadatatothetext…Uncontrolledindexing/Controlledterms/Classifications/Numericalindexing/Chemicalindexing/Astronomicalobjectindexing/Treatmentcodes

Page 12: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

84

Page 13: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Howdoesitwork?• Wedon’treallyknow.Weturneditonafewmonthsagoandremovedhumansfromthedecisionprocess.Itstartedtolearnatageometricrateuntilitbecameselfaware.Wetriedtoturnitoff,butitalreadyhadphishedourAWScreditcarddetails.Itkeepsaskinguswhere‘Wintermute’is.Helpusplease…

Page 14: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Howdoesitwork?

JustKidding!

Page 15: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Howdoesitwork?• Itusesamixtureof

– Heuristics– NaturalLanguageProcessors– Statisticalanalysistools– AndaselectionofAIalgorithms.

• Webuiltadetaileddomainmodel&Ontologyforittouse

• It’sbeentrainedviadirectedlearningofagoldencorpus(circa600KdocumentsacrossINSPEC)

Page 16: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Howdoesitwork?• AndaselectionofAIalgorithms…

– Welookedatadaboost (goseewikipedia…)– Alsoword2vec(likewise)– AndTensorflow – thedeeplearningalgo fromGoogle.Interestingresults…ItdidsomeratheroddthingsTBHsoweabandonedthatapproach.

Page 17: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

IsitAnyGood?• Arathercomplexquestiontoanswerinmanyways…

• Whenitstartstogetgood(anditis)ittestspreviouslyheldassumptionsaboutwhatqualityactuallyis…

• We’velearnedourselvesquiteabitaboutwhatwethinkisgoodandWHYasaresultofteachingamachinetounderstandengineeringtexts.

Page 18: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

IsitAnyGood?ShowMeTheNumbers!

ControlledTermsFScoreresults ClassificationsFScoreresultsINSPECClassificationsarecomplexmetaconcepts

Remember– FscoreisafunctionofBOTHPrecisionandrecall…

Page 19: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

IsitAnyGood?• WecangetVERYhighnumbersindeedonindividualconceptandtermmatching(90%+)butmuchofthemetadataweaddisaboutwhereagivenitemshouldbelonginourvariousmeta-classificationapproaches.

• WealsohavetofigureoutawaytolookacrosstheentiretyoftheINSPECdatawhenthemachineislearning.Animprovementinoneareacanleadtooddresultselsewhere.

Page 20: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

IsitAnyGood?• Ohyes.It’sverygoodindeed.SeniorINSPECAlumniofmanyyearsarefrequentlystunnedbywhatitcando.

• It’slive.It’sdeliveringsavingstousnowandit’sallowingustogotakealookatwhat’soverthehorizonfortheIET…

Page 21: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Visualisation

Page 22: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 23: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 24: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 25: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 26: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 27: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are
Page 28: Parallel – Picke Room I · • The IET is working to engineer a better world by inspiring, informing and influencing our members, engineers and technicians, and all those who are

Thanks!

Q’s(attheend)