rguha
NCATS, NIH IDG F2F, March 2017
Entity browsing (filterable & linked)
Search (full text, auto-suggest)
Detailed view of entities
Built on top of a robust REST API
An Interface to the KMC
Recent Updates
• Synced with latest TCRD
• Updated dossier functionality
• API documentation using Swagger
• Updates to KAS calculation and visualization
• IMPC landing page
• Revamped support and help documentation
• Extensive UI/UX updates
• Improved performance
• Published in Nguyen & Mathias et al., NAR, 2017
Current Status
[Figure: histogram of response time (s) against number of responses; median = 3.8 s]
[Figure: histogram of page size (Kb) against number of responses; median = 25.3 Kb]
191 facets
17.8 GB database
30 GB Lucene indexes
36K LoC (Java)
14K LoC (Scala)
Image available
Source code available
Pharos Usage
• Usage statistics over the past year are generally increasing
• 89K page views
• 14K sessions
• 7.5K users
Pharos Indexing
Now includes hits in partner databases such as KEGG and ChEMBL
Drug Target Ontology
TCRD
DISEASE
TIN-X
Interactions inside & outside the IDG
Target Audience
Biologists & Clinical Researchers
• Characterize & validate novel targets
• Identify key small molecules or biologics
Informatics Scientists
• Data mining
• Support target validation projects
Program Staff
• Explore the research landscape
• New directions for research & funding
Different Ways to Use Pharos
Random Access
Direct Access
Manual Interaction Programmatic Interaction
Search Entity Info
Precomputation converts analysis into browsing
Supporting Both Types of Users
• Efficient full text search, coupled to relevant auto-suggestion
• Primary entry point when exploring and for hypothesis generation
• Extensive list of facets
• Supports easy construction of complex filtering rules
• Extensive details for each target
• Linked to external and internal resources
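The idea of building complex filtering rules from facets can be sketched roughly as follows. This is only an illustration, not how Pharos implements it: the records, the field names (`tdl`, `family`), and the helper functions are hypothetical, and the rule semantics (OR within a facet, AND across facets) are an assumption.

```python
from collections import Counter

# Hypothetical target records; names and field values are made up for illustration.
TARGETS = [
    {"name": "T1", "tdl": "Tclin", "family": "Kinase"},
    {"name": "T2", "tdl": "Tdark", "family": "GPCR"},
    {"name": "T3", "tdl": "Tdark", "family": "Kinase"},
    {"name": "T4", "tdl": "Tclin", "family": "GPCR"},
]


def apply_facets(records, selections):
    """Keep records that match every selected facet.

    `selections` maps a facet name to the set of accepted values:
    values within one facet are ORed, separate facets are ANDed.
    """
    return [
        record
        for record in records
        if all(record.get(facet) in allowed for facet, allowed in selections.items())
    ]


def facet_counts(records, facet):
    """Count how many records fall under each value of one facet."""
    return Counter(record[facet] for record in records)


# Rule: (tdl is Tdark OR Tclin) AND (family is GPCR)
hits = apply_facets(TARGETS, {"tdl": {"Tdark", "Tclin"}, "family": {"GPCR"}})
names = [record["name"] for record in hits]
counts = facet_counts(hits, "tdl")
print(names)
print(dict(counts))
```

Recomputing the facet counts on the filtered result is what lets a facet panel show how many entities remain under each value after every selection.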
Visualization
• Key requirement for efficient exploration, summary
• Increase information density in limited screen real estate, take context into account
• Interactivity is desirable, high quality for easy inclusion in documents
• Simple is better than fancy but pretty pictures have value, make for a better experience
• Integrate and link to external visualization
• TinX, Harmonizome
Visualization Highlights
Visualization dashboard: filters appropriately represented, plots act as filters
Inline visualization to increase information density
Summary visualizations overlay multiple dimensions and can be context aware
Integrating External Visualization
Tclin, Kinase
Tdark, GPCR
Pharos
TinX
Updated Documentation
Entity Dossier
Multiple dossiers
Set operations
Visualization tools
Download
Longer term, dossiers will be automatically enriched with linked items and recommendations
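Set operations across multiple dossiers could look something like the sketch below. It treats a dossier as a plain set of target identifiers, which is an assumption; the identifiers and the `compare_dossiers` helper are hypothetical and do not reflect the Pharos data model.

```python
def compare_dossiers(first, second):
    """Return the basic set operations between two dossiers of target IDs."""
    return {
        "shared": first & second,       # targets present in both dossiers
        "combined": first | second,     # targets present in either dossier
        "only_first": first - second,   # targets unique to the first dossier
        "only_second": second - first,  # targets unique to the second dossier
    }


# Hypothetical dossiers; the identifiers are placeholders.
kinase_dossier = {"T1", "T2", "T3"}
disease_dossier = {"T2", "T3", "T4"}

result = compare_dossiers(kinase_dossier, disease_dossier)
for label, members in result.items():
    print(label, sorted(members))
```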
Dossiers as Context
Overlay data from targets in a dossier
Target Similarity
• Compute target similarity in “Harmonizome space”
• Supports recommendations, prioritization
• Currently extending to a generalized Target Knowledge Vector approach
Tdark targets whose most similar target is not Tdark
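As a rough sketch of the query above, each target can be represented as a numeric vector and compared with its nearest neighbour. The slides do not state which similarity measure is used, so cosine similarity here is an assumption, and the vectors, target names, and development levels are invented for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical feature vectors and development levels for four targets.
VECTORS = {
    "A": [1.0, 0.0, 1.0],
    "B": [1.0, 0.1, 0.9],
    "C": [0.0, 1.0, 0.0],
    "D": [0.1, 1.0, 0.0],
}
LEVELS = {"A": "Tdark", "B": "Tclin", "C": "Tdark", "D": "Tdark"}


def most_similar(name):
    """Return the other target with the highest cosine similarity to `name`."""
    others = (other for other in VECTORS if other != name)
    return max(others, key=lambda other: cosine(VECTORS[name], VECTORS[other]))


def dark_with_non_dark_neighbour():
    """Tdark targets whose most similar target is not Tdark."""
    return [
        target
        for target in VECTORS
        if LEVELS[target] == "Tdark" and LEVELS[most_similar(target)] != "Tdark"
    ]


candidates = dark_with_non_dark_neighbour()
print(candidates)
```

Targets returned by such a query are natural candidates for prioritization, since a better-characterized neighbour may suggest hypotheses about them.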
Outreach & Dissemination Activities
User Feedback Deployment
Webinars Documentation
NER API for targets & diseases
@idg_pharos
Recent papers to Pharos links via Tweets
The Long Term Vision
• Incorporate dependencies between data types to support inference and sophisticated filters
• From presentation to summarization
• Use explicit links & computational inference to generate (semi-)natural language summary using all known data
• Influenced by the query
• The result is a biological dashboard, customized for the user and the query
Target X has been implicated in 3 diseases related to skeletal, urological and nervous systems. It has been investigated in 5 in vitro assays and 2 in vivo assays. There are 4 compounds active against this target, 3 of which are in clinical trials.
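A summary like the one above could, in its simplest form, be produced by filling a sentence template from structured counts. The sketch below assumes a hypothetical record layout (`disease_count`, `systems`, `in_vitro`, and so on) and uses plain templating; the computational inference envisioned in the slides is out of scope here.

```python
def join_words(items):
    """Join words as 'a, b and c'."""
    items = list(items)
    if len(items) <= 1:
        return "".join(items)
    return ", ".join(items[:-1]) + " and " + items[-1]


def plural(count, noun):
    """Return '1 assay' or '5 assays' (naive English pluralization)."""
    return f"{count} {noun}" if count == 1 else f"{count} {noun}s"


def summarize(target):
    """Build a semi-natural language summary from a dict of counts."""
    verb = "is" if target["active_compounds"] == 1 else "are"
    return (
        f"{target['name']} has been implicated in "
        f"{plural(target['disease_count'], 'disease')} related to "
        f"{join_words(target['systems'])} systems. "
        f"It has been investigated in {plural(target['in_vitro'], 'in vitro assay')} "
        f"and {plural(target['in_vivo'], 'in vivo assay')}. "
        f"There {verb} {plural(target['active_compounds'], 'compound')} active "
        f"against this target, {target['in_trials']} of which are in clinical trials."
    )


# Hypothetical record whose field names are assumptions; values mirror the example text.
record = {
    "name": "Target X",
    "disease_count": 3,
    "systems": ["skeletal", "urological", "nervous"],
    "in_vitro": 5,
    "in_vivo": 2,
    "active_compounds": 4,
    "in_trials": 3,
}

summary = summarize(record)
print(summary)
```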
Feedback
• Explore the UI, try it, break it, and let us know what works and what doesn’t
• Are there data types and relations that would help you but are not available?
https://pharos.nih.gov
https://spotlite.nih.gov/[email protected]
@idg_pharos
Acknowledgements
• Dac-Trung Nguyen, Kyle Brinacombe, Timothy Sheils, Geetha Mandava, Noel Southall, Ajit Jadhav
• Steve Mathias, Oleg Ursu, Jeremy Yang, Christian Bologa, Daniel Canon, Tudor Oprea
• Nicholas Fernandez, Andrew Rouillard, Avi Mayan
• Finkbeiner lab, Tomita Lab
• Ajay Pillai, Aaron Pawlyk, Christine Colvis