18
Alternative ways to explore Clinical Data – graph visualisation Ed Cheetham, Principal Terminology Specialist

Alternative ways to explore Clinical Data – graph visualisation

  • Upload
    zofia

  • View
    41

  • Download
    0

Embed Size (px)

DESCRIPTION

Alternative ways to explore Clinical Data – graph visualisation. Ed Cheetham, Principal Terminology Specialist. Introduction. Considerable work is already going on in making complex multi-dimensional data accessible and understandable – some of this is distinctly ‘visual’. Use of e.g.: - PowerPoint PPT Presentation

Citation preview

Page 1: Alternative ways to explore Clinical Data –  graph visualisation

Alternative ways to explore Clinical Data – graph visualisation

Ed Cheetham, Principal Terminology Specialist

Page 2: Alternative ways to explore Clinical Data –  graph visualisation

Introduction

•Considerable work is already going on in making complex multi-dimensional data accessible and understandable – some of this is distinctly ‘visual’

Screenshots from:Enhancing Access to UK Renal Registry Data through Innovative Online Data Visualisations. Afzal Chaudhry, Terry FeestandUK Renal Registry Interactive Geographical Maps

Use of e.g.:ColourSizeLayout (x,y)Dynamic updating

Page 3: Alternative ways to explore Clinical Data –  graph visualisation

Is SNOMED CT just the ‘data’ in these visualisations?• If we want to see SNOMED CT we look at tables and browser

tree controls don’t we?

• Not necessarily...• Several browsers already have graphical features:

• CliniClue graphical view• SNOB IHTSDO ‘standard view’• IHTSDO workbench plugins

Tend to concentrate on individual concepts.What about sets?

Page 4: Alternative ways to explore Clinical Data –  graph visualisation

Sets, e.g. SNOMED CT Subsets

NHS Renal subset

Page 5: Alternative ways to explore Clinical Data –  graph visualisation

Subset subgraph - ZGRViewer

Zoomable

Dynamic highlighting

GraphViz layout

ZGRViewer: http://zvtm.sourceforge.net/zgrviewer.html (based on dot/GraphViz products: http://www.graphviz.org/)

Use does not indicate endorsement, but extremely valuable to illustrate points discussed.

Class count limits...

Page 6: Alternative ways to explore Clinical Data –  graph visualisation

Subset subgraph - Gephi

Node size – ‘level’ in graph

Force-directed layout

Dynamic labelling

Gephi: http://gephi.org/Use does not indicate endorsement, but extremely valuable to illustrate points discussed

Glomerulonephritis (disorder)

Kidney disease (disorder)

Malignant tumour of kidney (disorder)

Page 7: Alternative ways to explore Clinical Data –  graph visualisation

Subset compared to SCT corpus

Page 8: Alternative ways to explore Clinical Data –  graph visualisation

Areas of high and low density:

Neoplasm of kidney (disorder)

Infectious disorder of kidney(disorder)

Page 9: Alternative ways to explore Clinical Data –  graph visualisation

Refactoring greedy algorithm. From:Looking for ‘high concept density’:

• Rank each member:Actual number [of original set members subsumed]

(Potential - actual number) + n• Threshold:

‘Density’ must be > 1 for ‘compression’Can set lower threshold for ‘speculative analysis’

• n = ‘magnification factor’

Page 10: Alternative ways to explore Clinical Data –  graph visualisation

Refactoring greedy algorithm. To:

Desc

Self

Ref’d

Support

Self+Desc

Page 11: Alternative ways to explore Clinical Data –  graph visualisation

Remove referenced classes to simplify:

Self+Desc

Desc

Self

Ref’d

Support

Page 12: Alternative ways to explore Clinical Data –  graph visualisation

Application to frequency data:

Synthetic (but plausible) observation data based on data from Strathclyde Renal Electronic Patient Record and UK Renal Registry data. Thanks to: Colin Geddes, Keith Simpson, Afzal Chaudhry

Page 13: Alternative ways to explore Clinical Data –  graph visualisation

Application to frequency data:Node size – frequency values

Acute pyelonephritis (disorder)

Clear cell carcinoma of kidney(disorder)

Page 14: Alternative ways to explore Clinical Data –  graph visualisation

‘Speculative’ threshold:

Self+Desc

Desc

Self

Ref’d

Support

Page 15: Alternative ways to explore Clinical Data –  graph visualisation

‘Speculative’ threshold:

Desc

Self

Ref’d

Support

Self+Desc

Page 16: Alternative ways to explore Clinical Data –  graph visualisation

Including ‘new nodes’

Microangiopathic hemolytic anemia (disorder)

Pseudohypoaldosteronism (disorder)

Metabolic renal disease (disorder)

Desc

Self

Ref’d

Support

Self+Desc

Acute pyelonephritis (disorder)

Chronic renal impairment (disorder)

Glomerulonephritis (disorder)

Resize post-simplification

320 -> 150 categories

Page 17: Alternative ways to explore Clinical Data –  graph visualisation

Before and after ‘simplification’

90688005 36689008 14669001 29738008 236407003 433144002 127013003 46177005 63302006 431857002 191306005 77182004 302910002 30223300652254009 236403004 52777008 44785005 28728008 195353004 236574008 236435004 236448000 197764002 59400006 236500003 20018005 824131000000108236425005 236436003 34436003 16934004 23697004 266569009 197613008 433146000 48638002 80321008 43064006 236398000 197794008 1935100082525005 28637003 60926001 7163005 4390004 41729002 95570007 155441006 31054009 50711007 398064005 399072004 28560003 4006800816297002 20917003 68815009 236418003 28770003 15842009 1776003 847851000000101 23132008 236522005 48713002 75888001 89155008 23637900250581000 44323002 844701000000103 236443009 236499007 236439005 399340005 46956008 59479006 1426004 197659005 32278006 22830006 111395007204958008 204949001 236578006 274945004 236645006 236458001 36171008 236419006 51292008 7199000 76618002 68779003 111407006 19943007236411009 38481006 49120005 85020001 310670008 48796009 95579008 197579006 266556005 431856006 70650003 197601003 236421001 71275003128473001 22199006 2900003 197590001 236575009 373604002 80640009 90241004 239932005 254915003 254921004 36402006 60700002 61758007197919005 236409000 236710009 444794000 840451000000102 236380004 236506009 3321001 236461000 236503001 236505008 81475007 236428007 23659300524790002 302849000 32659003 399326009 65443008 79385002 15938005 190681003 197808006 237885008 268232000 399068003 431855005 561900461165007 86406008 236416004 236447005 236587002 25341005 363518003 36388008 37085009 61852001 62332007 65520001 7725007 78129009186235002 236373001 236399008 236413007 236415000 236429004 236452000 236466005 236502006 236576005 236577001 239928004 253878003 27440100537061001 40095003 4676006 47330001 54879000 66993009 76521009 85324003 126880001 12897005 13886001 17901006 20342001 23205900023402009 236444003 236496000 236706006 236711008 236713006 239927009 253880009 28689008 32093003 33763006 51055000 71110009 7737700179509009 82236004 852781000000103 86347007 92697000 102455002 10406007 15689008 16652001 18417009 190829000 190919008 197650009 197744007226309007 236015007 236017004 236384008 236385009 236488005 236514003 236527004 237604008 237743003 237770005 239947001 240096000 253864004253879006 253900005 253904001 267441009 282364005 290006 307604008 30911005 31742004 3188003 35546006 35759001 367528006 3704008373420004 373421000 373422007 373584008 373585009 37497004 38898003 40733004 408642003 40951006 420279001 421893009 42399005 42825500443941006 439990003 444558002 444645005 444690001 444699000 444717006 444749006 444830001 444976001 444977005 445108007 445119005 445236007445387007 445388002 445404003 445431000 445503007 446449009 446641003 446989009 446991001 447335007 45582004 46659004 46785007 516770005187006 519331000000100 519481000000106 519491000000108 519501000000102 519511000000100 519521000000106 519581000000107 522551000000101 522591000000109 522601000000103 523411000000105 53298000 5355600263247009 66451004 73305009 800991000000107 80710001 82275008 828971000000101 83866005 90505000 90708001 95566004

236425005 36689008 36171008 14669001 29738008 46177005 127013003 63302006302233006 236448000 236514003 191306005 52254009 95566004 236403004 4306400628728008 236574008 195353004 20018005 82525005 824131000000108 34436003 1693400440095003 266569009 48638002 28637003 60926001 7163005 41729002 15544100650711007 398064005 399072004 28560003 15842009 399340005 1776003 84785100000010123132008 48713002 190681003 89155008 44323002 111395007 253882001 23643900546956008 32278006 48796009 76618002 204949001 51071000 236458001 236578006274945004 106000008 236645006 49120005 38481006 363518003 3321001 512920087199000 236526008 32659003 19943007 85020001 188250002 95579008 71275003128473001 22199006 2900003 197590001 373604002 90241004 236380004 236502006239932005 254921004 60700002 61758007 197919005 236461000 444794000 42825500424790002 399326009 65443008 15938005 237885008 399068003 5619004 86406008236587002 7725007 237743003 240096000 30911005 445431000 446449009 51958100000010780710001 226309007 236488005 237770005 522591000000109 53298000 77098009 92697000102455002 10406007 126880001 13886001 190919008 197744007 232059000 236527004236706006 237604008 253900005 253904001 28689008 307604008 31742004 35759001367528006 40733004 408642003 42399005 445108007 445236007 446641003 4558200446659004 51677000 5187006 522601000000103 523411000000105 63247009 77377001 8223600482275008 86347007 90505000 90708001

320 categories90% covered by 1st 50 categories• Chronic renal failure syndrome (disorder)• Acute pyelonephritis (disorder)• Acute renal failure syndrome (disorder)• Proteinuria (finding)• IgA nephropathy (disorder)• Chronic kidney disease stage 3 (disorder)• Diabetic renal disease (disorder)

150 categories90% covered by 1st 22 categories• Chronic renal impairment (disorder) [30]• Acute pyelonephritis (disorder)• Glomerulonephritis (disorder) [92]• Acute renal failure syndrome (disorder)• Proteinuria (finding)• End stage renal disease (disorder) • Diabetic renal disease (disorder)

Page 18: Alternative ways to explore Clinical Data –  graph visualisation

Conclusions

• Established visualisation techniques can be applied to SNOMED CT data [reference and instance]

Exploratory and explanatory stagesWhat does the data ‘look like’What does that processing step ‘do’ to the data?

• Final views may be more ‘familiar’ (pie charts, summary tables)Competition for available axes!

• Need experience re. optimum contribution to the analytic process

Flexible and configurable toolingImaginative participantsStandards