44
Managing a Data Catalog Promoting Data Reuse and Collaboration at an Academic Medical Center Nicole Contaxis, Project Coordinator Ian Lamb, Solutions Developer

Contaxis Lamb Managing a Data Catalog

Embed Size (px)

Citation preview

Page 1: Contaxis Lamb Managing a Data Catalog

Managing a Data CatalogPromoting Data Reuse and Collaboration at an Academic Medical Center

Nicole Contaxis, Project CoordinatorIan Lamb, Solutions Developer

Page 2: Contaxis Lamb Managing a Data Catalog

2

Institutional Structure

Page 3: Contaxis Lamb Managing a Data Catalog

3

Meeting User NeedsHeavy users of large, external datasets (e.g. Census, national health surveys, Medicare)

Department of Population Health

Lack of knowledge about institutional licenses

Difficulty accessing datasets

Difficulty working with datasets

Page 4: Contaxis Lamb Managing a Data Catalog

Presentation Title Goes Here 4

NYU Data Catalog

Page 5: Contaxis Lamb Managing a Data Catalog

5

NYU Data Catalog Home Page

•Text starts here

Page 6: Contaxis Lamb Managing a Data Catalog

6

NYU Data Catalog Home PageSearch

•Text starts here

Page 7: Contaxis Lamb Managing a Data Catalog

7

NYU Data Catalog Home PageFilter

•Text starts here

Page 8: Contaxis Lamb Managing a Data Catalog

8

Record Details - External Datasets

Page 9: Contaxis Lamb Managing a Data Catalog

9

Record Details - External DatasetsLocal Experts

Page 10: Contaxis Lamb Managing a Data Catalog

10

Record Details - External DatasetsAccess Instructions

Page 11: Contaxis Lamb Managing a Data Catalog

11

Record Details - External DatasetsPubMed Search

Page 12: Contaxis Lamb Managing a Data Catalog

12

Record Details - Internal Datasets

Page 13: Contaxis Lamb Managing a Data Catalog

13

Record Details - Internal DatasetsAuthors

Page 14: Contaxis Lamb Managing a Data Catalog

14

Record Details - Internal DatasetsAccess Instructions

Page 15: Contaxis Lamb Managing a Data Catalog

15

Record Details - Internal DatasetsAssociated Publications

Page 16: Contaxis Lamb Managing a Data Catalog

16

Record Details External Internal

•Text starts here

Page 17: Contaxis Lamb Managing a Data Catalog

17

Record Details External Internal

Page 18: Contaxis Lamb Managing a Data Catalog

18

Record Details External Internal

Page 19: Contaxis Lamb Managing a Data Catalog

19

Record Details External Internal

Page 20: Contaxis Lamb Managing a Data Catalog

Presentation Title Goes Here 20

Metadata: Strategy over Purity

Page 21: Contaxis Lamb Managing a Data Catalog

21

Common Metadata Elements from Biomedical Repositories

Page 22: Contaxis Lamb Managing a Data Catalog

22

General Metadata Schemas Consulted

DCAT

Page 23: Contaxis Lamb Managing a Data Catalog

23

Matching External Efforts

Page 24: Contaxis Lamb Managing a Data Catalog

Translating Form into Function

Our carefully selected metadata model needed to become a usable application

24

Page 25: Contaxis Lamb Managing a Data Catalog

Goals

•Faithfully reproduce metadata schema specified by our librarians

•Enable easy maintenance of any items that will need to be updated often in the future

•Make sure all forms and user interfaces help rather than hinder the ongoing maintenance of a growing collection

The best way to meet these goals was not the easiest way…

25

Page 26: Contaxis Lamb Managing a Data Catalog

26

Page 27: Contaxis Lamb Managing a Data Catalog

27

Page 28: Contaxis Lamb Managing a Data Catalog

28

Page 29: Contaxis Lamb Managing a Data Catalog

29

Page 30: Contaxis Lamb Managing a Data Catalog

30

Page 31: Contaxis Lamb Managing a Data Catalog

31

Page 32: Contaxis Lamb Managing a Data Catalog

Goals

•Faithfully reproduce metadata model specified by librarians

•Enable easy maintenance of any items that will need to be updated often in the future

•Make sure all forms and user interfaces help rather than hinder the ongoing maintenance of a growing collection

32

Page 33: Contaxis Lamb Managing a Data Catalog

Help, don’t Hinder, the Maintainers

Ease of use = clean data

•Enable user to easily refer back to previous fields

•To avoid messy data, discourage us from adding items that may already exist (i.e. don’t let us add “J Doe” if “John Doe” is already in the system)

•If we do have to add a new metadata “entity,” we shouldn’t lose all the progress we’ve made entering this dataset record

33

Page 34: Contaxis Lamb Managing a Data Catalog

34

Page 35: Contaxis Lamb Managing a Data Catalog

35

Discouraging Duplicates

Page 36: Contaxis Lamb Managing a Data Catalog

Adding New Items Without Getting Lost

New metadata items can be added to the system without leaving this form that you’ve spent the last hour on

36

Page 37: Contaxis Lamb Managing a Data Catalog

Ease of Use = Clean Data

If your system is difficult to use, no

one will want to use it

37

Page 38: Contaxis Lamb Managing a Data Catalog

Presentation Title Goes Here 38

Processing Internal & External Datasets

Page 39: Contaxis Lamb Managing a Data Catalog

39

External Dataset

Page 40: Contaxis Lamb Managing a Data Catalog

40

Internal Dataset

Page 41: Contaxis Lamb Managing a Data Catalog

Presentation Title Goes Here 41

Make Your Own: Code and Documentation Availability

Page 42: Contaxis Lamb Managing a Data Catalog

42

Documentation on OSF

https://osf.io/vg7rn/

Page 43: Contaxis Lamb Managing a Data Catalog

43

Code on GitHub

https://github.com/nyuhsl/data-catalog

Page 44: Contaxis Lamb Managing a Data Catalog

Presentation Title Goes Here 44

Questions?

Contact:Data Services Team NYU Langone Medical [email protected]