59
Hussein Suleman [email protected] University of Cape Town Department of Computer Science Centre for ICT for Development Digital Libraries Laboratory April 2016 Digitally Preserving African Heritage

Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Embed Size (px)

Citation preview

Page 1: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Hussein [email protected]

University of Cape TownDepartment of Computer ScienceCentre for ICT for Development

Digital Libraries Laboratory

April 2016

Digitally Preserving AfricanHeritage

Page 2: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why am I here? To talk about Digital Libraries/Preservation. To share some research findings. To collaborate and develop research links. To inspire you to think differently. To convince you to preserve our heritage!

Pre-Intro

Page 3: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What is the Digital Libraries Lab? Research in technologies for research and

education, specifically digital libraries: African language search engines, machine translation cultural heritage preservation technology for education

Teaching 2 staff supervising about 20 MSc and PhD students

Advocacy a little wherever we can

Collaboration industry, academic, (govt?)

Pre-Intro

Page 4: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

Mapungubwe Collection University of Pretoria

What Why How Case Study Open Issues

Page 5: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

Timbuktu Manuscriptshttp://www.timbuktufoundation.org/Manuscripts/index.htm

What Why How Case Study Open Issues

Page 6: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

Kirby Collectionhttp://web.uct.ac.za/depts/sacm/kirby.html

What Why How Case Study Open Issues

Page 7: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

Digital Imaging South Africahttp://www.disa.ukzn.ac.za/

What Why How Case Study Open Issues

Page 8: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

Bleek and Lloyd Collectionhttp://www.lloydbleekcollection.uct.ac.za/

What Why How Case Study Open Issues

Page 9: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

UPSpaceUniversity of Pretoria

What Why How Case Study Open Issues

Page 10: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Should We Preserve?

What Why How Case Study Open Issues

Page 11: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why An African Perspective?Urgency

Some documents and storage media arerapidly deteriorating.

Some storytellers are the last in theirgenerations.

What Why How Case Study Open Issues

Page 12: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why An African Perspective?Rewriting History

We now know there were powerful ancientcivilizations all over Africa. Colonial governments suppressed this information

for centuries!

History must be preserved – what littleevidence we have left.

What Why How Case Study Open Issues

Page 13: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why An African Perspective?Skills and Education

Typical archivists are not as highly skilled ascounterparts elsewhere.

Digital media is still not the norm. Education levels of general population hinders

preservation – end-user data curation is verydifficult.

What Why How Case Study Open Issues

Page 14: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why An African Perspective?Funding

Typically, there is little. Many preservation projects are funded by

external agencies, but with restrictions on dataaccessibility.

There is a desperate need to do more withless.

What Why How Case Study Open Issues

Page 15: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Why An African Perspective?Internet Bandwidth(Digital Divide)

Non-existent in some places and pooreverywhere else.

Preservation projects designed for highbandwidth are not suitable.

All online solutions must be bandwidth-friendly.

What Why How Case Study Open Issues

Page 16: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Is Africa Special? Definitely NOT!

The same problems are faced by some othercommunities.

Many communities face some of the problems. Most communities can benefit from solutions to

these problems.

What Why How Case Study Open Issues

Page 17: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solutions: Lightweight and Reusable Systems Simplicity

XML

Minimalist Archives.

Metadata management using office suite.

Multi-purpose software tools (repositories).

Shared skills in common tools, e.g., DSpace

What Why How Case Study Open Issues

Page 18: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solutions: Bandwidth Collections accessible over CD/DVD-ROM, local

drives, network, etc. Preservation by copying.

Static collections rather than dynamic. Preserve files instead of services.

Minimal bandwidth use e.g., using AJAX

What Why How Case Study Open Issues

Page 19: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solutions: Experience Recreation Storytelling in virtual environments.

Low-cost virtual environments.

Virtual recreation of historical districts.

What Why How Case Study Open Issues

Page 20: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solutions: Basic Digitization Scan documents. Take photographs. Take 3D laser scans. Record audio.

Build digital libraries / archives to preserve.

Share and reuse information.

What Why How Case Study Open Issues

Page 21: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What is a Digital Library: Example 1/3

What Why How Case Study Open Issues

Page 22: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What is a Digital Library: Example 2/3

What Why How Case Study Open Issues

Page 23: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What is a Digital Library: Example 3/3

What Why How Case Study Open Issues

Page 24: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Typical DL Services User Management: accounts, auth, profile Searching: info retrieval, Google, indexing Browsing: categories, classification, subsets Submission: explicit/harvested/crawled Review: quality, workflow Annotation: reviews, ratings, discussions Recommendation: suggestions, collab

filtering

What Why How Case Study Open Issues

Page 25: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Case Study: Bleek and Lloyd Collection 1/2 Books and drawings

documenting now-extinct culture of|xam and !kun(Khoi-San?) groups.

Documented byWilhelm Bleek, LucyLloyd and others inlate 1800s in CapeTown.

~20000 pageimages

What Why How Case Study Open Issues

Page 26: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Case Study: Bleek and Lloyd Collection 2/2 ~800 drawings

On UNESCO Memoryof the World register.

Curated byUCT/NLSA/Iziko-SAM/UNISA/…

Digital preservationfunded by Mellon, ledby Michaelis School ofFine Arts, UCT

What Why How Case Study Open Issues

Page 27: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Bleek and Lloyd Core Requirements Make the collection accessible as widely as

possible: Over the Web, Off a CDROM, Off a network-shared drive, Etc.

Platform independence (Mac/XP/Linux/etc.). Low barrier to use. Standards-compliance.

What Why How Case Study Open Issues

Page 28: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Option 1: Greenstone Greenstone, a digital

library tool, createsstandalone CDROMcollections.

It still requires softwareinstallation.

It does not work on ALLplatforms.

What Why How Case Study Open Issues

Page 29: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Option 2: XSL-FO XSL-FO can be used to

create hyper-linked staticPDFs, like books.

Does not work for largebooks. PDF file sizes increase

dramatically…

What Why How Case Study Open Issues

Page 30: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solution 1: XML + XSLT XHTML Encode all descriptive information using XML.

Write XSL transformations to convert the XMLinto multiple formats, each corresponding to anHTML page view.

Needs advanced XSLT techniques to deal withsize of data.

What Why How Case Study Open Issues

Page 31: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Solution 2: in-Browser Services

What Why How Case Study Open Issues

Page 32: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 33: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 34: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 35: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 36: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 37: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 38: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 39: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 40: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 41: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 42: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 43: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 44: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 45: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 46: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 47: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What Why How Case Study Open Issues

Page 48: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Principles of DL for African Heritage Efficient bandwidth use Advanced technology Appropriate technology Local relevance Modernization instead of Africanization Global applicability of solutions Minimalism of staff/money Multicultural/multilingual inclusivity

What Why How Case Study Open Issues

Page 49: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

More Case Studies in Heritage DLs

alternative archive infrastructures cloud computing multilingual IR heritage preservation visual dictionaries rock-art exploration using mobile device mobile |Xam input

recent research at UCT

What Why How Case Study Open Issues

Page 50: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Cloud-based Archives individual services and whole

archives in private clouds

install locally reduces need for skilled staff instant archives shared resources automatic scalability

acceptable performance, aftercache priming

user studies in progressLebeko Poulo, LesothoMushashu Lumpa, Zambia

What Why How Case Study Open Issues

Page 51: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

MultilingualInformationRetrieval

search queries withmultiple languages

current systems biased toone language

rerank documents byunderstanding query andreweightinglanguages/results

better quality resultsfound, higher up inresultsMohammed Mustafa Ali, Sudan

What Why How Case Study Open Issues

Page 52: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

DocumentTranscription:Bleek and LloydStories

crowdsourcedtranscription application

volunteers to convertimages to text

automated algorithms tocheck and assess quality

interactive Web interfacefor users to enter text

10% better than AIapproaches!

Ngoni Munyaradzi, Zimbabwe

What Why How Case Study Open Issues

Page 53: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

LanguagePreservation:Online |Xamdictionary

visual dictionary of |Xamlanguage

simple archive foundation client-side processing as

far as possible linked into Bleek and

Lloyd

Kyle Williams, South Africa

What Why How Case Study Open Issues

Page 54: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

simplyCT simple archive

architecture

performance understandability flexibility applicability

good performance forsmall to mediumcollections

easy to use andexpandPhiri Lighton, Zambia

What Why How Case Study Open Issues

Page 55: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

What We Have Learnt Digital preservation in Africa has special

problems.

But all problems can be addressed adequatelywith appropriate and innovative use of currenttechnology.

What Why How Case Study Open Issues

Page 56: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Future Challenges Scalability of preservation efforts.

How to create similar collections easily? A national heritage archive?

Tools for management and dissemination? Extend Greenstone?

What Why How Case Study Open Issues

Page 57: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

Future Challenges Standard tools to manage metadata/data?

Usability Scalability Extensibility System independence

Many current repository tools better suited topapers and not heritage collections.

What Why How Case Study Open Issues

Page 58: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

The Future – Past Preserved

What Why How Case Study Open Issues

Page 59: Digitally Preserving African Heritage - University of Cape ...pumbaa.cs.uct.ac.za/~hussein/2016/uwc_2016/uwc_2016_dlandherita… · Digital Libraries Laboratory April 2016 Digitally

That’s all Folks!

direct all questions and comments to:[email protected]

Facebook/slumouTwitter@slumou