viii encuentros de centros de documentación de arte contemporáneo en artium - eero hyvönen

81
1 Publishing and Using Cultural Heritage Linked Data on the Semantic Web Documentation Congress 2016, Vitoria-Gasteiz, Spain Eero Hyvönen, Prof., Director Aalto University and University of Helsinki Heldig – Helsinki Centre for Digital Humanities http://heldig.fi Semantic Computing Research Group (SeCo) http://www.seco.tkk.fi/

Upload: artium-vitoria

Post on 16-Jan-2017

139 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

1

Publishing and Using Cultural Heritage Linked Data

on the Semantic Web

Documentation Congress 2016, Vitoria-Gasteiz, Spain

Eero Hyvönen, Prof., DirectorAalto University and University of Helsinki

Heldig – Helsinki Centre for Digital Humanitieshttp://heldig.fi

Semantic Computing Research Group (SeCo)http://www.seco.tkk.fi/

Page 2: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

2

Contents

• Background: History 2002-2016• Vision: Semantic Web of Cultural Heritage• Challenges: Content Complexity & Production• Solution: Linked Data Publishing Model ”Sampo”• Realization: Three Sampo Applications

Page 4: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

4

History behind this talk Semantic Portals for Cultural Heritage

– 2004 MuseumFinland – Finnish Museums on the Semantic Web

» http://www.museosuomi.fi – 2008 CultureSampo – Finnish Culture on the Semantic Web 2.0

» http://www.kulttuurisampo.fi – 2011 BookSampo – Fiction Literature on the Semantic Web

» http://www.kirjasampo.fi – 2012 TravelSampo -- Mobile Contextualized Services of Cultural

Tourism» http://www.travelsampo.fi

– 2015 WarSampo – Finnish World War II on the Semantic Web» http://www.sotasampo.fi

Page 5: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

5

Ontology and Data Services – 2009 National Ontology Library Service ONKI

» http://onki.fi – 2014 ONKI.fi -> Finto.fi of the National Library

» http://finto.fi – 2014 Linked Data Finland Data Service & Tools

» http://ldf.fi – 2016 Finnish Ontology Service for Historical Places and Maps

» http://hipla.fi Publications available at:

– http://www.seco.tkk.fi/publications/

Page 6: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

6

• Over 30 resarchers at SeCo including Eetu Mäkelä, Tomi Kauppinen, Jouni Tuominen, Kim Viljanen, Tuukka Ruotsalo, Suvi Kettula, Kaisa Hypen, Erkki Heino, Petri Leskinen, Minna Tamper, Esko Ikkala, Mikko Koho, …

• Some 50 organizations involved

Joint Work During 2002-2016

Antikvaria-ryhmä

Page 9: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

9

Wouldn’t it be nice if

cultural organizations could publish easily their contents together on the web just by pushing a button,

the contents would be automatically linked with other publishers contents and get semantically enriched,

researchers and citizens could contribute with their own contents and knowledge,

Page 10: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

10

---

contents could be accessed easily from different thematic perspectives and contexts,

intelligent search and browsing systems could find answers to questions in addition to data records,

the aggregated contents and services could be reused in external applications easily,

language barriers could be overcome?

Page 18: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

18

Biographical Registries Collect Data about Persons

henkilö nimi ammatti syntymapaikka ...H1 Akseli Gallen-Kallela taiteilija LemuH2 Gustaf Mannerheim marsalkka Askainen

...

H1

Lemu

ArtistPerson

”Akseli Gallen-Kallela”

H2

Askainen

Marshall

”Gustaf Mannerheim”

type

type

name

nanme

profession

profession

birthPlace

birthPlace

Biography Center

Person Name Profession Birth Place

Page 19: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

19

Art Museum Catalogs Paintings

...

T1

1929

Painting

creator

time

type

”Gustaf Mannerheim”nimi

subject

name”Akseli Gallen-Kallela”

teos nimi tekijä aika aihe ...T1 Mannerheimin muotokuva Akseli Gallen-Kallela 1929 Gustaf MannerheimT2 Aino-triptyykki Akseli Gallen-Kallela 1891 Aino, Kalevala

...

Art Museum Collection

Page 20: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

20

Land Survey Organizations Know Places

Varsinais-Suomen lääni Finland

Askainen

Lemu

Turku

kunta lääniAskainen Varsinais-Suomen lääniHelsinki Uudenmaan lääniLemu Varsinais-Suomen lääniTurku Varsinais-Suomen lääni...

part-ofpart-of

part-of

part-of

County

type

Province

type...

type Land Survey

Page 21: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

21

Ontologies are Developed by Semantic Web Researchers

ArtistPerson

Marshall

Painting

Concept

Endurant

Place

Profession CountysubClassOf

TimePeriod

AbstractPerdurant

PhysicalObject

Province

KOKO-ontologySubclass Hierarchy

FinnONTO

subClassOf

subClassOf

subClassOf

subClassOf

Page 22: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

22

RDF Connects and Harmonizes Linked Data into a GGG

H1

Lemu

ArtistPerson

”Akseli Gallen-Kallela”

H2

Askainen

Marshall

”Gustaf Mannerheim”

type

type

name

name

profession

profession

birthPlace

birthPlace

T1

1929

maalaus

tekijä

aiheaika

tyyppi

Varsinais-Suomen lääni Finland

Turku

part-of part-of

part-of part-of

Concept

Endurant

Place

Profession County

type

type

type

subClassOf

subClassOf

subClassOf

subClassOf

yläluokka

Time

subClassOfA bstractPerdurant

PhysicalObject

Province

yläluokka

...

PortalTriplestore

Serendipity: 1+1 > 2

Page 23: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

23

Why is This Useful?Limitations of Non-semantic Data

• NBA-H26069-467 :object ”cup and plate” ; :material ”porcelain” ; :creationPlace ”Germany” ; :creator ”Meissen”.

• This metadata cannot answer the following queries/questions:– Find all vessels?– Find all ceramic products?– Find artifacts manufactured in Europe?– Does the city of Meissen manufacture ceramics?

Page 24: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

24

Semantic Web Solution:Understanding the Ontological Context

NBA-H26069-467 :object ”cup and plate” ; :object_concept object:cup ; :object_concept object:plate ;

:material ”porcelain” ; :material_concept object:porcelain ;

:creationPlace ”Germany” ; :creationPlace_concept place:Germany ;

:creator ”Meissen” :creator_concept actor:Meissen .

NBA-H26069-467

place:Germany

object:cup

creationLocation_concept

place:Europe

loc:partOf

rdfs:subClassOf

object:vessel

object_concept

object_conceptobject:plate

rdfs:subClassOf

...

...

...

Find all vessels?Find all ceramic products?Find artifacts manufactured in Europe?Does the city of Meissen manufacture ceramics?

object ontology

place ontology

actor ontologymaterial ontology

place:Meissen

actor:Meissen

material:porcelain

material_conceptmaterial:ceramic

Page 30: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

30

Content Production System- Model- Standards & Best Practices- Ontology & Data Services- Annotation and other tools

Content Infrastructure- W3C etc. standards- Ontology Infrastructure - Metadata schemas - Domain ontologies- Linked Datasets

Portal - Humans: user-interfaceData Service- Machines: AJAX widgets, REST, Web Services, SPARQL

Cultural HeritagePortal System

The Components of a Semantic Portal

Page 33: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

Three Case Studies Using the Sampo Model

CultureSampoBookSampoWarSampo

Page 39: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

39

CultureSampo Content Providers (28+) :museums, libraries, archieves, researcher organizations, media companies + citizens

International Content Providers1 Geonames2 Google (Maps)3 Iconclass (vocab.)4 Panoramio5 Paul J. Getty Foundation (vocab.)6 Wikipedia

Finnish Content Providers1 Agricola – Suomen historiaverkko2 Espoon kaupunginmuseo3 Helsingin kaupunginkirjasto4 Hiihtomuseo5 Jyväskylän yliopisto, musiikin laitos6 Kansallisbiografia7 Kansallismuseo8 Kuopion kulttuurihistoriallinen museo9 Laatokan-Karjalan museo

10 Lahden kaupunginmuseo11 Museovirasto12 Pohjois-Karjalan museo13 Radio- ja TV-museo14 Seurasaaren ulkomuseo15 Suomalaisen Kirjallisuuden Seura SKS16 Suomen maatalousmuseo Sarka17 Suomen merimuseo18 Taideteollisen korkeakoulun kirjasto19 Valtion taidemuseo20 Veljekset Karhumäki Oy21 Viipurin historiallinen museo22 Yleisradio Oy

Thanks for cooperation!

Page 40: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

40

Metadata SchemasMetadata schema Content type

1 artifact artifacts2 art paintings, sculpture, drawings, abstract art3 literature novels, short stories, comics4 WWW page WWW pages5 poetry 3 subtypes of poetry6 fictive object places and persons in Kalevala 7 folk music 5 subtypes of folk music8 photograph photographs9 aerial photograph aerial photos

10 actors persons, organizations11 biography biographies12 historical event historical events13 skill cultural process descriptions14 video documented processes15 built objects buildings etc. in nature16 archeological sites archeological sites

Page 41: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

41

Data Alignment Principles Used Dublin Core like metadata schemas

– Element subproperty-of hierarchies– Dump-down principle

Harmonized element values– Taken from large shared domain ontologies– Objects, Actors, Places, Actions, ...

Page 42: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

42

Events and Narratives as Semantic Glue

Events and narratives make cultural heritage alive!– Historical events

» Finnish history ontology– Events and processes of intangible cultural heritage

» Farming, arts & craft, …– Events in stories

» Semantic Kalevala (Finnish national Epic)

Page 43: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

Preserving Intangible Cultural Heritage:Cataloging Boot Making Process

Espoolainen Onni Wirlander valmistaa saappaat

Page 46: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

46

Narrative Semantic Web- Case Semantic Kalevala

National epic of Finland Compiled by Elias Lönnrot from

a vast collection of folk poems Publication

– 1835 ”Old Kalevala”– 1849 ”New Kalevala”

» 50 poems, 22 795 lines– Translated into some 60 languages since 1841

Semantic Kalevala– First translation in a ”machine” language (RDF)!

Page 48: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

48

Semantic Kalevala online:The computer ”understands” the national epic Kalevala

Semantically annotated 50 poems of the national epic

- Events and narratives

Translation into modern Finnish

Links to related art etc.

Page 49: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

49

CultureSampo RDF Knowledge Base (March 17, 2009)

Metadata– 134,000 cultural collection items (artifacts, books, videos etc.)– 285,000 other resources (places, persons etc.)– 204 property types in metadata

Ontologies– KOKO ontologies (ca. 37,000 concepts)– Additional international vocabularies

» AAT, ULAN, Iconclass– 253 property types in ontologies

Size– 11,4 million triples

» 2,7 million triplets» 8,7 million additional reasoned triplets

Page 53: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

Nine Thematic Perspectives into Cultural Heritage

Three languages

Nine perspectives

Lately commented items

Lately viewed items

Page 62: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

62

CultreSampo in Short

1. Highly cross-domain: 26 content types and 16 metadata schemas

2. Sophisticated semantic annotation models including events and processes

3. Semantic search and recommending techniques4. Versatile selection of semantic visualizations (map views,

timelines, graphs, process visualization, semantic video viewing)

5. Based on a large nation wide collaboratively maintained infrastructure of ontologies and ontology services

6. Includes models of and tools for collaborative semantic content creation

7. Services are available for machines, too.

Page 64: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

64

BookSampo – Finnish Fiction Literature on the Semantic Web Why

– Helping library customers in finding literature and related content How

– Semantic annotation of all Finnish fiction literature (for adults) Who

– Finnish Public Libraries and FinnONTO project When

– In use since 2011 (50 000+ monthly users)» http://www.booksampo.fi

(Mäkelä et al., IFLA 2012, SWJ 2012)

Page 67: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

Key Idea: Data Services Supporting Linked Data in Applications

BookSampo LD

CultureSampo LDAPI

API

YOUR NEXTAPPLICATION

API

LDF.fi SPARQLEndpoint

Page 74: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

74

Semantic Portal http://sotasampo.fi/en 7 Perspectives to War

More info: [Hyvönen et al., ESWC 2016; Koho et al., WHiSe 2016]

In-use semantic portal 2015Nearly 20 000 end-users during first 3 days

1. Events 2. Persons 3. Army Units

4. Places 5. Deaths 6. Memoirs

7. Photos

Page 78: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

78

Conclusions: Semantic Web Makes a Difference

End-user’s perspective– Global view to heterogeneous, distributed contents– Automatic content aggregation– Semantic search– Semantic browsing and recommendations– Other intelligent services (knowledge discovery, personalization,

visualization, …) Content publisher’s perspective

– Distributed content creation– Enriching each other’s contents semantically– Automated link maintenance– Shared content publication channel – Reusing aggregated content in other applications

Page 79: VIII Encuentros de Centros de Documentación de Arte Contemporáneo en Artium - Eero Hyvönen

79

But the Lunch is not Free

More collaboration is need -> complicates work Integration of semantic portals with legacy systems Manual annotations are costly and may not scale up Automatic annotation lowers data quality