Team: Digital “Terroirs”
Team Leader: Adrien Di Mascio
Team Members: Romain Wenz, Vincent Michel, Agnès Simon
“Find digital resources about the French countryside”
Disclaimer: This “challenge team” also works on the data.bnf.fr project, but the problems solved here are parallel to our main project.
Problem: In France, anybody can go down the street and buy a Croissant or a Baguette of bread.
And you can find pictures of the Eiffel tower anywhere. But finding relevant digital resources is sometimes a bit more difficult. It is hard to find online documents about places. Tools have been made to describe historical monuments, but not yet to describe the diversity of geographic entities. Several datasets are available, and there is a wealth of available documents about all places of France. But few of them are properly indexed. This would meet both a strong demand from researchers, and an ongoing tourist interest.
Why LOD?: Linked Open data technologies could be used to map the digital resources from the French national Library (BnF) with geographic information, so as to provide both a proper indexing, and easy-to-use maps that would provide you with relevant digital documents about the French Terroirs. This service would have to address issues such as: named entity extraction, data management, and coordination of research applications with an easy-to-use end-user interface.
Any data available?
The first datasets would be:
– Works and Writers from data.bnf.fr, so as to rely on authority data.
– “Manifestations” from data.bnf.fr for bibliographic information.
– “Rameau” subject headings from data.bnf.fr for topics.
– Digital items from Gallica for the content provided to the end-user.
– Geonames and other geographic services for extracting coordinates.
– Other map services for implementing maps in an end-use interface.
Next steps: It would be necessary run some intelligence on the publication information, so as to extract towns and places. It would also be possible to integrate information from the performing arts, and to show the activities that have taken place throughout the time. Depending on the outcome, it could even be possible to use some OCR on the documents from the “Gallica” digital library, by extracting the named entities.”