Journal article

Mapping the Bentham Corpus: Concept-based Navigation

Abstract: British philosopher and reformer Jeremy Bentham (1748-1832) left over 60,000 folios of unpublished manuscripts. The Bentham Project, at University College London, is creating a TEI version of the manuscripts via crowdsourced transcription verified by experts. We present an interface for navigating these largely unedited manuscripts, together with the language technologies used to enrich the corpus and facilitate navigation, i.e., entity linking against the DBpedia knowledge base and keyphrase extraction. We discuss the challenges of tagging a historical, domain-specific corpus with a contemporary knowledge base. The extracted concepts were used to create interactive co-occurrence networks that serve as a map of the corpus and, along with a search index, help navigate it. These corpus representations were integrated into a user interface. Domain experts evaluated the interface with satisfactory results; e.g., they found the distributional semantics methods used here applicable for retrieving related passages for scholarly editing of the corpus.

Contributor: Pablo Ruiz Fabo
Submitted on: Tuesday, 12 February 2019 - 14:59:55
Last modified on: Monday, 10 January 2022 - 17:30:14
Long-term archiving on: Monday, 13 May 2019 - 17:16:07


Publisher files allowed on an open archive


  • HAL Id : hal-01915730, version 2



Pablo Ruiz, Thierry Poibeau. Mapping the Bentham Corpus: Concept-based Navigation. Journal of Data Mining and Digital Humanities, 2019, Special Issue: Digital Humanities between knowledge and know-how (Atelier Digit_Hum). ⟨hal-01915730v2⟩


