Enriching Content with User Data and Semantic Information


Oxford University Press, a department of the University of Oxford, has been developing semantic enrichment capabilities over a number of years to improve the management and usage of our book, journal and dictionary content.

This talk is about how we’re combining human-authored semantic information with semantic tags and taxonomy classifications automatically extracted from our content. I’ll touch on the requirements we have as a user of text mining applications and cover what we learned from implementing schema.org markup on our sites.

I’ll also introduce the Oxford Global Languages project, which uses a triple store to link lexical information from multiple languages, both global ones and digitally under-represented ones such as isiZulu and Urdu. Strong community involvement allows native speakers of those languages to contribute to the dictionaries, building up richer interlinking between all the languages. The linked data can then be accessed via an API, which is due to be publicly launched this autumn.
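
As a rough illustration of what linking lexical entries in a triple store can look like, here is a minimal Python sketch. It is not the Oxford Global Languages data model or API: the entry identifiers, the predicate names (`lang`, `writtenForm`, `translatesTo`) and the helper functions are all hypothetical, and a plain list of tuples stands in for a real triple store.

```python
# Illustrative only: identifiers and predicates are assumptions,
# not the actual Oxford Global Languages schema.
TRIPLES = [
    ("entry:zu/amanzi", "lang", "zu"),
    ("entry:zu/amanzi", "writtenForm", "amanzi"),
    ("entry:zu/amanzi", "translatesTo", "entry:en/water"),
    ("entry:ur/pani", "lang", "ur"),
    ("entry:ur/pani", "writtenForm", "پانی"),
    ("entry:ur/pani", "translatesTo", "entry:en/water"),
    ("entry:en/water", "lang", "en"),
    ("entry:en/water", "writtenForm", "water"),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def linked_via(triples, pivot):
    """Map language code to written form for every entry translating to pivot."""
    result = {}
    for subj, _, _ in match(triples, p="translatesTo", o=pivot):
        lang = match(triples, s=subj, p="lang")[0][2]
        form = match(triples, s=subj, p="writtenForm")[0][2]
        result[lang] = form
    return result


if __name__ == "__main__":
    print(linked_via(TRIPLES, "entry:en/water"))
```

In this sketch the isiZulu and Urdu entries are never linked to each other directly; they become connected because both point at the same English entry, which is one simple way a contribution in one language can improve the interlinking of all of them.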