Skip to main content

Text & Data Mining: Home

Welcome to this LibGuide supporting TDM practitioners in Cambridge, students and researchers considering a project employing TDM, and librarians fielding enquiries about TDM from their library users. 

This guide is a work in progress and as far as we are aware, the first LibGuide on TDM in the UK!  We want to make this useful to you, so please email us with any suggestions or ideas - write to  Thank you.

Contact us

If you have any questions about this service or have any feedback for us, please let us know.


OpenMinted Platform

OpenMinTed is working on a platform that will be a gateway to many types of language data, including tagsets, ontologies, publications and corpora. The platform will also offer services and functionalities that are useful for text and data mining, and allow miners to share their tools and build their own workflows.

New "TDM Test Kitchen"

The TDM Test Kitchen is an experimental service supported by Cambridge Digital Humanities, Cambridge University Library and Cambridge University Press.

The TDM Test Kitchen aims to:

Explore the application of TDM (Text and Data-Mining) methods to CUP and UL collections.

Provide a ‘live’ learning environment where researchers, CUP and library staff involved either using TDM methods or developing TDM support services can learn more about TDM methods, share good practice and exchange knowledge about how to overcome challenges.

Facilitate discussion between researchers, the UL and CUP about how to develop TDM methods and services in future.

Read more about the TDM Test Kitchen

Next steps for TDM: Cambridge Symposium

Sharing thinking on TDM in Cambridge: Links to description and social media on the Cambridge Symposium on TDM

New bite-size educational videos on TDM

OpenMinTeD "sets out to create an open, service-oriented e-Infrastructure for Text and Data Mining (TDM) or scientific and scholarly content.  Researchers can collaboratively create, discover, share and re-use knowledge from a wide range of text-based scientific related resources in a seamless way".

OpenMinTeD has a Knowledge Base comprising a range of materials including visualizations of the TDM workflows, textual guides, Webinars, and training videos showing methods applied in practice by experts in the field:

- Key concepts and areas in TDM explained - part 1

- Key concepts and areas in TDM explained - part 2: Knowledge representation

- Key concepts and areas in TDM explained - part 3: Recommenders and filtering 

- Key concepts and areas in TDM explained - part 4: Semantic search

- Key concepts and areas in TDM explained - part 5: Knowledge discovery