National Forum Data Mining Research Using In Copyright and Limited Access Text Datasets Shaping a Research and Implementation Agenda for Researchers Libraries and Content Providers

Time Frame


Total Funding to Date



  • Bertram Ludäscher

Copyright law and resource licensing complicate the application of text data mining for research. This project convened a National Forum on Text Data Mining with Use-Limited Data in April 2018 that brought together 25 leading stakeholders selected among researchers, librarians, content providers, legal experts, and representatives of scholarly societies to articulate an agenda that provides guidelines for libraries to facilitate research access, implement best practices, and mitigate issues associated with methods, practice, policy, security, and replicability in research that incorporates text datasets that are subject to IP-related restrictions.  The project deliverables, which include an environmental scan, structured community-driven analysis, and ACRL-published white paper, will make recommendations for best practices and policy to guide libraries as they develop text data mining services. With the support of IMLS under the rubric of the National Digital Platform theme, this forum can serve to catalyze, organize, coordinate, and synthesize the conversation into a cohesive agenda that will serve as a foundation for research and practice in libraries, and in the scholarly community.

text mining icon


  • Beth Sandore-Namachchivaya (University of Waterloo)
  • Megan Senseney (University of Arizona)
  • Eleanor Dickson
  • MT Campbell

Funding Agencies

  • Institute of Museum and Library Services, 2017 – $99,536.00