This resource was designed with absolute beginners in mind. For those who are entirely new to text analysis, HTRC provides introductory tools and access to one of the world's largest online libraries. While you can refer to HTRC's official documentation for more information, this abbreviated guide will walk you through the following processes:
HathiTrust Research Center (HTRC) is an organization that makes the millions of scanned books in the HathiTrust Digital Library available for researchers to analyze with computer programs and machine learning algorithms to reveal insights about literature at a hitherto unprecedented scale. Using tools provided by HTRC, you can:
HTRC also provides tools that offer researchers with coding skills access to pre-calculated data from every page in every volume in HathiTrust via the Extracted Features Dataset, and the chance to run their own code on the full text of HathiTrust works via Data Capsules.
HTRC is made possible by its policy of non-consumptive access, which makes it legal for HTRC to allow researchers to export facts about HathiTrust books (like counts of words, beginning and ending letters of lines, identified parts of speech etc.) but not the full texts of in-copyright works.
HTRC was founded in 2011 and is hosted by Indiana University and the University of Illinois. Ohio University Libraries has been a member of HathiTrust since 2020. You can login to HTRC with your Ohio credentials.