Appropriate Use of Purchased or Licensed Resources
Most of the library's electronic resources are governed by license agreements that limit use to the Emory community or to individuals who are physically present at Emory University Library facilities.
Adapted from Yale's Resources for Text Mining Guide
See the table below for information about databases that allow text and data mining (TDM). If the resource you want to use is not listed here, please contact your subject librarian.
Vendor | Fee? | Details | Help/Guidelines | Examples |
---|---|---|---|---|
Adam Matthew | FREE | All databases from Adam Matthew (which digitizes unique primary source collections) are available for mining. Contact Emory Libraries to initiate the process. | Data Mining/Text Mining Statement from Adam Matthew; Data Mining with Adam Matthew Primary Source Collections from UCLA | |
Gale Primary Source Collections | | Gale Digital Scholar Lab (DSL) lets you analyze the Gale digital collections that Emory holds, using a suite of DH tools available on this cloud-based platform. Gale Artemis: Primary Sources searches across 22 of our Gale primary source databases covering 1500-2012 and has a Term Frequency search option and a Term Clusters viewer (available from the article results list). To download large datasets, Emory Libraries must request the data on your behalf from our Gale sales representative. Requests can take several weeks to process. Gale will send a hard drive with the requested data to the libraries for you to use. | Digital Scholar Lab Webinar | |
JSTOR | FREE | Data for Research (DfR) provides a self-service system for text mining. With a free DfR account you can download the metadata, word frequencies, citations, key terms, and N-grams of up to 1,000 documents. For larger datasets (more than 1,000 documents) or a type of data not available through the main site, contact JSTOR directly: support@ithaka.org. | Introduction to using DfR from DH @ Washington and Lee University | |
Nexis Uni | FREE (in small amounts) | Does not officially support TDM; however, patrons who want to build a corpus can download up to 500 articles at a time in RTF format. | ||
Oxford English Dictionary (OED) | FREE | Opening Up the Oxford English Dictionary | ||
ProQuest | Mostly FREE for newspapers we have purchased | Does not yet include Alexander Street Press products. Emory also has perpetual access licenses (PALs) to the following databases: Historical Chinese Newspapers, Historical Jewish Newspapers, Vogue, Women's Magazine Archive, News, Policy and Politics (includes Newsweek), and American Periodical Series. You may NOT download the datasets themselves at this time (although ProQuest is looking to expand to owned content). | TDM Libguide with quick onboarding guide, webinars, etc. | Robots Reading Vogue |
Readex | Price available upon request | Collections for which we have perpetual access licenses may be available upon request. These include African American Newspapers, African American Periodicals, and America's Historical Newspapers. For more information, contact Emory Libraries. | | |
WorldCat | FREE | You can request access to the various APIs for WorldCat content through Emory Libraries' OCLC Cataloging and Metadata subscription (full cataloging) and FirstSearch/WorldCat Discovery subscription. | | |
Adapted from the University of Southern California's Content Mining Guide.
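
Because the Nexis Uni row caps each download at 500 articles, a larger corpus has to be collected in several passes. The sketch below only does the arithmetic: it splits a result count into ranges of at most 500. The function name `plan_batches` and the assumption that results are numbered from 1 are illustrative and do not come from Nexis Uni; the code does not connect to any vendor system.

```python
NEXIS_UNI_BATCH_LIMIT = 500  # per-download cap stated in the table above


def plan_batches(total_articles, batch_limit=NEXIS_UNI_BATCH_LIMIT):
    """Return 1-based, inclusive (start, end) ranges, one per download pass.

    Hypothetical helper: assumes search results are numbered from 1.
    """
    if total_articles < 0:
        raise ValueError("total_articles must not be negative")
    if batch_limit < 1:
        raise ValueError("batch_limit must be at least 1")
    return [
        (start, min(start + batch_limit - 1, total_articles))
        for start in range(1, total_articles + 1, batch_limit)
    ]


if __name__ == "__main__":
    # A 1,200-article result set needs three passes.
    for start, end in plan_batches(1200):
        print(f"Download results {start}-{end}")
```

Planning the ranges before you start makes it easier to track which passes are done if a session is interrupted.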