Text and data mining (TDM) uses automated tools to identify, extract, and present data relevant to one's research from large or numerous sources. By processing data in this way, researchers hope to reveal trends or patterns within it. TDM is used in both the humanities and the sciences, and can be applied to a wide variety of data sets.
As a general rule, check with the relevant subject librarian before beginning any project that involves TDM. The complete list of Emory subject librarians can be found here.
Databases often have their own rules and restrictions on what is and is not permissible when applying TDM methods to their data. In addition, access to these databases, which is mediated by Emory Libraries, comes in a variety of forms.
Broadly, databases fall into four categories: