Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
University LibGuides on Text Mining
LibGuides on Text Analysis by Duke University Libraries
Written by Duke University Libraries, this LibGuides offers a good introduction to text Analysis, its types, design and related tools. The LibGuides also provides resources and tutorials related to text analysis.
LibGuides on Text Mining by Georgetown University Library
Written by Georgetown University Library, this LibGuides offers an introduction to text Analysis.
LibGuides on Text Mining by Hong Kong Baptist University Library
Written by Baptist University Library, this LibGuides offers an introduction to text analysis.
LibGuides on Text Mining Tools and Methods by Illinois University Library
Written by members in Illinois University on introducing text mining tools and methodologies.
LibGuides on Text Analysis by The University of Hong Kong Library
Written by The University of Hong Kong Library, this LibGuides offers an introduction to text Analysis.
CUHK LibGuides on Introduction to Digital Humanities
Written by CUHK Library, this Guide will provide you some introductory information about Digital Humanities (GIS) such as software, examples, useful websites, etc.
Text Mining & Analysis Resources
Chinese Text Project
It is an online open-access digital library of pre-modern Chinese texts made available to readers and researchers all around the world. The site attempts to make use of the digital medium to explore new ways of interacting with these texts that are not possible in print.
Kaggle is an online community of data scientists and machine learning practitioners. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Academia Sinica Digital Humanities Research Platform
Developed by Academia Sinica (中央研究院) in Taiwan, this platform hosts a large, open and diverse collection of digital texts, authority files and linked open data. It provides functionalities for text upload, text analysis download, proximity search, text annotation, text comparison, word frequency statistics, N-gram statistics, co-occurring statistics, statistical charts, geographic information system, social network analysis, and other tools for analysis and visualization, allowing researchers to conduct both private individual or large collaborative humanities research projects.
DocuSky allows users to upload text contents to build their personal database. It supports fulltext retrieval, post-classification over a search result, as well as analysis on tagged terms.
GitHub is a provider of Internet hosting for software development and version control using Git. It offers the distributed version control and source code management (SCM) functionality of Git, plus its own features. It provides access control and several collaboration features such as bug tracking, feature requests, task management, continuous integration and wikis for every project.