Update 01-introduction.md #16

Open: wants to merge 1 commit into base: gh-pages
5 changes: 4 additions & 1 deletion _episodes/01-introduction.md
@@ -15,7 +15,10 @@ keypoints:
Welcome to this hands-on lesson to learn some text and data mining skills. We will first run through some of the basics that you will need when exploring and analysing text.

## What is Text Mining?
FIXME

Text mining, also called text analysis, uses natural language processing (NLP) and computer science techniques to transform unstructured text from documents, websites or databases into a structured corpus that machine learning (ML) algorithms can work with. The purpose is to gain a deeper understanding of the content being analysed and to discover patterns in it.
According to [Wikipedia](https://en.wikipedia.org/wiki/Text_mining), text mining involves the discovery by computer of new, previously unknown information by automatically extracting information from different written resources.
Text mining tasks cover a broad range of methods, including text categorization, text clustering, concept extraction, production of granular taxonomies and document summarization.
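
As a rough illustration of turning unstructured text into a structured form, the sketch below builds a small term-count table using only the Python standard library. The two sample sentences and the helper names `tokenize` and `term_counts` are made up for this example and are not part of the lesson; real projects would usually rely on a dedicated NLP library and a more careful tokenizer.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_counts(documents):
    """Return one Counter of token frequencies per document."""
    return [Counter(tokenize(doc)) for doc in documents]


# Hypothetical sample documents, invented for illustration only.
documents = [
    "Text mining turns unstructured text into structured data.",
    "Structured data is easier to analyse than raw text.",
]

counts = term_counts(documents)

# Shared vocabulary across all documents, in alphabetical order.
vocabulary = sorted(set().union(*counts))

# Document-term matrix: one row per document, one column per vocabulary term.
matrix = [[c[term] for term in vocabulary] for c in counts]

for doc, row in zip(documents, matrix):
    print(doc)
    print(dict(zip(vocabulary, row)))
```

Each row of `matrix` is a fixed-length numeric vector, which is the kind of structured representation that clustering or categorization methods typically take as input.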

## Terminology
