speedlabeling

Labeled data is important for machine learning, but hand-labeling a dataset can be time-consuming and prohibitively expensive. Being able to generate labeled data quickly and accurately is crucial to an effective, timely model. Application specifications might also change over time, requiring the training dataset to be changed. In such cases, we need to devise tools that can act as quickly as possible by generating the required training dataset. Our solution to this problem is speed labeling, a project designed to create labeled training data faster and smarter.

This project involves a suite of tools to help users efficiently annotate the data in question. The first part, the label preparation interface, enables users to input different kinds of data, processes the data, and suggests potential label classes. The user can then edit the suggested classes and set the final list.

The second part is a recommendation system. Here, suitable labeling algorithms are selected by the system for the specific type of input data. For example, images and video may be classified differently than textual data. Then the system is ready for the process of tagging.

The third part of the system, the generative model, deals with different inputs collected from crowdsourcing, existing knowledge bases, and weak supervision. The labels returned from these three sources are usually not accurate and may conflict with one another. The generative model learns the accuracies of the sources and makes use of the majority vote, weights assigned to the sources, and other factors to choose the right label.

Acquiring the large datasets needed by supervised learning is arguably the most challenging part of building artificial intelligence. By designing a system that utilizes the power of crowdsourcing and machine learning, we will be able to classify datasets cheaper, faster, and more accurately, leading to better learning models.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
css		css
images		images
js		js
README.md		README.md
index.html		index.html
lasso.html		lasso.html
login.html		login.html
multi_class.html		multi_class.html
text.html		text.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speedlabeling

A prototype trinary labeling interface

A prototype multi-class labeling interface

About

Releases

Packages

Languages

oudalab/speedlabeling

Folders and files

Latest commit

History

Repository files navigation

speedlabeling

A prototype trinary labeling interface

A prototype multi-class labeling interface

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages