
Label Studio overview

What is Label Studio?

Label Studio is an open source data labeling tool that supports multiple projects, users, and data types in one platform. It allows you to do the following:

  • Perform different types of labeling with many data formats.

  • Integrate Label Studio with machine learning models to supply predictions for labels (pre-labels), or perform continuous active learning. See Set up machine learning with your labeling process.
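As a rough illustration of pre-labels, the sketch below wraps the output of a stand-in model in the task JSON shape that Label Studio accepts for imported predictions. The control names `sentiment` and `text`, the `toy_model` function, and the `toy-v1` version string are assumptions made for this example; in a real project the names must match the `name` attributes in your labeling configuration.

```python
import json

# Hypothetical control names; they must match the `name` attributes
# used in the project's labeling configuration.
FROM_NAME = "sentiment"
TO_NAME = "text"


def toy_model(text):
    """Stand-in for a real model: returns (label, confidence score)."""
    positive_words = {"great", "good", "love"}
    hits = sum(word.strip(".,!?").lower() in positive_words for word in text.split())
    return ("Positive", 0.9) if hits else ("Negative", 0.6)


def to_prelabeled_task(text, model_version="toy-v1"):
    """Wrap raw text and one model prediction in a task dictionary."""
    label, score = toy_model(text)
    return {
        "data": {"text": text},
        "predictions": [
            {
                "model_version": model_version,
                "score": score,
                "result": [
                    {
                        "from_name": FROM_NAME,
                        "to_name": TO_NAME,
                        "type": "choices",
                        "value": {"choices": [label]},
                    }
                ],
            }
        ],
    }


tasks = [
    to_prelabeled_task(t)
    for t in ["I love this product!", "It broke after a day."]
]
print(json.dumps(tasks, indent=2))
```

Saving the printed JSON to a file and importing it as labeling tasks should show each prediction as a pre-selected choice that annotators can accept or correct.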

Label Studio is also available as an Enterprise cloud service with enhanced security (SSO, RBAC, SOC2), team management features, data discovery, analytics and reporting, and support SLAs. A free trial is available to get started quickly and explore the enterprise cloud product.

Interface


[Screenshot: Project List]

[Screenshot: Data Manager]

[Screenshot: Quick View]

Labeling workflow

Start and finish a labeling project with Label Studio by following these steps:

  1. Install Label Studio.
  2. Start Label Studio.
  3. Create accounts for Label Studio. Create an account to manage and set up labeling projects.
  4. Set up the labeling project. Define the type of labeling to perform on the dataset and configure project settings.
  5. Set up the labeling interface. Add the labels that you want annotators to apply and customize the labeling interface.
  6. Import data as labeling tasks.
  7. Label and annotate the data.
  8. Export the labeled data or the annotations.
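
Steps 5, 6, and 8 above can be sketched in Python for a simple text classification project. This is a minimal sketch under stated assumptions: the control names `text` and `sentiment`, the label values, the file names, and the `SAMPLE_EXPORT` data are all made up for illustration, and the sample is a hand-written, trimmed stand-in for a real JSON export rather than actual Label Studio output.

```python
import json
import tempfile
from pathlib import Path

# Step 5: a labeling configuration that shows a text and asks for one choice.
# Paste a configuration like this into the project's labeling interface settings.
LABEL_CONFIG = """<View>
  <Text name="text" value="$text"/>
  <Choices name="sentiment" toName="text">
    <Choice value="Positive"/>
    <Choice value="Negative"/>
  </Choices>
</View>"""

# Hand-written stand-in for a JSON export (assumption: trimmed to the
# fields this sketch reads).
SAMPLE_EXPORT = [
    {
        "id": 1,
        "data": {"text": "I love this product!"},
        "annotations": [
            {
                "id": 10,
                "result": [
                    {
                        "from_name": "sentiment",
                        "to_name": "text",
                        "type": "choices",
                        "value": {"choices": ["Positive"]},
                    }
                ],
            }
        ],
    }
]


def write_tasks(texts, path):
    """Step 6: write one task per text; the "text" key matches $text in the config."""
    tasks = [{"data": {"text": t}} for t in texts]
    Path(path).write_text(json.dumps(tasks, indent=2), encoding="utf-8")
    return tasks


def read_labels(export_path):
    """Step 8: collect (text, choices) pairs from a JSON export file."""
    exported = json.loads(Path(export_path).read_text(encoding="utf-8"))
    rows = []
    for task in exported:
        for annotation in task.get("annotations", []):
            for region in annotation.get("result", []):
                if region.get("type") == "choices":
                    rows.append((task["data"]["text"], region["value"]["choices"]))
    return rows


with tempfile.TemporaryDirectory() as tmp:
    # Create a task file that could be imported into the project.
    write_tasks(["I love this product!", "It broke after a day."], Path(tmp) / "tasks.json")

    # Read labels back from the sample export.
    export_file = Path(tmp) / "export.json"
    export_file.write_text(json.dumps(SAMPLE_EXPORT), encoding="utf-8")
    labels = read_labels(export_file)

for text, choices in labels:
    print(text, "->", choices)
```

Keeping the configuration, the task file, and the export reader side by side makes the dependency visible: the `$text` variable in the configuration, the `text` key in each task, and the `from_name` in each exported result all have to agree.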

Architecture

Enterprise

You can use any of the Label Studio components in your own tools, or customize them to suit your needs. Before customizing Label Studio extensively, you might want to review Label Studio Enterprise Edition to see if it already contains the relevant functionality you want to build. See Label Studio Features for more.

The component parts of Label Studio are available as modular extensible packages that you can integrate into your existing machine learning processes and tools.

Module | Technology | Description
Label Studio main app | Python and Django | The main app with most of the backend code for Label Studio; used to perform data labeling.
Label Studio frontend | JavaScript web app using React and MST | Located within the main app repo. web/apps/labelstudio acts as the central integration point for all frontend elements. web/libs/editor is the frontend library.
Data Manager | JavaScript web app using React | Manage data and tasks for labeling. Located under web/libs/datamanager in the main app repo.
Machine Learning Backends | Python | Predict data labels at various parts of the labeling process.