Open Source
Data Labeling
Platform
The most flexible data labeling platform to fine-tune LLMs, prepare training data or validate AI models.
Last Commit:
Latest version:
# Install the package
# into python virtual environmentpip install -U label-studio
# Launch it!
label-studio
# Install the cask
brew install humansignal/tap/label-studio
# Launch it!
label-studio
# clone repo
git clone https://github.com/HumanSignal/label-studio.git
# install dependencies
cd label-studio
pip install poetry
poetry install# apply db migrations
poetry run python label_studio/manage.py migrate
# collect static files
poetry run python label_studio/manage.py collectstatic
# launch
poetry run python label_studio/manage.py runserver
# Run latest Docker version
docker run -it -p 8080:8080 -v `pwd`/mydata:/label-studio/data heartexlabs/label-studio:latest
# Now visit http://localhost:8080/
Label every data type.
GenAI
LLM Fine-Tuning
Label data for supervised fine-tuning or refine models using RLHF
LLM Evaluations
Response moderation, grading, and side-by-side comparison
RAG Evaluation
Use Ragas scores and human feedback
Quick Startdata:image/s3,"s3://crabby-images/baac7/baac7c27a1b5bc5c5ba48082729efe07676e077e" alt=""
Computer Vision
Image Classification
Put images into categories
Object Detection
Detect objects on image, boxes, polygons, circular, and keypoints supported
Semantic Segmentation
Partition image into multiple segments. Use ML models to pre-label and optimize the process
Quick Startdata:image/s3,"s3://crabby-images/844c8/844c879c8f84574f803df9f437f2c3119492b93b" alt=""
Audio & Speech Applications
Classification
Put audio into categories
Speaker Diarization
Partition an input audio stream into homogeneous segments according to the speaker identity
Emotion Recognition
Tag and identify emotion from the audio
Audio Transcription
Write down verbal communication in text
Quick Startdata:image/s3,"s3://crabby-images/a7edf/a7edff4a6f9f5bb61363dd2443dcd1933c40294a" alt=""
NLP, Documents, Chatbots, Transcripts
Classification
Classify document into one or multiple categories. Use taxonomies of up to 10000 classes
Named Entity
Extract and put relevant bits of information into pre-defined categories
Question Answering
Answer questions based on context
Sentiment Analysis
Determine whether a document is positive, negative or neutral
Quick Startdata:image/s3,"s3://crabby-images/86f67/86f67dbfbdbdd9c848818b350f089f42b64b82f7" alt=""
Robots, Sensors, IoT Devices
Classification
Put time series into categories
Segmentation
Identify regions relevant to the activity type you're building your ML algorithm for
Event Recognition
Label single events on plots of time series data
Quick Startdata:image/s3,"s3://crabby-images/0388e/0388ed9cffc49f895fd2c3caf720d4f80d79c032" alt=""
Multi-Domain Applications
Dialogue Processing
Call center recording can be simultaneously transcribed and processed as text
Optical Character Recognition
Put an image and text right next to each other
Time Series with Reference
Use video or audio streams to easier segment time series data
Quick Startdata:image/s3,"s3://crabby-images/878ec/878ec2e2cd56437f4511a2acd45a33f80bb0e555" alt=""
Video
Classification
Put videos into categories
Object Tracking
Label and track multiple objects frame-by-frame
Assisted Labeling
Add keyframes and automatically interpolate bounding boxes between keyframes
Quick Startdata:image/s3,"s3://crabby-images/4353e/4353e45e670f010794bc74b82b8e461df1c8ba7f" alt=""
Flexible and configurable
Configurable layouts and templates adapt to your dataset and workflow.
Integrate with your ML/AI pipeline
Webhooks, Python SDK and API allow you to authenticate, create projects, import tasks, manage model predictions, and more.
ML-assisted labeling
Save time by using predictions to assist your labeling process with ML backend integration.
Connect your cloud storage
Connect to cloud object storage and label data there directly with S3 and GCP.
Explore & understand your data
Prepare and manage your dataset in our Data Manager using advanced filters.
Multiple projects and users
Support multiple projects, use cases and data types in one platform.
From the Blog
View All Articles-
Announcing Label Studio 1.16.0
We’re excited to announce the 1.16.0 release of Label Studio! In this release we have included a number of awesome new features, some critical security updates, and other improvements and bug fixes.
Label Studio Team
February 12, 2025
-
Reinforcement Learning from Verifiable Rewards
Learn about Reinforcement Learning with Verifiable Rewards, one of the leading training strategies for injecting learning signals into LLMs, successfully employed by models such as DeepSeek R1 and Tülu 3.
Nikolai Liubimov
February 7, 2025
-
Top 5 Most Successful Data Curation Strategies in DeepSeek
Learn the data curation and human supervision techniques that we believe are crucial to DeepSeek’s success by examining technical reports from DeepSeek-R1, DeepSeek-V3, and its predecessors.
Nikolai Liubimov
January 30, 2025
Trusted by companies large and small
Global Community
Join the largest community of Data Scientists working on enhancing their models.