Hugging Face
Hugging Face (🤗) Datasets is a library for accessing and sharing machine learning datasets. It features powerful data processing methods to quickly get your dataset ready for training deep learning models.
Export
You can export a Segments dataset release as a 🤗 Dataset using the release2dataset
function in the Python SDK:
The returned object is a 🤗 Dataset object. The columns of the exported dataset depend on the task type of the dataset, and closely follow our documented sample and label formats. The columns can be inspected as follows:
The 🤗 Dataset documentation explains how you can modify the structure and contents of the exported dataset in all kinds of ways:
Reorder the rows
Rename and remove columns
Apply a processing function to each row in the dataset
Use as a PyTorch or TensorFlow dataset
Publish to the Hugging Face Hub
You can also easily publish your dataset to the Hugging Face Hub:
Train models on your dataset
This tutorial shows how you can fine-tune a semantic segmentation model on a custom dataset exported from Segments. The process is similar for other task types.
Last updated