The World's Best Toxicity Dataset

Saving the internet is fun. Combing through thousands of online comments to build a toxicity dataset isn't. That's why we're creating the world's largest dataset of social media toxicity — so you can skip the slog and get to work.

Download Dataset

Built by an Elite Workforce

Surge AI is a data labeling platform and workforce. Our data labeling team pored over tens of thousands of social media comments to build this toxicity dataset. Each comment was then evaluated by multiple members of our team to determine its severity level.

What’s in the dataset?

Explore the diversity of insults, racism, sexism, personal attacks, hateful speech, and more in this toxicity dataset.

Other Datasets

InstructGPT-style Dataset

Build state-of-the-art large language models, in the style of InstructGPT and ChatGPT.

RLHF Dataset for Reinforcement Learning with Human Feedback

Build state-of-the-art AI by training your large language models on human feedback.

Japanese Hate Speech, Insults, and Toxicity Dataset

A dataset of online comments in Japanese that contain hate speech, insults, and toxicity.

Dataset of Search Queries and Intents

This dataset contains search queries, as well as the user's intent when performing the search query.

Google Search Quality Dataset

This Google Search Quality dataset contains search queries, intents, result URLs, and a human-evaluated rating.

Search Evaluation Dataset

This search evaluation dataset contains search queries, the intent behind each search query, result URLs, and a human-evaluated search quality rating.

Love language?
So do we.

We're a team of engineers and researchers from Google, Facebook, Harvard, and MIT. We're building the modern data labeling infrastructure needed to power the next wave of AI.

Our data labeling platform and data labeling teams help AI companies around the world solve their core machine learning and language problems — from detecting hate speech and categorizing user reviews, to training powerful language models.
