Edwin Chen
It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout. The point of using Lorem Ipsum is that it has a more-or-less normal distribution of letters, as opposed to using 'Content here, content here', making it look like readable English. Many desktop publishing packages and web page editors now use Lorem Ipsum as their default model text, and a search for 'lorem ipsum' will uncover many web sites still in their infancy. Various versions have evolved over the years, sometimes by accident, sometimes on purpose (injected humour and the like).

How Anthropic uses Surge AI to Train and Evaluate Claude
Edwin ChenLearn how Anthropic partnered with Surge AI to gather high-quality human feedback at scale using the RLHF platform, resulting in one of the safest and most advanced large language models on the planet.

HellaSwag or HellaBad? 36% of this popular LLM benchmark contains errors
Edwin ChenWe analyzed HellaSwag, a popular LLM benchmark, and found errors in 36% of its rows.

30% of Google's Emotions Dataset is Mislabeled
Edwin ChenLast year, Google released their “GoEmotions” dataset: a human-labeled dataset of 58K Reddit comments categorized according to 27 emotions. The problem? A whopping 30% of the dataset is mislabeled! Check out some of the egregious errors, and learn how to build better datasets.30% of Google's Emotions Dataset is Mislabeled

How Surge AI Built OpenAI's GSM8K Dataset of 8,500 Math Problems
Edwin ChenWe built a dataset of 8,500 Grade School Math Problems for OpenAI. The goal of the dataset: to train language models like GPT-3 to solve natural language math problems and measure their reasoning ability. Learn about our process in this blog post!
Shaping AGI with
Human Intelligence
Human Intelligence
Contact
© 2025 Surge AI