The SimpleToM dataset is designed to evaluate models' ability to reason about beliefs and actions in various scenarios. It includes a variety of situations with multiple-choice questions and answers, covering topics such as food items, personal belongings, and service industries.
The SimpleToM dataset provides a set of scenarios to test models' understanding of beliefs and actions. Each scenario includes a context, a question, and multiple-choice answers, making it suitable for researchers working on theory of mind and natural language processing. The dataset is available on Hugging Face.
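As a rough illustration of the scenario layout described above (a context, a question, and multiple-choice answers), the sketch below models one item and scores predictions against it. The class name, field names, and the accuracy helper are assumptions made for this example and are not taken from the dataset's actual schema.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Scenario:
    """One multiple-choice item. All field names here are hypothetical."""
    context: str
    question: str
    choices: List[str]
    answer_index: int  # position of the correct choice in `choices`


def accuracy(scenarios: Sequence[Scenario], predictions: Sequence[int]) -> float:
    """Return the fraction of scenarios whose predicted choice index is correct."""
    if len(scenarios) != len(predictions):
        raise ValueError("need exactly one prediction per scenario")
    if not scenarios:
        return 0.0
    correct = sum(s.answer_index == p for s, p in zip(scenarios, predictions))
    return correct / len(scenarios)
```

A model's output for each scenario would be reduced to a choice index and passed to accuracy alongside the scenarios. Keeping the answer as an index rather than a string avoids mismatches caused by whitespace or casing in the answer text.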
Tobii Pro Lab is eye-tracking software designed for behavioral research, offering a complete solution for researchers to conduct experiments from test design to data analysis.
FineWeb is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It is optimized for LLM performance and processed using the datatrove library. The dataset aims to provide high-quality data for training large language models and outperforms other commonly used web datasets.
This dataset contains survey responses from individuals in the tech industry about their mental health, including questions about treatment, workplace resources, and attitudes towards discussing mental health in the workplace. By analyzing this dataset, we can better understand how prevalent mental health issues are among those who work in the tech sector, and what kinds of resources they rely on to find help, so that more can be done to create a healthier working environment for all.