The SimpleToM dataset is designed to evaluate models' ability to reason about beliefs and actions in various scenarios. It includes a variety of situations with multiple choice questions and answers, covering topics such as food items, personal belongings, and service industries.
The SimpleToM dataset provides a comprehensive set of scenarios to test models' understanding of beliefs and actions. Each scenario includes a context, a question, and multiple choice answers, making it suitable for researchers working on theory of mind and natural language processing. The dataset is available on Hugging Face, ensuring easy access and integration with existing models.
This study surveys the attitudes and behaviors of US higher education faculty members regarding online resources, the library, and related topics. It covers a wide range of issues, including faculty dependence on electronic scholarly resources, the transition from print to electronic journals, publishing preferences, e-books, and the preservation of scholarly journals.
This repository provides code and data for automatic depression detection using a GRU/BiLSTM-based model. It includes an emotional audio-textual corpus designed to support the diagnosis of psychological distress conditions such as anxiety, depression, and post-traumatic stress disorder.
FineWeb is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It is optimized for LLM performance and processed using the datatrove library. The dataset aims to provide high-quality data for training large language models and outperforms other commonly used web datasets.We’re on a journey to advance and democratize artificial intelligence through open source and open science.