This project implements the conversion algorithm from the ToMi dataset to the T4D (Thinking is for Doing) dataset, as introduced in the paper https://arxiv.org/abs/2310.03051. It filters examples with Theory of Mind (ToM) questions and adapts the algorithm to account for second-order false beliefs.
The t4d project is a direct implementation of the conversion algorithm from the ToMi dataset to the T4D dataset. It is designed to filter and process examples that involve Theory of Mind questions, providing a valuable resource for researchers working on cognitive and social AI models. The project is built to convert a predefined dataset A (ToMi) to dataset B (T4D) and is licensed under the Apache License, Version 2.0.
The data is originally source from (Sun et al,2021). (Liu et al, 2023) processed the data to make it a dataset vis huggingface api with taining/validation/testing splitting
The Mental Health Corpus contains labeled comments on mental health issues, used for sentiment and toxic language analysis.
This paper discusses Helply - a synthesized ML training dataset focused on psychology and therapy, created by Alex Scott and published by NamelessAI. The dataset developed by Alex Scott is a comprehensive collection of synthesized data designed to train LLMs in understanding psychological and therapeutic contexts. This dataset aims to simulate real-world interactions between therapists and patients, enabling ML models to learn from a wide range of scenarios and therapeutic techniques.