An evolving list of electronic media datasets used to model mental health status. This repository curates a variety of datasets from different sources, including social media platforms, online forums, and academic studies, to support research in mental health modeling and AI applications.
The Mental Health Datasets repository is a curated list of datasets that can be used to model and analyze mental health status. It includes datasets from various sources such as Reddit, Twitter, and online support forums, covering a wide range of mental health conditions like depression, anxiety, and suicidal ideation. This resource is invaluable for researchers and developers working on AI models for mental health support and intervention.For an overview of existing datasets, please consider reading the paper 'On the State of Social Media Data for Mental Health Research'.
MentalManip数据集是由Wang等人(2024b)引入的,专门用于检测和分类心理操纵的对话数据集。该数据集包含4000个多轮虚构对话,来源于在线电影剧本,并进行了多层次的标注,包括操纵的存在、操纵技巧和目标脆弱性。数据集的创建旨在通过高质量的标注确保数据的一致性和准确性,从而支持心理操纵检测的研究。
The Substance Abuse and Mental Health Data Archive (SAMHDA) provides a comprehensive collection of data sets related to mental health and substance use. It includes ongoing studies, population surveys, treatment facility surveys, and client-level data, offering valuable insights for researchers and policymakers.
The data is originally source from (Sun et al,2021). (Liu et al, 2023) processed the data to make it a dataset vis huggingface api with taining/validation/testing splitting