The Emotional First Aid Raw Dataset is a collection of raw, unannotated psychological counseling Q&A data, designed to support research in AI applications for mental health. It contains over 172,000 topics with 2,381,273 messages, totaling 44,514,786 characters, providing a rich source of data for natural language processing and AI development.
This dataset is a valuable resource for researchers and developers working on AI-powered psychological counseling tools. It includes a wide range of topics and detailed messages, making it suitable for tasks such as data preprocessing, model training, and dialogue generation. The data is sourced from public websites and has been anonymized and desensitized for privacy protection.
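As a rough sanity check on the counts quoted for the Emotional First Aid Raw Dataset, the per-topic and per-message averages can be derived directly from them. The sketch below is illustrative only: the variable and function names are hypothetical, and because the description says "over 172,000 topics", the messages-per-topic figure is an upper bound on the true average rather than an exact value.

```python
# Figures quoted in the dataset description above.
TOPICS_LOWER_BOUND = 172_000  # "over 172,000 topics": the true count is somewhat higher
MESSAGES = 2_381_273
CHARACTERS = 44_514_786


def summarize(topics, messages, characters):
    """Return simple average-size statistics derived from the raw counts."""
    return {
        "messages_per_topic": messages / topics,
        "characters_per_message": characters / messages,
    }


stats = summarize(TOPICS_LOWER_BOUND, MESSAGES, CHARACTERS)

# Using a lower bound for the topic count makes this an upper bound on the average.
print(f"messages per topic (at most): {stats['messages_per_topic']:.2f}")
print(f"characters per message: {stats['characters_per_message']:.2f}")
```

Under these figures, the average message is fewer than 20 characters long and a topic holds at most about 14 messages on average, which suggests short conversational turns rather than long-form posts.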
HeartLink is an empathetic psychological model built on a large language model that has been fine-tuned on a large empathetic Q&A dataset. During a conversation it perceives the user's emotions and experiences and draws on rich psychological knowledge to respond empathetically, aiming to understand, comfort, and support the user. Its responses include emoji to bridge the gap with users, offering psychological support and help during consultations.
The data was originally sourced from Sun et al. (2021). Liu et al. (2023) processed it into a dataset available through the Hugging Face API, with training/validation/testing splits.
HappyDB is a crowd-sourced collection of 100,000 happy moments designed to advance the understanding of happiness through text analysis. The database is publicly available and aims to support research in natural language processing (NLP) and positive psychology. It provides insights into the causes of happiness and suggests sustainable actions for improving well-being.