The Emotional First Aid Raw Dataset is a collection of raw, unannotated psychological counseling Q&A data, designed to support research in AI applications for mental health. It contains over 172,000 topics with 2,381,273 messages, totaling 44,514,786 characters, providing a rich source of data for natural language processing and AI development.
This dataset is a valuable resource for researchers and developers working on AI-powered psychological counseling tools. It includes a wide range of topics and detailed messages, making it suitable for tasks such as data preprocessing, model training, and dialogue generation. The data is sourced from public websites and has been anonymized and desensitized for privacy protection.
HeartLink is an empathetic psychological model that uses a large language model fine-tuned on a large empathetic Q&A dataset. It can perceive users' emotions and experiences during conversations and provide empathetic responses using rich psychological knowledge, aiming to understand, comfort, and support users. The responses include emoji expressions to bridge the gap with users, offering psychological support and help during consultations.
tartuNLP/reddit-anhedonia by huggingface-mirror (hf-mirror)
This paper discusses Helply - a synthesized ML training dataset focused on psychology and therapy, created by Alex Scott and published by NamelessAI. The dataset developed by Alex Scott is a comprehensive collection of synthesized data designed to train LLMs in understanding psychological and therapeutic contexts. This dataset aims to simulate real-world interactions between therapists and patients, enabling ML models to learn from a wide range of scenarios and therapeutic techniques.