The Chinese Psychological QA DataSet is a collection of 102,845 community Q&A pairs related to psychological topics, providing a rich source of data for research and development in psychological counseling and AI applications. Each entry includes detailed question and answer information, making it a valuable resource for understanding user queries and generating appropriate responses.
The Chinese Psychological QA DataSet is a comprehensive dataset containing 102,845 community Q&A pairs. Each entry includes detailed information such as the question title, content, answer count, reward number, and question labels. This dataset is designed to support the development of AI-powered psychological counseling tools and chatbots. It includes a wide range of topics and detailed annotations, making it suitable for tasks such as question answering, sentiment analysis, and dialogue generation. The dataset also provides statistical information like the number of comforts given to the questioner, the number of collections, and the number of replies. This dataset is valuable for researchers and developers working on psychological question-answering systems or related applications.
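As a rough illustration of the per-entry fields listed above, a question record could be modeled as shown below. This is only a sketch: the class name `QAEntry`, the helper `parse_entry`, and every field and key name are hypothetical and are not taken from the dataset's actual schema, which should be checked against the released files.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class QAEntry:
    """One community question record. All field names are hypothetical."""
    title: str                                        # question title
    content: str = ""                                 # question body
    answer_count: int = 0                             # number of answers
    reward: int = 0                                   # reward number
    labels: list[str] = field(default_factory=list)   # question labels
    comfort_count: int = 0                            # comforts given to the questioner
    collection_count: int = 0                         # number of collections
    reply_count: int = 0                              # number of replies


def parse_entry(raw: dict[str, Any]) -> QAEntry:
    """Build a QAEntry from a raw dict, defaulting any missing optional key."""
    return QAEntry(
        title=str(raw["title"]),
        content=str(raw.get("content", "")),
        answer_count=int(raw.get("answer_count", 0)),
        reward=int(raw.get("reward", 0)),
        labels=list(raw.get("labels", [])),
        comfort_count=int(raw.get("comfort_count", 0)),
        collection_count=int(raw.get("collection_count", 0)),
        reply_count=int(raw.get("reply_count", 0)),
    )


if __name__ == "__main__":
    # Made-up record for illustration only; not drawn from the dataset.
    entry = parse_entry({"title": "Example question", "answer_count": "3"})
    print(entry)
```

Counts are coerced with `int(...)` so that numeric values stored as strings are still accepted; whether the real files need this is an assumption.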
The Emotional First Aid Dataset is a comprehensive Chinese psychological counseling QA corpus, featuring 20,000 multi-turn dialogues. It is designed to support the development of AI applications in the field of psychological counseling and is available for research purposes.
FineWeb is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It is optimized for LLM performance and processed using the datatrove library. The dataset aims to provide high-quality data for training large language models and outperforms other commonly used web datasets.
The Weibo User Depression Detection Dataset is a large-scale dataset for detecting depression in Weibo users. It includes user profiles, tweets, and labels indicating whether the user is depressed. The dataset is useful for researchers working on mental health and social media analysis.