Psy-Insight: Mental Health Counseling Dataset

Psy-Insight: Mental Health Counseling Dataset

Psy-Insight is a bilingual, interpretable multi-turn dataset for mental health counseling dialogues. It includes 6,208 rounds of multi-turn counseling dialogues in English and 5,776 rounds in Chinese, annotated with step-by-step reasoning labels and multi-task labels. This dataset is designed to support the application of large language models in mental health and is suitable for tasks such as emotion classification and psychological treatment interpretation.

Psy-Insight: Mental Health Counseling Dataset

Introdução Detalhada

Psy-Insight is a comprehensive dataset designed to support the development of AI applications in mental health counseling. It includes detailed multi-turn dialogues, emotional labels, psychological treatment methods, and step-by-step reasoning annotations. This dataset is ideal for researchers and developers looking to fine-tune large language models for mental health applications.

Mais
Conjunto de Dados

ToM QA Dataset: Evaluating Theory of Mind in Question Answering
Ver detalhes

ToM QA Dataset: Evaluating Theory of Mind in Question Answering

The ToM QA Dataset is designed to evaluate question-answering models' ability to reason about beliefs. It includes 3 task types and 4 question types, creating 12 total scenarios. The dataset is inspired by theory-of-mind experiments in developmental psychology and is used to test models' understanding of beliefs and inconsistent states of the world.

HuggingFaceFW/fineweb-2
Ver detalhes

HuggingFaceFW/fineweb-2

FineWeb-2 is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. This is the second iteration of the popular 🍷 FineWeb dataset, bringing high quality pretraining data to over 1000 🗣️ languages.The 🥂 FineWeb2 dataset is fully reproducible, available under the permissive ODC-By 1.0 license and extensively validated through hundreds of ablation experiments.In particular, on the set of 9 diverse languages we used to guide our processing decisions, 🥂 FineWeb2 outperforms other popular pretraining datasets covering multiple languages (such as CC-100, mC4, CulturaX or HPLT, while being substantially larger) and, in some cases, even performs better than some datasets specifically curated for a single one of these languages, in our diverse set of carefully selected evaluation tasks: FineTasks.

Question-Level Feature Extraction on DAIC-WOZ Dataset
Ver detalhes

Question-Level Feature Extraction on DAIC-WOZ Dataset

The DAIC-WOZ dataset contains clinical interviews designed to support the diagnosis of psychological distress conditions such as anxiety, depression, and post-traumatic stress disorder. This repository provides code for extracting question-level features from the DAIC-WOZ dataset, which can be used for multimodal analysis of depression levels.

Keywords

Psy-InsightMental Health CounselingBilingual DatasetMulti-turn DialoguesEmotion ClassificationPsychological Treatment

Compartilhar