Hugging Face Dataset - NamelessAI Helply

Hugging Face Dataset - NamelessAI Helply

This paper discusses Helply - a synthesized ML training dataset focused on psychology and therapy, created by Alex Scott and published by NamelessAI. The dataset developed by Alex Scott is a comprehensive collection of synthesized data designed to train LLMs in understanding psychological and therapeutic contexts. This dataset aims to simulate real-world interactions between therapists and patients, enabling ML models to learn from a wide range of scenarios and therapeutic techniques.

Hugging Face Dataset - NamelessAI Helply

詳細介紹

The Helply dataset is a comprehensive synthetic ML training dataset created by Alex Scott and released by NamelessAI, focusing on the fields of psychology and therapy. The dataset is designed to train large language models (LLMs) to understand and simulate human psychological processes. By combining existing psychology literature, therapy session records, and patient self-report data, the Helply dataset covers a variety of treatment scenarios, such as cognitive behavioral therapy (CBT), internal family systems (IFS), and internet-based cognitive behavioral therapy (iCBT). In addition, the dataset emphasizes the dynamic interaction between patients and therapists, capturing communication details that affect treatment outcomes. Despite challenges such as ethical considerations and model generalization, the Helply dataset has revolutionary potential to change the understanding and application of therapeutic practices in digital environments.

更多
數據集

ISSP: International Social Science Survey Program
查看詳情

ISSP: International Social Science Survey Program

The ISSP is a cross-national collaboration program conducting annual surveys on diverse topics relevant to social sciences. Established in 1984, it includes members from various cultures around the globe. Over one million respondents have participated in ISSP surveys, and all collected data and documentation are available free of charge.

Mental Health Data at WHO
查看詳情

Mental Health Data at WHO

The World Health Organization (WHO) provides a comprehensive collection of global health data, including mental health statistics. This resource offers insights into various mental health conditions and their prevalence, helping researchers and policymakers understand and address mental health challenges worldwide.

HuggingFaceFW/fineweb-2
查看詳情

HuggingFaceFW/fineweb-2

FineWeb-2 is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. This is the second iteration of the popular 🍷 FineWeb dataset, bringing high quality pretraining data to over 1000 🗣️ languages.The 🥂 FineWeb2 dataset is fully reproducible, available under the permissive ODC-By 1.0 license and extensively validated through hundreds of ablation experiments.In particular, on the set of 9 diverse languages we used to guide our processing decisions, 🥂 FineWeb2 outperforms other popular pretraining datasets covering multiple languages (such as CC-100, mC4, CulturaX or HPLT, while being substantially larger) and, in some cases, even performs better than some datasets specifically curated for a single one of these languages, in our diverse set of carefully selected evaluation tasks: FineTasks.

關鍵詞

Hugging FaceML TrainingNamelessAIiCBTCBTIFSCognitive Behavioral TherapyInternal Family SystemsInternet-based Cognitive Behavioral Therapy

分享