Hugging Face Dataset - NamelessAI Helply

This paper discusses Helply, a synthesized ML training dataset focused on psychology and therapy, created by Alex Scott and published by NamelessAI. It is a collection of synthesized data designed to train LLMs to understand psychological and therapeutic contexts. The dataset aims to simulate real-world interactions between therapists and patients, so that ML models can learn from a wide range of scenarios and therapeutic techniques.

Detailed Introduction

The Helply dataset is a synthetic ML training dataset created by Alex Scott and released by NamelessAI, focusing on psychology and therapy. It is designed to train large language models (LLMs) to understand and simulate human psychological processes. By combining existing psychology literature, therapy session records, and patient self-report data, the dataset covers a variety of treatment scenarios, such as cognitive behavioral therapy (CBT), internal family systems (IFS), and internet-based cognitive behavioral therapy (iCBT). It also emphasizes the dynamic interaction between patients and therapists, capturing the communication details that affect treatment outcomes. Despite challenges such as ethical considerations and model generalization, the Helply dataset could substantially change how therapeutic practices are understood and applied in digital environments.

More Datasets

ISSP: International Social Survey Programme

The ISSP is a cross-national collaboration program conducting annual surveys on diverse topics relevant to social sciences. Established in 1984, it includes members from various cultures around the globe. Over one million respondents have participated in ISSP surveys, and all collected data and documentation are available free of charge.

Mental Health Data at WHO

The World Health Organization (WHO) provides a comprehensive collection of global health data, including mental health statistics. This resource offers insights into various mental health conditions and their prevalence, helping researchers and policymakers understand and address mental health challenges worldwide.

HuggingFaceFW/fineweb-2

FineWeb-2 is the second iteration of the popular 🍷 FineWeb dataset, which provides over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It brings high-quality pretraining data to over 1000 🗣️ languages. The 🥂 FineWeb2 dataset is fully reproducible, available under the permissive ODC-By 1.0 license, and extensively validated through hundreds of ablation experiments. In particular, on the set of 9 diverse languages its authors used to guide processing decisions, 🥂 FineWeb2 outperforms other popular pretraining datasets covering multiple languages (such as CC-100, mC4, CulturaX, or HPLT) while being substantially larger. In some cases it even performs better than datasets specifically curated for a single one of these languages, on a diverse set of carefully selected evaluation tasks: FineTasks.

Website URL

https://huggingface.co/datasets/namelessai/helply
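
The URL above can be turned into the repository ID that the Hugging Face `datasets` library expects. The following is a minimal sketch, not an official recipe: the helper names `dataset_repo_id` and `load_helply` are hypothetical, and the listing does not state the dataset's splits, columns, or access requirements, so the loading step is an assumption that needs the `datasets` package and network access.

```python
from urllib.parse import urlparse


def dataset_repo_id(url: str) -> str:
    """Extract 'namespace/name' from a Hugging Face dataset page URL."""
    # Path looks like /datasets/<namespace>/<name>[/...]
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 3 or parts[0] != "datasets":
        raise ValueError(f"not a Hugging Face dataset URL: {url}")
    return "/".join(parts[1:3])


def load_helply(url: str = "https://huggingface.co/datasets/namelessai/helply"):
    """Hypothetical convenience wrapper; assumes the dataset is publicly loadable."""
    # Imported lazily so dataset_repo_id works without the `datasets` package.
    from datasets import load_dataset

    return load_dataset(dataset_repo_id(url))
```

Calling `dataset_repo_id` on the URL above yields `namelessai/helply`; the same helper works for other dataset pages listed here, such as HuggingFaceFW/fineweb-2.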

Keywords

Hugging Face, ML Training, NamelessAI, iCBT, CBT, IFS, Cognitive Behavioral Therapy, Internal Family Systems, Internet-based Cognitive Behavioral Therapy
