The CaiTI_dataset repository contains datasets for Motivational Interviewing and Cognitive Behavioral Therapy, curated by therapists to train CaiTI.
The CaiTI_dataset repository is a valuable resource for researchers and developers working in the field of mental health and therapeutic interventions. It provides a collection of datasets specifically curated for training CaiTI, a conversational AI system designed to assist in Motivational Interviewing and Cognitive Behavioral Therapy. These datasets are essential for developing and improving AI-driven therapeutic tools, ensuring they are effective and aligned with clinical practices.
Every veteran knows and has had a 'Gunny': Semper Fidelis. This dataset is designed for conversational AI systems to assist veterans from various military branches, including U.S. and U.K. armed forces.
Psychology LLM、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2 - SmartFlowAI/EmoLLM
FineWeb is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It is optimized for LLM performance and processed using the datatrove library. The dataset aims to provide high-quality data for training large language models and outperforms other commonly used web datasets.We’re on a journey to advance and democratize artificial intelligence through open source and open science.