APA PsycInfo is the premier abstracting and indexing database covering the behavioral and social sciences. It provides over 5,000,000 peer-reviewed records, 144 million cited references, and spans 600 years of content. The database is updated twice-weekly and includes research in 30 languages from 50 countries.
For over 55 years, APA PsycInfo has been the most trusted index of psychological science in the world. It offers a comprehensive and precise indexing of abstracts, supporting students, scientists, and educators. The newest features leverage artificial intelligence and machine learning to provide a personalized research assistant, enhancing the discovery and usage of essential psychological research.
The CaiTI_dataset repository contains datasets for Motivational Interviewing and Cognitive Behavioral Therapy, curated by therapists to train CaiTI.
FineWeb-2 is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. This is the second iteration of the popular 🍷 FineWeb dataset, bringing high quality pretraining data to over 1000 🗣️ languages.The 🥂 FineWeb2 dataset is fully reproducible, available under the permissive ODC-By 1.0 license and extensively validated through hundreds of ablation experiments.In particular, on the set of 9 diverse languages we used to guide our processing decisions, 🥂 FineWeb2 outperforms other popular pretraining datasets covering multiple languages (such as CC-100, mC4, CulturaX or HPLT, while being substantially larger) and, in some cases, even performs better than some datasets specifically curated for a single one of these languages, in our diverse set of carefully selected evaluation tasks: FineTasks.
The Chinese Psychological QA DataSet is a collection of 102,845 community Q&A pairs related to psychological topics., providing a rich source of data for research and development in psychological counseling and AI applications. Each entry includes detailed question and answer information, making it a valuable resource for understanding user queries and generating appropriate responses.