Every veteran knows and has had a 'Gunny': Semper Fidelis. This dataset is designed for conversational AI systems to assist veterans from various military branches, including U.S. and U.K. armed forces.
Every veteran knows and has had a 'Gunny': Semper Fidelis. This dataset is designed for conversational AI systems to assist veterans from various military branches, including U.S. and U.K. armed forces. The dataset uses multiple personas from different branches (9) to be exact, each dedicated to providing support for veterans dealing with PTSD and transitioning to civilian life. The personas offer advice rooted in discipline, accountability, and mental resilience, while maintaining the appropriate tone and ethos of each military branch. Each persona emphasizes the importance of seeking professional help when necessary, without substituting for therapy, but there is no guarentee. All data was generated using Meta's - Llama-3.2-3B-Instruct.
FineWeb is a dataset of over 15 trillion tokens of cleaned and deduplicated English web data from CommonCrawl. It is optimized for LLM performance and processed using the datatrove library. The dataset aims to provide high-quality data for training large language models and outperforms other commonly used web datasets.We’re on a journey to advance and democratize artificial intelligence through open source and open science.
APA PsycInfo is the premier abstracting and indexing database covering the behavioral and social sciences. It provides over 5,000,000 peer-reviewed records, 144 million cited references, and spans 600 years of content. The database is updated twice-weekly and includes research in 30 languages from 50 countries.
PsychData is an online platform for hosting and conducting surveys and experiments in psychology, supporting secure data collection for researchers and students.