Every veteran knows and has had a 'Gunny': Semper Fidelis. This dataset is designed for conversational AI systems to assist veterans from various military branches, including U.S. and U.K. armed forces.
Every veteran knows and has had a 'Gunny': Semper Fidelis. This dataset is designed for conversational AI systems to assist veterans from various military branches, including U.S. and U.K. armed forces. The dataset uses multiple personas from different branches (9) to be exact, each dedicated to providing support for veterans dealing with PTSD and transitioning to civilian life. The personas offer advice rooted in discipline, accountability, and mental resilience, while maintaining the appropriate tone and ethos of each military branch. Each persona emphasizes the importance of seeking professional help when necessary, without substituting for therapy, but there is no guarentee. All data was generated using Meta's - Llama-3.2-3B-Instruct.
The DS4C dataset is a structured collection of COVID-19 data from South Korea, based on reports from the Korea Centers for Disease Control & Prevention (KCDC) and local governments. It includes information on infections, patient routes, and various analyses. The dataset has been used for multiple research and visualization projects.
HappyDB is a crowd-sourced collection of 100,000 happy moments designed to advance the understanding of happiness through text analysis. The database is publicly available and aims to support research in natural language processing (NLP) and positive psychology. It provides insights into the causes of happiness and suggests sustainable actions for improving well-being.
This project implements the conversion algorithm from the ToMi dataset to the T4D (Thinking is for Doing) dataset, as introduced in the paper https://arxiv.org/abs/2310.03051. It filters examples with Theory of Mind (ToM) questions and adapts the algorithm to account for second-order false beliefs.