DeepSeek-R1

DeepSeek-R1 is a reasoning model trained via large-scale reinforcement learning (RL) without the need for supervised fine-tuning (SFT). It demonstrates remarkable performance in reasoning tasks, including self-verification and reflection. The model addresses challenges such as endless repetition and poor readability, and achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks.

Yksityiskohtainen esittely

DeepSeek-R1 is an advanced reasoning model that leverages large-scale reinforcement learning to achieve significant performance in reasoning tasks. It incorporates cold-start data before RL to enhance reasoning capabilities and address issues like repetition and readability. DeepSeek-R1 is designed to provide high accuracy in reasoning tasks and is suitable for a wide range of applications.

Visit Website

Lisää
AI

Psy-Insight: Mental Health Counseling Dataset

Psy-Insight is a bilingual, interpretable multi-turn dataset for mental health counseling dialogues. It includes 6,208 rounds of multi-turn counseling dialogues in English and 5,776 rounds in Chinese, annotated with step-by-step reasoning labels and multi-task labels. This dataset is designed to support the application of large language models in mental health and is suitable for tasks such as emotion classification and psychological treatment interpretation.

LIGHT WITHIN LIFE

When you have an idea, just write it down. Leave the tedious work to the software and gain insights into information that is valuable to you. XinGuang is an AI-driven, intelligent, proactive and feedback recording tool.

Mind Trainer - Clairity Healing - Strengthen Your Mind

For men who are struggling. Just click or enter whats bothering you and find meditative relief. Solutions for you.

Sivuston URL

https://github.com/deepseek-ai/DeepSeek-R1

Kategoriat

AI LLM Tutkimus

Avainsanat

DeepSeek-R1Reasoning ModelOpen Source Large Language ModelOpen Source LLMOpen SourceReinforcement LearningSupervised Fine-TuningLanguage ModelCode GenerationMathematical ReasoningMachine LearningNatural Language Processing