DeepSeek-R1: Reinforcement Learning for LLM Reasoning • Libertify