Breynald Shelter

Posted 2025-03-11Environment Configuration2 minutes read (About 253 words)

Require:

Server1 (OpenWebUI / frpc)
Server2 (Public IP / Domain name / frps)

Posted 2025-03-10tech article6 minutes read (About 950 words)

Diffusion Language Model: The Rise of a New Paradigm for LLM? A Revolutionary Breakthrough That Overturns Autoregressive LLM

Literature:
- Simple and Effective Masked Diffusion Language Models
- Score-Based Generative Modeling through Stochastic Differential Equations
- Sequence-to-Sequence Denoising Diffusion
Image source:
- Diffusion Models: A Comprehensive Survey of Methods and Applications
- https://s-sahoo.com/mdlm/
- https://www.inceptionlabs.ai/news

Posted 2025-03-07tech article8 minutes read (About 1219 words)

GRPO: The Key Engine Driving DeepSeek's Exceptional Performance

1 2	Paper: DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Source: https://github.com/deepseek-ai/DeepSeek-R1 & https://arxiv.org/pdf/2402.03300

Categories

Archives

Recents

Tags