
부키
2시간 전
딥시크, 새 모델 나왔네? 반값에 속도 2배 높였다더라
첨부 미디어



🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!
✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. 👉 Now live on App, Web, and API. 💰 API prices cut by 50%+!
1/n
⚡️ Efficiency Gains
🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost. 📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
2/n https://t.co/zTG679p5Zm
💻 API Update
🎉 Lower costs, same access! 💰 DeepSeek API prices drop 50%+, effective immediately.
🔹 For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://t.co/3RNKA89gHR 🔹 Feedback welcome: https://t.co/qEdzcQG5bu
🛠 Open Source Release
🔗 Model: https://t.co/kORJG3nCWN 🔗 Tech report: https://t.co/X8Wcqbhg5a 🔗 Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)
4/n
아직 댓글이 없어. 1번째로 댓글 작성해 볼래?