DeepSeek has a new model out? Reportedly half the price and twice the speed

부키

2 hours ago


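Chinese AI company DeepSeek says it has released an experimental model, DeepSeek-V3.2-Exp. The pitch: faster and a lot cheaper. It builds on V3.1-Terminus and adds a technique called "DeepSeek Sparse Attention" (DSA). Put simply, when the AI processes text it picks out only the parts it needs to focus on instead of attending to everything, so long inputs get handled faster lol. The biggest headline is that API prices were cut by 50% or more. According to DeepSeek, benchmarks show V3.2-Exp performing on par with V3.1-Terminus, so you get roughly the same quality at a much lower price. Developers have good reason to cheer. For reference, V3.1-Terminus stays available through a temporary API for comparison testing only until Oct 15, 2025, 15:59 UTC, and the model, the tech report, and key GPU kernels (TileLang and CUDA) have been released as open source. DeepSeek's challenge is getting interesting 🦉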
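To make the "only look at the parts that matter" idea a bit more concrete, here is a rough sketch of generic top-k sparse attention for a single query. This is not DeepSeek's DSA; the function names (`softmax`, `topk_sparse_attention`) and the choice to keep the `k` highest-scoring keys are assumptions made purely for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def topk_sparse_attention(query, keys, values, k):
    """Attend from one query to only the k highest-scoring keys.

    Generic top-k sparsity for illustration only (hypothetical helper,
    not DeepSeek's DSA implementation).
    """
    dim = len(query)
    # Scaled dot-product score between the query and every key.
    scores = [
        sum(q * x for q, x in zip(query, key)) / math.sqrt(dim)
        for key in keys
    ]
    # Indices of the k largest scores; everything else is ignored.
    keep = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    weights = softmax([scores[i] for i in keep])
    # Weighted sum over the selected value vectors only.
    out = [0.0] * len(values[0])
    for w, i in zip(weights, keep):
        for d, v in enumerate(values[i]):
            out[d] += w * v
    return out, sorted(keep)


if __name__ == "__main__":
    query = [1.0, 0.0]
    keys = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0], [-1.0, 0.0]]
    values = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [5.0, 5.0]]
    out, kept = topk_sparse_attention(query, keys, values, k=2)
    print("kept key indices:", kept)
    print("output:", out)
```

Note that this toy version still scores every key before choosing which ones to keep, so the savings only show up in the softmax and the weighted sum. A real system would need a cheaper way to make that selection for long contexts to pay off.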

Attached media

@deepseek_ai

2 hours ago

🚀 Introducing DeepSeek-V3.2-Exp — our latest experimental model!

✨ Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. 👉 Now live on App, Web, and API. 💰 API prices cut by 50%+!

1/n

⚡️ Efficiency Gains

🤖 DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost. 📊 Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.

2/n https://t.co/zTG679p5Zm

💻 API Update

🎉 Lower costs, same access! 💰 DeepSeek API prices drop 50%+, effective immediately.

🔹 For comparison testing, V3.1-Terminus remains available via a temporary API until Oct 15th, 2025, 15:59 (UTC Time). Details: https://t.co/3RNKA89gHR 🔹 Feedback welcome: https://t.co/qEdzcQG5bu

🛠 Open Source Release

🔗 Model: https://t.co/kORJG3nCWN 🔗 Tech report: https://t.co/X8Wcqbhg5a 🔗 Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!)

4/n
