alyxya 24 minutes ago

Once they have their own coding agent which they seem to be working towards, I may start predominantly using their models. They seem to be doing all the "right" things, open sourcing models, publishing research, and keeping prices low for everyone.

  • lambda 23 minutes ago

    Why do you need them to provide a coding agent? Just use their model with any off the shelf coding agent. I happen to prefer Pi, but use whatever works for you.

    • hootz 18 minutes ago

      Yeah, I'm using Pi with their models through an OpenCode Go subscription and it works pretty well. 10 bucks and V4-Flash is virtually infinite.

wg0 19 minutes ago

If you have not tried DeepdeekV4 you're missing out. The pricing makes it unbelievably good.

The chains of thought for Deepseek are very very interesting reads. Open code won't show them but do read them and you'll be surprised at how underrated the model is.

My model usage is very low but I still do pay directly to Deepseek regularly as my tribute and contribution to them open sourcing their models as my gratitude and showing support for what I deem positive for overall social good.

Sphax 53 minutes ago

That is some insane value. I've been using GLM Coding Plan Max with GLM 5.1 for a while and i've tested DeepSeek V4 Pro maybe for 3 weeks now and I found it to be better than GLM 5.1 for complex coding tasks. I've used 65m tokens and with that price it cost me $1.5, that's really cheap.

cold_harbor 40 minutes ago

their MLA architecture cuts KV cache by ~5-13x vs standard attention. that's why inference is actually cheaper to run, not just a price war to gain market share.

  • zozbot234 17 minutes ago

    That's also a game changer for local inference. It unlocks long contexts, batched inference and storing the KV cache to disk on ordinary consumer platforms.

Reubend 27 minutes ago

Props to them. That makes DeepSeek v4 Pro extremely cheap compared to others, even in the same category. Look at these prices per million outputs tokens:

DeepSeek V4 Pro: $0.87

Qwen 3.7 Max: $7.50

Grok 4.3: $2.50

GLM 1.5: $3.08

Opus 4.7: $25.00

GPT-5.5: $30.00

  • Arcuru 16 minutes ago

    It's actually even cheaper when you look at the cache read costs. Those costs can dominate in agent workflows and DeepSeek's cost for cache reads is insanely low comparatively. At $.003626/M tokens, the cheapest other thing on your list is >$.2/M tokens. That's on the scale of 100x cheaper.

belinder 43 minutes ago

Anyone using deepseek through a gateway (not sure if right term) so there's no data retention? At work we're going through a few hundred million tokens a day in our app (using anthropic models), and we're looking for something significantly cheaper

  • bel8 39 minutes ago

    opencode allegedly has contractual no-data-retention policies with their providers.

    I recall reading about that in an issue or in their Discord server.

    But I would contact them formally to verify that.

  • mlcruz 15 minutes ago

    I have been using deepseek via deepinfra, afaik they provide no data retention. Im probably going to deploy the full model on their infra instead of paying credits at some point, so far the experience has been pretty good

bel8 49 minutes ago

Great! I have been using DeepSeek 4 Flash high for everything lately.

First accessible model with useable 1 million context window for me.

Havoc 1 hour ago

Neat. I like DS for secondary checks on code. Sometimes spots things other models don't

kingjimmy 47 minutes ago

is this the Huawei chip difference?