cdurth 1 day ago

I tried the Fugu models with some real world tales in C# and unity using mcp and open code. I exhausted the $20 plan 5 hour window in one prompt to review my theme system and plan some color changes. So I upgraded to the $100 to see the implementation and result. Well the result was worse than Opus, incredibly slow, and I ended up exhausting the new 5 hour window and have used 35% of the weekly now and it hardly created something opus was able to do at a fraction of the time and cost.

Do what you wish with this info, but it seems to be a complete waste of $$.

  • OsrsNeedsf2P 19 hours ago

    We provide a similar service for Godot instead of Unity, and 20$ plan being exhausted in one prompt on a top model like Opus sounds about right. That's the life when you pay API prices and can't afford 10x subsidies.

    • rtpg 11 hours ago

      Is this true even for more targeted prompts that aren't about looking over an entire codebase or w/e? I just stick to my sub pricing and find good success on targeted requests, but I wonder if I would run up against things even then if not for subsidies.

      • delusional 8 hours ago

        One prompt might be a bit much, but experience shows that $20 is roughly 3 or 4 context windows on GTP-5.5

    • cdurth 1 hour ago

      Not sure if you meant Fable/Mythos instead of Opus, but I can comfortable work several hours a day using Opus on Max-Ultracode on the CC 10x.

  • zzleeper 19 hours ago

    I tested Fable through Cursor; asked for ideas on how to make a data website I have less "Claude-like" (IYKYK what are the usual tells), and it spun out the most useless, Claude-like CSS styling ever, wasting $40 in 10 minutes.

    The website was created through Opus, so you could also say the results were worse than Opus. (This is just to say that I had the same experience using the US models, so perhaps those Asian models are Mythos-like lol)

    • nonethewiser 16 hours ago

      If you actually give it an example of the style it can copy it well. even just screenshots of other websites or UIs. It just sucks at producing it itself.

    • addandsubtract 16 hours ago

      Why would the model know what "Claude-like" is?

      • valleyer 16 hours ago

        Well, in part because the phenomenon has been discussed on Web forums that (a) have at this point made their way back into training data and (b) are accessible in Web searches that the model can invoke. And in part because the model can "know" what its initial instinct is and "decide" to go against it.

        • jazzyjackson 15 hours ago

          Still it’s like asking an image generator to produce an image without an elephant. Just tell it what you want.

          • ninjalanternshk 14 hours ago

            It’s like saying “make this bad animal drawing look better, and like someone else made it” without telling them what animal was supposed to be.

        • ninjalanternshk 14 hours ago

          Yeah this is about the worst way you could imagine to evaluate an AI model.

          If you’d given it a real task you’d have been impressed.

          I was floored by the day I spent with Fable. Got weeks of work done.

          • cevn 14 hours ago

            Same. It was one shotting unbelievably well compared to 4.8.

            • jaapz 1 hour ago

              And 4.8 on xhigh was/is already pretty impressive

          • valleyer 11 hours ago

            Oh, I was also quite happy with Fable. I was just answering the question asked.

    • cheema33 13 hours ago

      > I tested Fable through Cursor;

      I tested Fable for a whole day. And my experience was quite the opposite. I was blown away. Admittedly, I did not try it through a middleman like Cursor. I used Claude Code CLI.

      • josephg 13 hours ago

        Me too. It’s output was fabulous. And it acted like a senior engineer - actually coding up hypotheses, testing them, finding problems and presenting good, usable recommendations backed by solid evidence and wisdom. It can probably do most of my job, which gave me a bit of an existential crisis.

        I’ve paused my Claude subscription until they bring it back. Opus makes mistakes constantly, on every level of abstraction. Every time I look closely at its work I find problems.

        • zeristor 11 hours ago

          Opus 4.8 works like that for me. I have it writing ADRs, then my main architect worker challenging it.

        • gwerbin 6 hours ago

          Even Sonnet does that if you gently prompt it to.

      • solenoid0937 7 hours ago

        I also felt that Fable was mindblowing.

  • Bombthecat 19 hours ago

    Same experience with web search / research, it was bad compared to opus.

    Missed half the stuff the other half was outdated/ didn't verify.

  • hmokiguess 18 hours ago

    I experienced the same exact thing, however, I will say that I had misconfigured it on `pi` at first.

    I was using its chat endpoint not the responses with tool calling and everything, and I haven't tried it again since, learned that recently and I am planning on giving it another shot.

  • zobzu 17 hours ago

    thata what i usually expect. its all as good as the top 3 u til you try em and then theyre.. just ok at best

  • pranaym 16 hours ago

    This is useful info. For the couple of days that Fable was live - it was clearly a step above Opus 4.8 and I was able to get 8-10 prompts in using my $20 plan.

  • LarsDu88 12 hours ago

    Which unity mcp do you use? I've been playing around with the official one, but was wondering what other folks use.

    Ran into a package conflict issue with the popular coplay one

    • cdurth 1 hour ago

      I have been using the coplay MCP for Unity.

  • ezoe 7 hours ago

    Because Fugu is not an independent model. They just use multiple existing SaaS models from OpenAI, Anthropic etc in background, gather response and generate results based on these response.

    They claims that combining the results of multiple AI models and generate final result by using their in-house proprietary model improve the quality than using the single backend model.

    It cause all sorts of doubts like: is their in-house model really exists? Is their in-house model really capable?

    Personally, even if their claim is correct, such feature can be easily implemented in client side like Claude Code etc, using the equally capable model from background model to generate the final result.

    I smell something fishy on their service.

    • GTP 5 hours ago

      I saw something similar on openRouter.

chillfox 18 hours ago

Fugu Ultra [0] is not actually a model, it's a system (harness in the cloud?) that routes to several models, looks like it's a bit like OpenRouters Fusion [1].

  "Rather than a single monolithic model, Fugu is a learned multi-agent orchestration system: a language model trained to route tasks across a swappable pool of underlying models and to recursively call instances of itself." - https://openrouter.ai/sakana/fugu-ultra

[0] https://sakana.ai/fugu/ [1] https://openrouter.ai/openrouter/fusion

devcatapult 19 hours ago

The "Mythos-like" talk is getting kinda annoying. Us normal people have no way to compare it outside of looking at benchmarks

  • atherton33 17 hours ago

    "Mythos-like" just means "hyped via hearsay". It's being used correctly here.

    • resonious 16 hours ago

      It means scores well in common benchmarks.

  • siva7 5 hours ago

    Asian ai startups have also no way to compare despite making bold claims, and one could argue the whole point of the trump intervention was to prohibit them from distilling faible.

  • an0malous 5 hours ago

    And there aren’t even any public benchmarks right?

glimshe 1 day ago

Without reliable benchmarks, they are Mythos-like only in the sense that they accept text as input and produce text as output.

  • theplumber 21 hours ago

    Well if they are hyped like Mythos then we can add that to the list of “like Mythos”. Perhaps what’s missing is their CEO warning the world that their model is too unsafe to be released on the internet and someone must stop them before it’s too late.

  • irthomasthomas 20 hours ago

    They provide benchmarks in the paper https:// arxiv.org/abs/2606.21228

    • khurs 18 hours ago

      Think 'indepenent' was implied when glimshe wrote 'reliable' benchmarks. Needs to be on the usual leaderboards.

  • chrsw 17 hours ago

    I don't even look at benchmarks anymore. I just try different models as they're released on our large, proprietary, systems software codebases in real, shipping products or projects that will ship eventually. It's pretty clear which models help me do my job better or faster. I'm fortunate enough to have the token budget to use basically as much as I need, for now.

    No need for benchmarks, evals, marketing, system cards or anything like that. I read the web for tips, practices and release announcements. My colleagues and I share our experiences with each other but beyond that, everything else is just noise.

    • BlaDeKke 16 hours ago

      This is the way. Not that big of budget here. But if there’s something promising, I just try that for a month or so. But even then… at this moment I’m using z.ai models and those do the job. No need for anything else. So I’m staying until there is something new, same affordability, but a lot better. (Using a coding plan)

    • buthowjejddjeu 10 hours ago

      I am not against AI but I do wonder how you guys handle the fact this leaks all your code and is stored forever on servers belonging to God knows who?

      I “trust” OpenAI and Anthropic (somewhat) but to be honest I still feel only safe using it on code without any secret sauce whatsoever. Luckily that’s a lot of code, but still. I wonder how others are looking at this?

      (FYI I feel the same about Github and we also don’t store our code there)

kingforaday 1 day ago

They have an impressive set of investors [1]. Also, HN Headline [2] from the other day with 100+ comments.

1. https://sakana.ai/company-info/?lang=en

2. https://news.ycombinator.com/item?id=48624782

  • UncleOxidant 19 hours ago

    Has either of these companies released models before this? It's hard to believe that they could release a supposed Mythos-level model just out-of-the-blue. Deepseek, Z.ai, Alibaba/Qwen have been at this for a lot longer and have been releasing models with steadily increasing capabilities for about 18 months now. I find it hard to believe that these new companies would just suddenly release a Mythos-level model without releasing anything prior.

    • pogue 19 hours ago

      How do we define "mythos level" exactly outside of marketing buzz? I don't even think the majority of us can access Mythos yet even to make a comparison.

      • codemog 18 hours ago

        There's benchmarks on their page that directly compare to Mythos. Yes, I already know benchmarks aren't the territory.

    • khurs 18 hours ago

      https://sakana.ai/company-info/?lang=en

      1 x Co-author of the landmark "Attention Is All You Need" paper

      1 x google researcher

      20+ big name investors funding

      Founded July 2023 so been working on it a few years, it may fall short but they have the money and talent to be potentially decent.

      • jsemrau 17 hours ago

        But did they deliver anything yet? There are many startups with notable founders that are not being able to get models launched. And even if they are able to launch, so they get PMF. In case of Sakana, they clearly focus on the Japanese market an have buildup a good pipeline on sovereign AI. But similar to Aleph Alpha or to a certain extend Mistral, I don't see how they can keep up.

        • khurs 17 hours ago

          I think every country/major country at the government level will back any home grown talent in view of USA restrictions.

          The models don't need to be as good, just good enough for the task. I am using a Claude Q3 2024 model mostly (Haiku 4.5) at present, and it delivers what I need.

        • resonious 16 hours ago

          I live in Japan and yet can't seem to pay for their API in JPY... I bet their enterprise customers don't have that problem but it was pretty annoying given "AI in Japan" appears to be there only selling point.

          • jsemrau 12 hours ago

            The only way they can compete in Japan is the enterprise-game. Their partnership with Daiwa and MUFG is probably exactly what they should be doing. I doubt that they get that far though. Like Mistral who has partnerships with BNP Paribas and Airbus, they deploy in an on-premise or private cloud and in those settings, their models make good PoC's but cant compete where it gets interesting -- Workspace Agents. If you look at them from a Chatbot over RAG perspective, maybe they can do it. But that's tech from 2023.

      • siva7 5 hours ago

        That paper had many co-authors and only two who were considered to be the primary drivers and geniuses behind (and they are american citizens).

    • Spooky23 17 hours ago

      Mythos is extreme hype. We are at a combo of authoritarian politicians peddling fear for power and tech bros trying to extract maximum investment returns.

      We’re going to have many LLM/toolset combos that do what mythos does.

firefoxd 19 hours ago

I'm expecting a ban of "foreign" llms due to "safety concerns" before the year is over.

It will have nothing to do with the actual performance. But anthropic has set the bar for mythos-like systems, and whatever meets that loosely defined bar will be unsafe for the public.

  • neom 18 hours ago

    How would that work in practice?

    • firefoxd 18 hours ago

      All American services won't be allowed to provide the models. Huggingface for example. The same way BYDs are illegal in the US.

      • esikich 17 hours ago

        You wouldn't VPN a car

        • esafak 17 hours ago

          And your company wouldn't VPN a banned model.

          • addandsubtract 16 hours ago

            The same way a company wouldn't torrent books?

    • root_axis 12 hours ago

      If you host it or are caught running it locally, then you go to jail.

      Since it's a national security issue, they'll be inclined to enforce aggressively.

      • winrid 12 hours ago

        Next we'll be putting smart people in jail for being too dangerous.

        • buthowjejddjeu 10 hours ago

          You jest, but clever folks with “dangerous knowledge” have to deal with pretty hefty guardrails.

        • an0malous 5 hours ago

          A tale as old as time

          > Despite his research, Semmelweis's observations conflicted with the established scientific and medical opinions of the time and his ideas were rejected by the medical community. He could offer no theoretical explanation for his findings of reduced mortality due to hand-washing, and some doctors were offended at the suggestion that they should wash their hands and mocked him for it. In 1865, the increasingly outspoken Semmelweis allegedly suffered a nervous breakdown and was committed to an asylum by his colleagues. In the asylum, he was beaten by the guards. He died 14 days later from a gangrenous wound on his right hand that may have been caused by the beating.

  • teliosix 9 hours ago

    As if there is no actual risk as the models progress, give me a break.

    I seem to live in an alternate reality that Claude is the only thing that seems to ever say anything smart at this point.

    Almost everything I read from humans is a hallucination and performative bullshit.

  • realusername 8 hours ago

    Well, goodbye to US tech and welcome China then I guess, nobody will wait and you can't put the genie back in the bottle.

GTP 19 hours ago

My cinic take is, if the model is decent it would be hard to disprove their claim of it being Mythos-like, since now Mythos is unavailable.

  • ai_slop_hater 18 hours ago

    What is Mythos like? Asking as someone who never had access to it.

    • p1esk 18 hours ago

      It’s got to be similar to Fable, which I experienced for 3 days, and which impressed me (compared to Opus 3.8)

      • SOLAR_FIELDS 17 hours ago

        It is materially better, but I didn’t feel a huge loss when it was yanked. My use case is large complex legacy modernization projects and its ability is definitely better than opus 4.8 at the job. But it’s more like an optimization, I could have a single or 2 pass in fable vs 8-10 with opus to arrive at the same solution.

fwipsy 1 day ago

First impression: Third-party benchmarks or gtfo. Personally, I've never heard of either of these companies before. We're just supposed to take their word that they've matched the best models on the market?

Sakana describes their model as a "Orchestration Model." Does that mean that it's actually a bunch of different models glued together?

  • OutOfHere 1 day ago

    Did Anthropic give you third-party benchmarks? Is that what you said to them? Yes, they're important, but the attitude is wrong.

    • bloppe 1 day ago

      Anthropic always publishes 3p benchmarks every time they announce a new model

      • MostlyStable 1 day ago

        And even if they didn't, they have a track record. Even if we did have benchmarks in this case I would still wait until people got there hands on it and formed a more holistic opinion.

      • OutOfHere 18 hours ago

        No, stop right there. Anything published by Anthropic implicitly is not third party. For it to be third party, the third party has to be the one publishing it.

    • fwipsy 23 hours ago

      Fudging benchmarks is a cheap way to get attention. If the model is really that good, it will have plenty of attention soon enough.

      • greenavocado 23 hours ago

        Yeah, what happened to that scam startup that alleged to have made a model context window breakthrough a few weeks ago?

  • lifeformed 1 day ago

    Is it actually that hard to make good models or is it just about the amount of resources you have to do training? (This is an actual question, I really don't know.) I'm sure it's not trivial but does it really take world class secret knowledge to build off of the known existing techniques? I feel like there's tons of low hanging fruit still to explore, and time and resources are the limiting factor.

    • MostlyStable 1 day ago

      The gap between grok and Gemini to Claude and chatgpt suggests that yes it is that hard.

      • arw0n 22 hours ago

        I suspect that Grok has been ironically lobotomized by pressures to correct its political views.

        Similarly, I could imagine the Gemini folks working in a significantly more complex corporate climate, with different parts of Google pushing for different capability focuses. They are only lagging behind less than a year, so it isn't too large of a gap yet.

        That said, the fact that Anthropic is currently the top dog suggests that talent and execution is incredibly important. A year ago none of my normie friends new them, and when i suggested using Claude looked at me like when I recommend Linux.

        • janalsncm 21 hours ago

          That shouldn’t affect Grok’ coding ability. How often are people discussing politics with Claude code? Writing decent code is just hard and it’s not just Grok.

          • bwhiting2356 21 hours ago

            It affects their ability to hire and retain talent.

            • janalsncm 20 hours ago

              If training a good model requires talent then that’s the answer to the question this thread is trying to answer: is training a good model actually that hard?

              • buthowjejddjeu 10 hours ago

                Talent to do.. what? This could mean a lot of things.

                Navigating astronomically huge fundamentally not so hard but still really tangly and hairy projects requiring both excellent short- and long-term vision in an overheated domain with angry people and lots of money is a skill all of its own.

          • black_knight 20 hours ago

            Why would these be independent?

            • janalsncm 20 hours ago

              More specifically, political lobotomy shouldn’t affect coding ability.

              • girvo 19 hours ago

                You’d be quite surprised, I think. Fine tuning a model on one axis can have drastic impacts on another that as a human we would expect to be completely unrelated.

              • Hamuko 19 hours ago

                It's all a bunch of weights isn't it? Why wouldn't fiddling with some parts of the weights have cascading effects?

          • thot_experiment 19 hours ago

            Not true, aggressive post training makes models notably dumber.

          • KaiserPro 10 hours ago

            > That shouldn’t affect Grok’ coding ability.

            If you are spending all your time having to re-train because the boss doesn;t like the output, it will hamper coding

        • Oreb 18 hours ago

          > A year ago none of my normie friends new them, and when i suggested using Claude looked at me like when I recommend Linux.

          Isn’t that still the case? Normies haven’t even heard about Claude, in my experience.

          • buthowjejddjeu 10 hours ago

            In my experience it has improved a bit, but 90% of my normies still have no idea. (It was 100% before)

      • khurs 18 hours ago

        All of the 11 grok co-founders alongside Elon quit: https://techcrunch.com/2026/03/28/elon-musks-last-co-founder... so that will have hampered grok.

        Like Zuckerberg, top talent may not work with a polarising character if they disagree with his behaviour. Space focused talent don't have many choices aside from SpaceX but ai companies are a plenty and a top AI person can pick and choose.

        • MostlyStable 18 hours ago

          The fact that you need top talent also suggests that it is indeed that hard

          • IshKebab 10 hours ago

            I dunno, if most of the top of a company quits it's extremely disruptive even if everyone else in the company is competent.

    • fwipsy 1 day ago

      Not hard to be a fast follower. Lots of companies are ~6-9 months behind. Reaching the actual bleeding edge is much harder.

    • khurs 18 hours ago

      >Is it actually that hard to make good models

      Didn't take DeepSeek long. Or XAI to launch grok.

      If they have a top team and the money then appears to be a matter of a year or two? And one startup mentioned is Japanese not Chinese so they won't be banned from buying US tech.

  • Ifkaluva 1 day ago

    Their release post was on HN recently. The comments seemed to think that it was similar to OpenRouter, not an actual model.

cloudengineer94 11 hours ago

Just like many comments have been saying here, I also tested Fugu and some others and what I noticed is that they are quite expensive models, 20$ is not enough to complete a full workflow which in Opus it's possible, sure you might need to improve your prompt from the get go with Opus if you want the best results but so far that's my experience.

My next test will be Agentic systems and see how they perform

lelanthran 1 day ago

Feels like I need to repeat myself more than once a day now: https://news.ycombinator.com/item?id=48697258

> These companies providing tokens, whether SOTA or not, that want to IPO are so fucked as time goes on.

>Can't sell their SOTA models, only slightly better than the open source models for the models they can sell, cost 20x to 50x for good models, a TAM that consists almost solely of developers, with no customer of theirs actually boasting increased profits as a result of AI...

> I fear their time to IPO may have passed.

What on earth could Anthropic and OpenAI Pivot to now?

  • outside1234 1 day ago

    Propaganda? Pay for “facts” to be placed in the model?

  • fassssst 1 day ago

    > a TAM that consists almost solely of developers

    That’s the wrong assumption. These models are good at office docs too.

    • airstrike 1 day ago

      They're passable at those. And still no moat.

    • dgellow 1 day ago

      But you can do office docs work with way cheaper models

    • AndrewKemendo 1 day ago

      I have yet to see a model that can make a consistent and repeatable powerpoint deck that doesn’t need considerable manual revision

      Find me someone who is putting raw text in and getting out a usable weekly staff meeting deck that doesn’t require massive revisions

      • yggy 1 day ago

        I agree but why is that?

        Let’s face it - without the humans these machines ain’t shit - aka we have mechanically figured out ways to make machines better than us at certain things (on demand memory) but this idea they are intelligent is horse shit.

        Btw the bar is low too! Most human created decks are garbage. And yet LLM’s don’t even beat those.

        • AndrewKemendo 16 hours ago

          Unfortunately matters of taste like design aren’t as easy to specify

          • yggy 13 hours ago

            I doubt they’ll ever be.

            Machines have nothing to do with what speaks to the human - at best they are media to transmit what a human wants to express.

            Having all the data in the world doesn’t solve that. Although if enough of society gets exposed to slop and there is an erosion of taste and quality… in fact I’m gonna stop there. It’s a dark path.

      • buthowjejddjeu 10 hours ago

        It’s hard not to be cynical as a dev. Every time I see a non-dev messing around with tooling I feel pain and discomfort.

        You can bet your ass I can make AI make a consistent and repeatable powerpoint deck.

        Problem is a) the need to produce this “deck” is questionable, at best and b) thinking in “powerpoint” is severely lobotomizing and limits you and your company in ways I find hard to even enumerate.

        The exact same bullshit is going on with “word”. People can’t separate content from form apparently and that’s a debilitating disability no AI can fix. Creating AI to help you with “word” documents is an infinitude of stupidity so dark and so deep it makes my eyes water and fear for the future of humanity.

        All I can say is imagine you do NOT have “powerpoint”. Let’s say all you have is a whiteboard with a 10% pen so you have to keep it short. What do you intend to communicate? Drill down and stop thinking in “powerpoint” and start thinking in content.

        Content and form are orthogonal, repeat after me.

        Once you have a stable and consistent information schema you can create “decks” till the cows come home and they’ll all be great. You won’t even need special AI for it in all likelihood. If you do not have a firm handle on the payload itself you’ll be forever tangling with “fonts” and “charts” getting none the wiser.

        Oh man ya’ll are going to hate me so much so I’ll just spew further so I can get along with my day.

        Imagine REST APIs being built around button designs, serialized CSS in business objects, in short imagine completely mishandling information and tanging it up with vague social indicators like “looks like he did work” and “this looks like an executive would be impressed”. Businesses are peddling proxies instead of raw information and for the love of God I can’t figure out why. AI or no AI this is a self-inflicted mutilation of such epic proportions I find it hard to imagine AI even budging the needle in such a low-density low-value environment.

    • lelanthran 1 day ago

      > That’s the wrong assumption. These models are good at office docs too.

      The cheap models handle that very well. The SOTA models still only have target TAM of developers only.

      You only need SOTA for development. The $1t investment is in SOTA companies.

  • clusterhacks 1 day ago

    I used to agree with you but now do not. I now think the floor for this market is probably no worse than the annual revenue of cell phone plans in the US market. So say, $250 billion.

    Now, that probably doesn't justify the valuations and hype being thrown around, but I think it gets at a real revenue number.

    I also don't know how that number fits into the funding rounds already raised and VC dreams of IPOs for these two.

    This isn't coming from deep analysis on a verifiable source, but I started asking people in my social circle (includes white-collar and blue-collar folks) about their LLM use. The biggest surprise in 2026 for me was that almost all of these people told me about regular (and sometimes sophisticated) use.

    A more intriguing observation - I work on the side with high school students and have two college kids of my own. Their LLM usage (and their peers) is much, much lower than expected . . . that's a little counterintuitive given "popular" perceptions I read.

    • lelanthran 1 day ago

      > I used to agree with you but now do not. I now think the floor for this market is probably no worse than the annual revenue of cell phone plans in the US market. So say, $250 billion.

      I don't think we're talking about the same thing. I'm talking about what their IPO is going to do to their share price.

      In any case, $250b revenue translates to, best case scenario, $50b profit. On an investment of $1t. It does not look good for those companies making up the $1t investment.

      • clusterhacks 1 day ago

        Gotcha. I'm past the point of having any confident thoughts about what happens to their share price at IPO.

        What about the idea that there is a high likelihood that the potential share price for OpenAI and Anthropic are both going to be pretty divorced from a rational market price for either?

    • throw310822 20 hours ago

      Interesting idea and reasonable number, but cell phones need a lot of infrastructure and they need interconnection. The risk here is that in the future a combination of near-sota open weights models optimised to use as little resources as possible and a reasonable drop in compute price, will make possible for small and tiny providers to compete with Anthropic/ OpenAI or even for people to run their own private models for most applications. Then large, expensive sota models would only be used for research and to answer the small subset of general user queries that need that kind of intelligence.

  • a34729t 20 hours ago

    The open models are half the equation. The other half is Apple's hardware, which is likely to see major memory bandwidth improvements over the next 2-3 generations and will be capable of running substantial models locally. By that point the open models will be beyond today's SOTA.

  • khurs 18 hours ago

    >I fear their time to IPO may have passed.

    They may not get the valuation they want, but as it appears to be on a plateau may be better to offload now?

    As per SpaceX, so many big names are involved the media will be controlled to hype it up and the investment banks will forecast 100x revenue in 2 years...

  • neumann 14 hours ago

    I agree with everything you said about their situation, but it's not like that is what will be evaluated in an IPO. There will be continued hype by the companies, lobbying to win support of a corrupt administration, and a narrative spin by clueless media about this AI revolution that will give investors fomo.

matheusmoreira 18 hours ago

I doubt it will rival Mythos or the upcoming Sol, and if it's not open weights it doesn't really matter in the grand scheme of things. Still, I applaud the asian LLM efforts and hope they keep up the pressure on the americans.

dev_l1x_be 9 hours ago

GLM produces pretty decent websites.

terekhindc 8 hours ago

if fugu really is an orchestrator dispatching to opus/gpt under the hood (as the openrouter page suggests), the $20-in-one-prompt complaints actually start making sense — you're paying api markup twice.

sheeshkebab 14 hours ago

unless they launched 10t param models, or figured out some amazing new way to compress as many params into say 100b, I doubt it's anywhere near "mythos level". and I have no idea how many params mythos has but that was just some hear say.

Hasan121212 14 hours ago

Competition is accelerating, but the next breakthrough isn't just better models it's better connectivity. AgentKey bridges AI agents with real-world tools, APIs, and data.

qsxfthnkp2322 1 day ago

So now as a regular American we are behind because gatekeepers saying super intelligence is too scary

It was bound to happen soon.

  • microgpt 1 day ago

    People who are shielded by walls are always surprised when the same walls shield the people outside from them

  • lagrange77 1 day ago

    It is scary.

    • w4yai 1 day ago

      It is not. Where's the danger ? We will need to adapt, as in every technology progress, but what do you think will happen ? Realistically ? Don't feed the fearmongering. Yes, we're disrupting the status quo, if that's the danger, then welcome to the world.

      • Certhas 1 day ago

        Actually the real danger is mass labor market disruption, and a massive shift of power from labour to capital.

        As was highlighted in previous discussions, the industrial revolution took 80 years to start benefiting workers. The continued impact of automation at least contributed to the rise of right wing extremism and an erosion of democracy all over the west. Now we face a development that has the potential to be faster than those that came before, in the context of political systems more fragile and worse equipped to manage the change.

        So yeah, disrupting the status quo can absolutely be dangerous. It has been dangerous (and deadly) in the past and in the present.

        • w4yai 1 day ago

          > the industrial revolution took 80 years to start benefiting workers

          Come on. This is dishonesty and isn't the reality. We may agree that the Industrial Revolution may have taken decades (certainly not 80 years) for its benefits to be *clearly and widely* felt by workers, but anything further is an abusive claim. So what, because the progress doesn't benefit to workers instantly, we shouldn't do it ?

          In the end, whatever your position, industrialization eventually raised living standards. So what's wrong with that ?

          > The continued impact of automation at least contributed to the rise of right wing extremism and an erosion of democracy all over the west

          This is oversimplifying and correlation at the best, not causation

      • h26d3r 1 day ago

        Any superintelligence operating under a consistent moral framework will decide to extinguish humanity with as little ecological damage as possible, because humans cannot coexist with other life on this planet. It will realize that a bioweapon is the ideal choice.

        • dragonwriter 1 day ago

          > Any superintelligence operating under a consistent moral framework will decide to extinguish humanity with as little ecological damage as possible, because humans cannot coexist with other life on this planet.

          There are plenty of internally consistent moral frameworks which would not favor this action even if the premise were true (and that premise seems at best unjustified and at least overstated.)

        • victorbjorklund 20 hours ago

          That doesn’t make sense. You could make the same claim about intelligent people who operate under a consistent moral framework.

      • lagrange77 22 hours ago

        > Where's the danger ?

        It's not one single danger, its a can full of dangers in various domains. And in contrast to other dangerous technologies, we are talking about one that has the potential to self improve. This smells like exponential growth doesn't it? Exponential growth is something we are not very likely to adapt to successfully, even if you say we are supposed to.

        But before you complain, here are just a few concrete dangers, that come into my mind right now:

        - mass layoffs in a system that is far from being prepared for sth like that. (no UBI)

        - a Mr. Robot tier blackhat at the fingertips of every teenager in their mom's basement in a software landscape that is far from being prepared for sth like that. Side note: Big parts of the world including critical infrastructure runs on software.

        - because it automates more and more intellectual work, it can cause mass brain atrophy, which isn't a hopeful sign for the human branch of the evolution

        - increasing dependence on a technology, that is in the hands of those with capital.

        And to the OP: this technology has potential implications that are far beyond 'being behind' other nation states economically.

  • cultofmetatron 21 hours ago

    SOTA AI is the only place where we are ahead. china has us beat on manufacturing, logistics and workhorse grade models like deepseek pro and glm5.2.

    we're increasingly irrelevant

    • verdverm 19 hours ago

      they are also generating >2x the amount of electricity, core to ai, robots, and manufacturing

      • cultofmetatron 10 hours ago

        no kidding. I learned last week that they have intercontinental high voltage DC transmission lines.

    • yggy 13 hours ago

      Brand too.

      American firms are toxic - there’s an implicit gamble they are all making.

  • blurbleblurble 18 hours ago

    intelligence has always been a threat to the idiocracy

zkmon 1 day ago

I think it is time that we had a UN-sponsored standards body dedicated to bench-marking the newest models from around the world, for everyone's benefit.

throwatdem12311 17 hours ago

wtf even is “mythos-like” when smaller models can find all the same kinds of issues if you just prod it a bit more

w4yai 1 day ago

Excellent. I'm very thankful the asian/chinese don't give a fuck about the US government. It feels good to have a competitor.

jdw64 1 day ago

Where can I get the API?

ottotarc 1 day ago

Given the national security implications, it's no surprise that Japan and China are rushing to build sovereign models post-ban. But when these startups claim parity with "Mythos," could it be that they are just optimizing for very specific inference tasks? I wonder if we are seeing the real battleground shift from raw training scale toward specialized inference.

visha1v 1 day ago

asian is bad wording. this is a japanese startup backed by khosla ventures. japan is an ally of west. the title makes it sound like a chinese company did this.

  • mksreddy 1 day ago

    The article talks about 1 Chinese and 1 Japanese model.

    • khurs 17 hours ago

      Should have said East Asia.

      As I clicked expecting India from the title (as China are already there and India has so many devs surprising they haven't yet).

      • ihateolives 13 hours ago

        Are you from UK perhaps? Because UK is the only place I know where "asian" also includes India and Pakistan. Everywhere else India is sort of separate entity and always mentioned separately.

        • gnabgib 13 hours ago

          You don't get out much? (in alpha) Asia, Australia, Canada, Europe, Middle-East (sorry, that's a western term) all include west Asia in the definition.

          • ihateolives 12 hours ago

            I'm from Europe and never hear indians clumped together with chinese as "asian". When speaking of geography, then yes, but that's about it. It's always "Indian startup", never "Asian startup" for example. Indian food, not Asian food etc. YMMV

            • defrost 12 hours ago

              The UK is part of Europe despite leaving the EU (which is of course not equivalent to "Europe").

              In the UK it is common for the British to refer to both Pakistanis and Indians as Asian and not uncommon for the Chinese to be lumped in the Asian category as an umbrella term (and also to highlight someone or some company as being specifically Chinese).

              I guess it really comes down to habits in particular parts of Europe and which parts of Europe you travel in.

            • N_Lens 11 hours ago

              Same in Australia. India/Bangladesh/Pakistan is specifically referred to as South Asian.

          • khurs 6 hours ago

            >You don't get out much?

            Now, now. No need to be rude!

        • khurs 7 hours ago

          yes!

          In Uk asian = south asian. As many decades ago we had large migration from those countries and less of others.

  • colordrops 1 day ago

    Is that really the most sailent facet of this story? Boxing it by official friend vs foe designations? Don't american academic institutions and corporate entities cooperate closely with Chinese companies as well?

    • WarmWash 1 day ago

      The US and China are in a cold war right now, whether that is fully recognized or not, the fight has already begun. The US is blocking models from getting out of the country and China is blocking researchers from getting out of the country. The expectation should be only more closing off in the future.

  • threethirtytwo 1 day ago

    Patriotism makes people biased. Better to not hold an identity in this area.

    • exidex 23 hours ago

      People are biased by definition

      • threethirtytwo 21 hours ago

        I’m talking about biases as a noun.

        People have many biases. Patriotism is one form of bias. By having no identity you cannot have that form of bias.

  • vcryan 1 day ago

    We are all people. This ally-of-the-west framing is propaganda. Who has harmed me more: this US or China? Who do I have more in common with: a tech worker in China or a US government official?

    (I'm based in US - I use the best tech for the task).

    • JumpinJack_Cash 17 hours ago

      WHen push came to shove Trump backed down on Jan 6th / the institutions held their ground.

      On the other hand Xi is President for Life and absolute autocrat of China.

      This answers your question in a pretty compelling way.

      • deaux 16 hours ago

        It doesn't answer their question at all, let alone compellingly. You're pretending to engage while completely refusing to do so.

skeledrew 1 day ago

YES! Now things become even more interesting. US, your move.

  • khurs 18 hours ago

    Let's wait till some independent benchmarks appear.

    But encouraging for Japan to announce competition along with China.

prng2021 1 day ago

[flagged]

  • amarcheschi 1 day ago

    Anthropic just stole the internet and put it in a transformer and pat itself on the back for it - well no to be honest we have to suffer through hearing them saying that this model is really really dangerous until they got a reaction for they fear mongering

  • itsdesmond 1 day ago

    [flagged]

    • renoir 1 day ago

      This exactly.

      YC companies literally steal competing company 1:1 and you turn blindeye.

      Then a thief steals from a thief to give it out at better prices than you write low quality comment.

      Shame that America will greet 250th anniversary with this kind living in it.

    • prng2021 23 hours ago

      Thanks for the irrelevant comment. I’m criticizing these Chinese companies because these aren’t accomplishments. Where did I praise Anthropic?

  • visha1v 1 day ago

    they mined the internet first. now they’re upset someone brought a shovel.

  • TheGoddessInari 1 day ago

    Both of the mentioned models are model orchestrators using a vastly different multi model paradigm.

    Saying they in particularare distilled from Anthropic is really [citation needed].

  • nullbio 1 day ago

    [flagged]

    • ceejayoz 1 day ago

      > Now no one can access ChatGPT 5.6 because of their 5 year long fearmongering regulatory capture campaign.

      I'm sympathetic to this arguement, but it's silly to ignore the other half; that the administration has openly feuding with them for months over limits to military capabilities.

      https://www.politico.com/news/2026/06/04/hegseth-anthropic-d...

      • nullbio 1 day ago

        No one is ignoring the other half, the feud is rooted in Anthropic's insatiable desire for power and control over everyone and everything, including the administration. The same desire that is fueling the strategic fearmongering campaign underpins all of their behavior and the repercussions and sentiment they're facing from the administration and the general public.

        If their company hadn't been posturing like this for 5 years they'd have played ball with the administration like all of the other AI companies and they wouldn't have caught all that heat and taken down the AI industry with them. Just remember that Dario was pushing the narrative that GPT 2 was too dangerous to release to the public, while he was working at OpenAI. GPT 2!

        Now it's an inevitability that China takes the lead - which was probably the case anyway, but a certainty if this continues.

  • Alifatisk 1 day ago

    I seriously do not comprehend how a consumer like you can have sympathy for Anthropic, as if you are part of their organisation or something. Competition is good for us. Wouldn't it have been for asian labs, we would would be fully dependent on OpenAI, Anthropic and Googles services.

  • dang 21 hours ago

    Please don't take HN threads into flamewar hell. It's not what this site is for, and destroys what it is for.

    The idea here is: if you have a substantive point, make it thoughtfully; if not, please don't comment until you do.

    https://news.ycombinator.com/newsguidelines.html