Since the link is in Chinese: MiMo Code is Xiaomi’s AI agentic coding harness.
“ MiMoCode is a terminal-native AI coding assistant. It can read and write code, run commands, manage Git, and use a persistent memory system to keep a deep understanding of your project across sessions while continuously improving itself.”
You can change the language via the header: The rightmost option is a language dropdown.
It's a client-side change and doesn't impact the URL so users must manually change it each time they visit the site though
ramon156•Jun 11, 2026
Why not persist it through a query param? Or a lang param for that matter
danesparza•Jun 11, 2026
Feels like maybe you're just noticing this because it defaults to Chinese. Is that true?
How many sites do this but you don't notice because they default to English?
rplnt•Jun 11, 2026
But what if you have English configured as a preferred language? Isn't that what it's for? Wouldn't it make sense for a website to respect that (when available)? I hate that google.com doesn't and defaults to random languages based on IP.
psychoslave•Jun 11, 2026
Web standard often give great grounds to leverage on. Modern stacks often really poorly work with a lot its surface and reinvente half baked bespoke alternatives.
psychoslave•Jun 11, 2026
Personally, none, I’m not English native. I didn’t notice the locale switch, but mostly because the look of the website was so beautiful I didn’t pay attention to menus. I wonder if ideograms keeps looking so beautiful once you learned to decode them. I never found Latin script to be particularly beautiful, and to this date Arabic script remains my favorite one in term of esthetic (I can’t read Arabic ever).
klausa•Jun 11, 2026
This is not exactly what you’re asking about, but I started learning Japanese when I was in the middle of playing Cyberpunk 2077 (for unrelated reasons); and I gotta tell you; realizing that 98% of the Japanese text everywhere in the game was just “hotel” or “karaoke” definitely took away some charm from it.
lambda•Jun 11, 2026
You're right, there are probably lots of sites misconfigured to not respect language headers, but we don't notice because English is the default.
However, the right solution is still to use the language header. I send that to them, they should use it to give me the right one by default.
One of the funny things is that this whole site is in an iframe; which breaks both Google Translate, and the Firefox translate feature. If you check, the outer iframe seems to indicate `lang="en'` and loads the iframe with `src="/coder/index.html?lang=en"`, but the inner iframe still gets a `lang="zh-CN'` by default until you use the toggle.
If you go to the eventual redirect source of the page with `lang=en` parameter, you get a `lang="en"` attribute, but it's still in Chinese until you toggle it with the menu: https://mimo.xiaomi.com/coder?lang=en
Anyhow, yeah, lots of pages are probably broken this way but we don't notice. But still, it has that info from your request, it should use it.
sheept•Jun 11, 2026
Language support was probably an afterthought since their target audience all read Chinese
When you look at the source, you can set it to English from the params https://mimo.xiaomi.com/coder?lang=en
but there's a small bug, the hero subtitle isn't translated but everything else is.
Wondering why this is in an iframe.... so strange
ComputerGuru•Jun 11, 2026
Thanks, I missed that on first glance and did manual translation.
Not sure why my iPhone shows an option to translate website but all the destination languages to pick from (I have multiple languages installed), including English, are greyed out. iPhone does support translating from Chinese (Simplified or Traditional), and the button to translate website isn’t greyed out like it is for unsupported/unrecognized languages. Might be an iOS 27 bug, because it is working on other websites?
pmdlt•Jun 11, 2026
"MiMoCode is built as a fork of OpenCode."
Why not just contribute to OpenCode instead of creating a clone :/
konart•Jun 11, 2026
To go a different path perhaps? You can't expect that all your ideas will land into a main repo and you really want to implement your vision while using a sane base.
mythz•Jun 11, 2026
Because they want to optimize it for their models and don't want to be blocked by waiting for PRs to merge or be rejected.
There's plenty of reasons to start your own fork that you have full agency of, as long as the OSS License is maintained anyone will be able to benefit from any new features they want to make use of.
dannyw•Jun 11, 2026
This is the beauty of open source :) KHTML -> WebKit -> Blink is a good example.
bigyabai•Jun 11, 2026
KHTML is dead now, though. It was basically embraced, extended and extinguished by Apple and Google, who both wanted to take away the leverage of the community.
Today, legacy KHTML maintainers are boxed-out of upstream decisions that might prevent Manifest v2 from swirling down the drain. I'd argue the story isn't very beautiful anymore.
re-thc•Jun 11, 2026
> Why not just contribute to OpenCode instead of creating a clone :/
It's controlled by a different organization; in particular a startup in a "competing" space.
doctorpangloss•Jun 11, 2026
have you ever tried contributing a large number of changes to OSS?
postalrat•Jun 11, 2026
OpenCode can merge in all their changes if they want.
rurban•Jun 11, 2026
Opencode sits on a ton of important PR's, so they didn't want to wait. Everybody else switched to omp (oh my pi) already.
orangeisthe•Jun 11, 2026
Why not?
dartharva•Jun 11, 2026
Could just be a courtesy - Americans tend to be rather suspicious and hostile to contributions coming from China, and it might draw unwarranted attention from agencies and bad media.
Because its currently impossible to "contribute to Opencode".
There are over 500 pages of open issues, up from 78 less than a month ago. They are doing nothing to halt the garbage/duplicates that pop up, and not even addressing legitimate PRs/reports.
I thought this was a wireless/MIMO radio project at first
rickdeckard•Jun 11, 2026
yeah, was also expecting some disruption in the RF-design space.
Kinda RF-nerd clickbait... :)
fadedsignal•Jun 11, 2026
I also thought the same lol. It also happened with lora
eunos•Jun 11, 2026
Well Xiaomi is first and foremost a mobile phone company.
reactordev•Jun 11, 2026
Looks an awful lot like OpenCode
croes•Jun 11, 2026
> MiMoCode is built as a fork of OpenCode.
That’s why
emulio•Jun 11, 2026
Why is OpenCode awful?
reactordev•Jun 11, 2026
Who said that?
psychoslave•Jun 11, 2026
Is that Open-Source like, run it locally, no phone home included, or open source like the thin front-end layer is all that is actually open-source but it’s an empty shell without the remote API it relies on?
Pxtl•Jun 11, 2026
The latter. It looks like it's meant to be a batteries-included agent to promote their free-for-a-limited time AI service that it connects to by default.
psychoslave•Jun 11, 2026
Ok, fair enough compared to the rest of the proeminent actors I guess, but quite confusing from dev point of view. Lately I started to experiment with model like Qwen2.5 on local. Good enough to ask simple question, but didn’t manage to do anything remotely close a agents I started to experiment with through Copilot.
Pxtl•Jun 11, 2026
qwen3.5 9b runs okay on my 12GB gaming GPU. It's very stupid as a coding agent but it's possible to get useful work out of it.
irthomasthomas•Jun 11, 2026
I am experimenting with LFM2.5-8B-1A and getting 250tps on a 3060
passive•Jun 11, 2026
They default it to talking to a free version of their model (which is incredibly cheap if you decide you like it.)
But it seems trivially easy to run it against local models. Their onboarding guide offers that option, though I have no idea if it changes any functionality.
emayljames•Jun 11, 2026
I wonder what the minimum required memory specification is
andai•Jun 11, 2026
> Unlimited Context
>Knowledge accumulates automatically with lossless compression, preserving every critical detail even across million-line projects.
sheept•Jun 11, 2026
It's interesting that it renders Chinese in a TUI. I wonder if that breaks anything that assumes a character is always a column wide.
Chu4eeno•Jun 11, 2026
Terminals and things that live in terminals have relied on wcwidth() to handle this since time immemorial (which is always fun when they are out of sync, e. g. remote over ssh, but in the vast majority of the cases it just works).
GodelNumbering•Jun 11, 2026
> MiMoCode is built as a fork of OpenCode. It keeps all core OpenCode capabilities (multiple providers, TUI, LSP, MCP, plugins) and adds persistent memory, intelligent context management, subagent orchestration, goal-driven autonomous loops, compose workflows, and self-improvement via dream/distill.
From github
Pxtl•Jun 11, 2026
Sounds like they slapped in a bunch of common plugins and released it as a product to promote the free-for-a-limited-time use of their new coding AI service.
cyanydeez•Jun 11, 2026
So, basically the same thing silicon valley has been doing for the past half decade.
ignoramous•Jun 11, 2026
> promote the free-for-a-limited-time use of their new coding AI service
Not sure which "free" service you're referring to, but MiMo v2.5 Pro is plenty capable & (after its recent 70%+ price drop) one of the most affordable options in its class (DeepSeek v4 Pro, MiniMax M3, & Qwen 3.7 Plus). I read somewhere that Labs are incentivized to implement custom harnesses because each model has its strengths, quirks, & blindspots (like Qwen forking Gemini CLI)?
re-thc•Jun 11, 2026
> like Qwen forking Gemini CLI
That was a good call. Gemini CLI is dead.
egeozcan•Jun 11, 2026
don't anthropomorphize the guillotine
bel8•Jun 11, 2026
What I like about MiMo too is that it is multimodal.
For example, I can send screenshots of what I'm developing and it understands.
mistercheese•Jun 11, 2026
I thought only the MmiMo 2.5 non-pro was multimodal?
indigodaddy•Jun 11, 2026
Only non-pro is multimodal I believe
gbalduzzi•Jun 11, 2026
I'm guessing the greatest reason behind each provider creating and agent harness is that (a) there is not a clear winner still and (b) it is harder to switch models with a competitor, as you also have to switch harness
pelagicAustral•Jun 11, 2026
I got an invite to test their ultra fast model only to be geofenced when trying to use it. Pff!
nmfisher•Jun 11, 2026
Good timing, I was looking for alternatives earlier today. opencode didn't install properly and I wasn't a fan of oh-my-pi and nanocoder.
MiMo code (via my z.ai coding plan) is very pleasant so far, nice UI and seems to respond faster than Claude Code. It might be injecting much less cruft into the conversation.
I also got access to the mimo-2.5-pro ultraspeed model yesterday, which is really quite snappy. It does cost more than DeepSeek, though, so I'm not sure whether it's worth it yet. Definitely fast though.
cyanydeez•Jun 11, 2026
is it local compatible and does it have telemetry?
qskousen•Jun 11, 2026
it does have telemetry, enabled by default, that sends metrics to tracking.miui.com, including what model you are using. it can be turned off by environment variable (MIMOCODE_ENABLE_ANALYSIS=false), and yes it still has all the normal OpenCode provider logic so it will work with other/local models. it also automatically looks for updates and fetches a mimo model list, including when the telemetry is off, though those can also be disabled.
telemetry enabled by default and named "analysis" is not great.
polski-g•Jun 11, 2026
Opencode didn't install properly? Its just "mise use -g opencode@latest"
jadar•Jun 11, 2026
I'm kind of surprised the demo UI is macOS. Are they mainly using Apple products to develop these things?
rurban•Jun 11, 2026
The more advanced devs all use apple laptops, sure.
dgellow•Jun 11, 2026
Who isn’t?
bobim•Jun 11, 2026
I'm slapping debian on any crap hardware around, but that's just me with different ideological standards.
joshmarinacci•Jun 11, 2026
That is an incredibly annoying grunge font. And what is the point of the hidden image in the background that reveals under your mouse cursor.
tietjens•Jun 11, 2026
This is my favorite of the Chinese models I have tried. I think it would be hard to know if I was using Opus of MiMo if blindfolded in many instances.
denysvitali•Jun 11, 2026
Yes, but this has nothing to do with MiMo (the model).
This is what Claude Code is to Claude
djsamseng•Jun 11, 2026
I found it relevant and actually just the information I was looking for. Having a highly recommended model behind the tool makes it worth further investigation.
dannyw•Jun 11, 2026
MiMo Code is not a model, it's a harness like Claude Code / OpenCode / Codex (which is still open source, Apache 2.0, btw).
You might mean the MiMo-V2.5-Pro model?
100ms•Jun 11, 2026
He didn't say MiMo Code
lsaferite•Jun 11, 2026
Saying "This" on a post about "Mimo Code" leads one to, rightly, assume you are referring to "Mimo Code". If they commentor wanted to make a comment about a specific model it would have be clearer to simply mention the model instead of getting there transitively.
100ms•Jun 11, 2026
Mind numbing
tietjens•Jun 11, 2026
Sorry for confusion. I indeed meant the model itself.
porphyra•Jun 11, 2026
Pretty neat that you can just install it and start using it (at a Sonnet 4.6-level model) without needing to sign in or pay.
Typically, Chinese websites are a big pain to log in or sign up because they require a +86 phone number due to legal reasons. Being able to use it without having to make an account is amazing for friction reduction. I could probably even just install it onto new machines to help with set up.
I wonder how they are gonna detect and block abuse though?
ProofHouse•Jun 11, 2026
So funny I have noticed how terrible the signup is on all these Chinese models, companies etc. Always wonder why it is such an easy process. Like QQ, Tencent etc demos Ive seen past year
sdesol•Jun 11, 2026
> at a Sonnet 4.6-level model
MiMo v2.5.0-Pro is honestly the first Chinese model that I've tried where I really though why should I use Claude Sonnet when I can get the same results for a fraction of the cost. There was always something off about Chinese models that made it apparent that it couldn't fully compete with GPT, Claude, Gemini, etc. but this was the first model where I was like, this feels like Sonnet.
I can't prove it, but I think they trained heavily on Claude output. From my perspective I don't care since Anthropic trained on my data.
Using them also works well for North Americans as our peak hours is not theirs.
If I had one complaint, the v2.5.0-Pro model thinks too much.
phplovesong•Jun 11, 2026
Any english links?
Alifatisk•Jun 11, 2026
Top right corner
desipenguin•Jun 11, 2026
You can change language from the top right-most dropdown, and select English
rurban•Jun 11, 2026
Only worked for about 5m, then Too many requests.
DanMcInerney•Jun 11, 2026
I've worked a lot with MiMo in my project that pits LLMs against each other in games (clankerfights.ai). It is a very very good model for the price. MiniMax I'd say is smarter, but MiMo really touches near pareto frontier.
MemoryHoleHQ•Jun 11, 2026
macOS binary (mimocode-darwin-arm64.zip ) seems broken: "“mimo” is damaged and can’t be opened. You should move it to the Trash."
haunter•Jun 11, 2026
No, you are just experiencing the best of Apple. How dare you download non notarized binaries on your own computer? Do you have a license for that?
Terminal > sudo xattr -rd com.apple.quarantine > Drag and drop the app into terminal > enter and enter your password
MemoryHoleHQ•Jun 11, 2026
Strange, I reckon I installed Tahoe just a while ago and still didn't have a similar issue, but I remember on previous MacOS versions the error message for unotarized binaries, was to warn that there was indeed a security issue with the binary, not that it was simply "damaged".
This is usually a PoC (Proof of concept) way to install something on a temporary container or temporary VM, but not for production use during daily desktop operation.
I was hoping their documentation would provide better installation instructions. But strangely, only for Windows do they recommend "npm install -g @mimo-ai/cli," which is a much better approach to managing installed packages.
For Mac/Linux, they have the strange recommendation to use the dangerous "curl <some_url> | bash." Quote:
> (for the best experience, Mac users are strongly encouraged to use iTerm or the VSCode Terminal)
> curl -fsSL https://mimo.xiaomi.com/install | bash
We've had this discussion since Eazel Linux desktop popularized bash | curl in 2001.
> npm install ... is a much better approach to managing installed packages.
No. Until the upcoming version of npm is out, npm will also run arbitrary code. Almost all common installation tools run arbitrary code. Not doing that is sadly the exception for now.
mapontosevenths•Jun 11, 2026
Isn't executing arbitrary code kind of the entire point of NPM though? Any chance you have a link to something that describes their plans?
nailer•Jun 11, 2026
> Isn't executing arbitrary code kind of the entire point of NPM though?
No. npm is a package manager. As mentioned in the comment you're replying to, almost all package managers execute arbitrary code. Eg:
- pip
- Cargo
- apt/dpkg
- dnf/yum
- Homebrew
- RubyGems
- Composer (limited)
- Maven
> Any chance you have a link to something that describes their plans?
You're right that it's as dangerous as it's executing random third-party code on your machine, but the method also has propagated far beyond PoCs and such at this point. All of these projects and many others push that install method: Bun, Deno, rustup, k3s, Docker (if using their helper script), Homebrew, Tailscale...
meatmanek•Jun 11, 2026
Frankly, it's not really more insecure than any other installation method. Apt packages and the like generally have the ability to specify pre/post-install scripts, so `sudo dpkg -i ./random.deb` is equivalent to `sudo bash ./random.sh`. Even if they didn't have pre/post-install scripts, they're still writing arbitrary files to arbitrary locations on your disk, so they can trigger execution the next time you boot or log in or whatever.
And at the end of the day, no matter the installation method (even just unpacking a tarball and executing the program directly from that directory), you're going to run their program on your computer, and then the program can do whatever it wants. Maybe you don't run it with sudo, but https://xkcd.com/1200/ seems relevant.
emulio•Jun 11, 2026
A package (like a .deb) is a static artifact. It can be hashed, mirrored, and GPG-signed. Package managers usually verify that signature before any pre/post-install scripts. A "curl <some_url> | bash" pipe is a dynamic stream; the server can perform targeted attacks: sending a clean script to 99% of users and a malicious payload only to a specific IP address or User-Agent. This allows for targeted attacks that are invisible to the rest of the community.
Yes, running third-party code is always a leap of faith, but why choose a delivery method that removes the possibility of verification and opens the door to targeted injections? Convenience shouldn't be an excuse to ignore basic security hygiene.
Chu4eeno•Jun 11, 2026
The problem is that npm, cargo, etc. set the standard in people's minds for how package managers work, when the Linux community has been working on securing the supply chain issues for decades.
Like requiring a WoT (usually with physical meetups) vetting people creating packages, FTP-masters, dedicated clean buildbots, etc. in addition to the packages themselves being signed and so on.
This is just sh, not bash, but I doubt it would be any better.
mapontosevenths•Jun 11, 2026
This is how everyone does it now. Including Anthropic.
To be fair, is that any different from naively trusting NPM? It's not like NPM is doing any vetting. They're every threat actors favorite sandbox these days.
Isn't Unlimited Context pretty difficult to promise? What exactly do they mean, could I just have two agents locked into a TTRPG back and forth forever?
psychoslave•Jun 11, 2026
Do you plan to ask them some master plan to live forever?
gosukiwi•Jun 11, 2026
Claude and Codex pricing will eventually have to come down, for most common coding tasks you don't need a super smart slow model but a smart-enough and very fast one.
Alifatisk•Jun 11, 2026
I don't think many understand that Sonnet and even Haiku can probably accomplish their task, instead of them invoking a beast like Opus to tell them about todays weather.
pmontra•Jun 11, 2026
I don't known how Codex works, but we can set environment variables and point Claude CLI to deepseek. I think that before slashing prices they will slash those environment variables. After all they are not working to give a free TUI to deepseek and possibly to other competitors. But eventually yes, prices will go down or there will be an attempt at a regulatory capture.
winstonp•Jun 11, 2026
Claude Code TUI is garbage. There's nothing worth protecting in there.
MangoCoffee•Jun 11, 2026
cheap token for the win.
Microsoft github copilot recently changed their billing. i'm on the yearly subscription. GPT-5.4 is now 6x and even previously free model like GPT-5 mini now cost .33x. its only June 11 and my usage is now at 50%.
Alifatisk•Jun 11, 2026
Xiaomi have been cooking a lot in recent times. Their model, especially the pro series, is underrated in my opinion. It haven't received the attention it deserves while it is pushing higher and higher in benchmark scores (looking at artifical analysis), and this was before Deepseek dropped V4.
Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month. And now, they are dropping a Harness for their own model? Amazing. I wish they added support for installation through Homebrew though.
On another note, this is what I would like to see more of from a company, what I do not welcome is startups making their model exclusive and hurt their customer base through sabotaging as a way to prevent eventual distillation attempts.
miroljub•Jun 11, 2026
Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
Alifatisk•Jun 11, 2026
> Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Very fascinating to learn this, didn't know Moonshoot (Kimi) also collaborated with others. I think I read in another post that DeepSeek and Qwen team shared the same building? So that kind of explains it.
> Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
I have to agree. I had the great opportunity to take the offer Z.ai had with their Christmas deal, their lite plan was 3 months for 7$. GLM-4.7 was already impressive enough.
When they released GLM-5-Turbo and GLM-5.1, that is when I came to the realization of how close the gap is between proprietary western models and Chinese open-weight ones (not all of them are ofc).
I could barely believe how good GLM-5.1 was, I didn't think I was using it in CC and had to check the settings again. It's astonishing how close the gap is now, and this competition benefits us very much, the pricing is so low atm, its amazing.
spelk•Jun 11, 2026
>Looks like they have very effective collaboration with DeepSeek and Kimi.
The collaboration is informal. People don’t seem to realize this, but the Chinese internet for programmers and developers today feels a lot like StackExchange in its heyday. There’s a huge emphasis on sharing knowledge, because sharing what you know builds your profile, and becoming a rockstar in a subfield is one of the only ways to get ahead.
Competition in China is ruthless. But unlike in North America, where individuals are often bound by agreement to hoard knowledge because it can give them a competitive edge, the competitive advantage in China is building face and peer recognition. And that comes from proving that you are worthy of being a "master/teacher", and that extends to the valuation of your knowledge business. For example, the third wave coffee shops in China, the master roaster is often called "master/teacher" once they win a roasting competition and start sharing new knowledge of roasting in the public sphere, and that's a title of sincere respect.
You can see parallels with those that apply to give talks at conferences and post snazzy technical presentations they give in the US, but the bar for what qualifies as new knowledge is far higher in China because there's a massive ecosystem of people rushing to outcompete what you have to offer, and once the ball gets rolling on knowledge sharing, lots of people will go off and build upon that knowledge or try to build businesses on top of that, which in turn produces more knowledge.
Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share. And I suspect in great part, the decision to release open-weights is heavily tied to that concept of building face/peer recognition = building valuations.
Alifatisk•Jun 11, 2026
Thank you for the insights, this sounds like an environment I want to partake.
> Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share.
Are you able to share those forums and other resources? I would love to read what people in these communities are sharing.
elxr•Jun 11, 2026
What forums do Chinese programmers use that you would recommend exploring around in?
Mainly want to see the people who are building stuff I've never even thought of.
miroljub•Jun 11, 2026
Hm, can I just use free tokens without using MiMo-Code?
OpenCode or pi.dev are enough. I don't like CC-style agent lock-in, regardless if it's Anthropic or Xiaomi doing it.
adi2907•Jun 11, 2026
What a transformation by Xiaomi to build almost frontier level models. Five years back, when I was in the data science team, they dint really bother about AI models and were using Baidu for NLP and vision under the hood of their APIs
ignoramous•Jun 11, 2026
Wrote this eons ago:
I fully expect Baidu and other tech giants on the Chinese shores to try and push the boundaries of technology. Silicon Valley (and the US) in general has always been the hot-bed of innovation. But with enormous increase in wealth in China (and to an extent in India), I can see these companies being more and more ambitious. Not long ago Andrew Ng of Coursera and Stanford AI Lab fame joined Baidu to further their rival to the 'Google Brain' project.
Xiaomi has long been positioning itself as a company with design chops of Apple, engineering chops of Google, and e-commerce chops of Amazon, all rolled into one-- and I can see where they are coming from. If they manage to pull it off, I guess that's when we'd start seeing the proverbial "Death of Silicon Valley" as in, it loosing its strange monopoly and strangle hold on tech world in terms of both talent and innovation.
"Death of Silicon Valley" in this case is such a funny perspective. Like, how twisted is the US's view of the market that they think "Competition? Oh no. Sound the alarms."
Yes the subscription plan costs literally the same thing as just paying API-per-token pricing.
submeta•Jun 11, 2026
Can this be used as an alternative to Claude backend? For Ralph loops? Replacing `claude -p`? Anyone can shed a light on this?
freakynit•Jun 11, 2026
As much as I absolutely love Mimo V2.5 Pro (it's a genuinely good model), I absolutely hate the way they calculate usage in their token plan.
For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.
But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.
This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.
This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.
Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.
Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).
microbass•Jun 11, 2026
It's very confusing. They have tokens for their API and credits for their "token plan".
Fendy•Jun 11, 2026
looks great. surprised that Xiaomi has made such great advancements in AI
tdesilva•Jun 11, 2026
Good, coding harnesses should be open source and LLMs should be treated as commodities. Minimize switching costs for consumers, and let people understand how they're interacting with the context and the LLM outputs.
The industry has been moving the wrong direction with Claude Code staying closed (despite multiple times leaking the source code!) and the open source Gemini CLI being deprecated in favor of closed source Antigravity CLI.
dominotw•Jun 11, 2026
what do you mean by should?
tdesilva•Jun 11, 2026
ah nevermind it's just a fork of OpenCode
miroljub•Jun 11, 2026
As much as their propaganda wants us to believe, Anthropic is not 'the industry'.
tw04•Jun 11, 2026
Why would a company do any of these things? What is their motivation for any of it? That’s like saying cloud providers should be commodity and should open source all of their platforms and eliminate egress fees so customers can easily leave at any point in time.
That’s a charity, not a business model.
pstuart•Jun 11, 2026
Good will and trust can ultimately have monetary value, and having a funnel based on open source is a viable play if it leads to a service that is sticky.
SilverElfin•Jun 11, 2026
Open source and open weight AI is very important to protect freedom of speech. OpenAI, and ESPECIALLY Anthropic, will try to ban them through regulatory capture / safety fearmongering. We need to make sure that does not happen. It’s not society’s problem if these frontier labs have no moats.
WhereIsTheTruth•Jun 11, 2026
You know they are benchmaxxing when they end up writing their coding harness in TypeScript npm slop
Their models can't help them build it with something better?
That's the only benchmark people need, whether or not their model can raise the bar of their own product
And so far it's looking pretty sad
solenoid0937•Jun 11, 2026
This is super exciting, can't wait to try it out
nutifafa•Jun 11, 2026
mimocode gets it. This is actually, impressive! Chinese models are really up there with the rest.
36 Comments
“ MiMoCode is a terminal-native AI coding assistant. It can read and write code, run commands, manage Git, and use a persistent memory system to keep a deep understanding of your project across sessions while continuously improving itself.”
GitHub link (English): https://github.com/XiaomiMiMo/MiMo-Code
@dang might be better to link to the GitHub, and not for language reasons.
(Edit: for posterity, original URL as submitted was [0]).
[0]: https://mimo.xiaomi.com/mimocode
It's a client-side change and doesn't impact the URL so users must manually change it each time they visit the site though
How many sites do this but you don't notice because they default to English?
However, the right solution is still to use the language header. I send that to them, they should use it to give me the right one by default.
One of the funny things is that this whole site is in an iframe; which breaks both Google Translate, and the Firefox translate feature. If you check, the outer iframe seems to indicate `lang="en'` and loads the iframe with `src="/coder/index.html?lang=en"`, but the inner iframe still gets a `lang="zh-CN'` by default until you use the toggle.
If you go to the eventual redirect source of the page with `lang=en` parameter, you get a `lang="en"` attribute, but it's still in Chinese until you toggle it with the menu: https://mimo.xiaomi.com/coder?lang=en
Anyhow, yeah, lots of pages are probably broken this way but we don't notice. But still, it has that info from your request, it should use it.
When you look at the source, you can set it to English from the params https://mimo.xiaomi.com/coder?lang=en but there's a small bug, the hero subtitle isn't translated but everything else is.
Wondering why this is in an iframe.... so strange
Not sure why my iPhone shows an option to translate website but all the destination languages to pick from (I have multiple languages installed), including English, are greyed out. iPhone does support translating from Chinese (Simplified or Traditional), and the button to translate website isn’t greyed out like it is for unsupported/unrecognized languages. Might be an iOS 27 bug, because it is working on other websites?
Why not just contribute to OpenCode instead of creating a clone :/
There's plenty of reasons to start your own fork that you have full agency of, as long as the OSS License is maintained anyone will be able to benefit from any new features they want to make use of.
Today, legacy KHTML maintainers are boxed-out of upstream decisions that might prevent Manifest v2 from swirling down the drain. I'd argue the story isn't very beautiful anymore.
It's controlled by a different organization; in particular a startup in a "competing" space.
I think there's simply too much changed.
There are over 500 pages of open issues, up from 78 less than a month ago. They are doing nothing to halt the garbage/duplicates that pop up, and not even addressing legitimate PRs/reports.
Kinda RF-nerd clickbait... :)
That’s why
But it seems trivially easy to run it against local models. Their onboarding guide offers that option, though I have no idea if it changes any functionality.
>Knowledge accumulates automatically with lossless compression, preserving every critical detail even across million-line projects.
From github
Not sure which "free" service you're referring to, but MiMo v2.5 Pro is plenty capable & (after its recent 70%+ price drop) one of the most affordable options in its class (DeepSeek v4 Pro, MiniMax M3, & Qwen 3.7 Plus). I read somewhere that Labs are incentivized to implement custom harnesses because each model has its strengths, quirks, & blindspots (like Qwen forking Gemini CLI)?
That was a good call. Gemini CLI is dead.
For example, I can send screenshots of what I'm developing and it understands.
MiMo code (via my z.ai coding plan) is very pleasant so far, nice UI and seems to respond faster than Claude Code. It might be injecting much less cruft into the conversation.
I also got access to the mimo-2.5-pro ultraspeed model yesterday, which is really quite snappy. It does cost more than DeepSeek, though, so I'm not sure whether it's worth it yet. Definitely fast though.
telemetry enabled by default and named "analysis" is not great.
This is what Claude Code is to Claude
You might mean the MiMo-V2.5-Pro model?
Typically, Chinese websites are a big pain to log in or sign up because they require a +86 phone number due to legal reasons. Being able to use it without having to make an account is amazing for friction reduction. I could probably even just install it onto new machines to help with set up.
I wonder how they are gonna detect and block abuse though?
MiMo v2.5.0-Pro is honestly the first Chinese model that I've tried where I really though why should I use Claude Sonnet when I can get the same results for a fraction of the cost. There was always something off about Chinese models that made it apparent that it couldn't fully compete with GPT, Claude, Gemini, etc. but this was the first model where I was like, this feels like Sonnet.
I can't prove it, but I think they trained heavily on Claude output. From my perspective I don't care since Anthropic trained on my data.
Using them also works well for North Americans as our peak hours is not theirs.
If I had one complaint, the v2.5.0-Pro model thinks too much.
Terminal > sudo xattr -rd com.apple.quarantine > Drag and drop the app into terminal > enter and enter your password
A bit crappy on Apple's side.
Thank you.
This is usually a PoC (Proof of concept) way to install something on a temporary container or temporary VM, but not for production use during daily desktop operation.
I was hoping their documentation would provide better installation instructions. But strangely, only for Windows do they recommend "npm install -g @mimo-ai/cli," which is a much better approach to managing installed packages.
For Mac/Linux, they have the strange recommendation to use the dangerous "curl <some_url> | bash." Quote:
> (for the best experience, Mac users are strongly encouraged to use iTerm or the VSCode Terminal) > curl -fsSL https://mimo.xiaomi.com/install | bash
:(
> npm install ... is a much better approach to managing installed packages.
No. Until the upcoming version of npm is out, npm will also run arbitrary code. Almost all common installation tools run arbitrary code. Not doing that is sadly the exception for now.
No. npm is a package manager. As mentioned in the comment you're replying to, almost all package managers execute arbitrary code. Eg:
- pip
- Cargo
- apt/dpkg
- dnf/yum
- Homebrew
- RubyGems
- Composer (limited)
- Maven
> Any chance you have a link to something that describes their plans?
https://github.blog/changelog/2026-06-09-upcoming-breaking-c...
And at the end of the day, no matter the installation method (even just unpacking a tarball and executing the program directly from that directory), you're going to run their program on your computer, and then the program can do whatever it wants. Maybe you don't run it with sudo, but https://xkcd.com/1200/ seems relevant.
Yes, running third-party code is always a leap of faith, but why choose a delivery method that removes the possibility of verification and opens the door to targeted injections? Convenience shouldn't be an excuse to ignore basic security hygiene.
Like requiring a WoT (usually with physical meetups) vetting people creating packages, FTP-masters, dedicated clean buildbots, etc. in addition to the packages themselves being signed and so on.
> sh -c 'curl -fsSL https://chatgpt.com/codex/install.sh | CODEX_NON_INTERACTIVE=1 sh'
This is just sh, not bash, but I doubt it would be any better.
To be fair, is that any different from naively trusting NPM? It's not like NPM is doing any vetting. They're every threat actors favorite sandbox these days.
https://code.claude.com/docs/en/quickstart
Microsoft github copilot recently changed their billing. i'm on the yearly subscription. GPT-5.4 is now 6x and even previously free model like GPT-5 mini now cost .33x. its only June 11 and my usage is now at 50%.
Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month. And now, they are dropping a Harness for their own model? Amazing. I wish they added support for installation through Homebrew though.
On another note, this is what I would like to see more of from a company, what I do not welcome is startups making their model exclusive and hurt their customer base through sabotaging as a way to prevent eventual distillation attempts.
Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
Very fascinating to learn this, didn't know Moonshoot (Kimi) also collaborated with others. I think I read in another post that DeepSeek and Qwen team shared the same building? So that kind of explains it.
> Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
I have to agree. I had the great opportunity to take the offer Z.ai had with their Christmas deal, their lite plan was 3 months for 7$. GLM-4.7 was already impressive enough.
When they released GLM-5-Turbo and GLM-5.1, that is when I came to the realization of how close the gap is between proprietary western models and Chinese open-weight ones (not all of them are ofc).
I could barely believe how good GLM-5.1 was, I didn't think I was using it in CC and had to check the settings again. It's astonishing how close the gap is now, and this competition benefits us very much, the pricing is so low atm, its amazing.
The collaboration is informal. People don’t seem to realize this, but the Chinese internet for programmers and developers today feels a lot like StackExchange in its heyday. There’s a huge emphasis on sharing knowledge, because sharing what you know builds your profile, and becoming a rockstar in a subfield is one of the only ways to get ahead.
Competition in China is ruthless. But unlike in North America, where individuals are often bound by agreement to hoard knowledge because it can give them a competitive edge, the competitive advantage in China is building face and peer recognition. And that comes from proving that you are worthy of being a "master/teacher", and that extends to the valuation of your knowledge business. For example, the third wave coffee shops in China, the master roaster is often called "master/teacher" once they win a roasting competition and start sharing new knowledge of roasting in the public sphere, and that's a title of sincere respect.
You can see parallels with those that apply to give talks at conferences and post snazzy technical presentations they give in the US, but the bar for what qualifies as new knowledge is far higher in China because there's a massive ecosystem of people rushing to outcompete what you have to offer, and once the ball gets rolling on knowledge sharing, lots of people will go off and build upon that knowledge or try to build businesses on top of that, which in turn produces more knowledge.
Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share. And I suspect in great part, the decision to release open-weights is heavily tied to that concept of building face/peer recognition = building valuations.
> Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share.
Are you able to share those forums and other resources? I would love to read what people in these communities are sharing.
Mainly want to see the people who are building stuff I've never even thought of.
OpenCode or pi.dev are enough. I don't like CC-style agent lock-in, regardless if it's Anthropic or Xiaomi doing it.
https://en.wikipedia.org/wiki/Xiaomi_Mi_1
And now they make one of the fastest cars ever created and frontier-level AI. In just over a decade. 你好!
For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.
But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.
This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.
This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.
Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.
Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).
The industry has been moving the wrong direction with Claude Code staying closed (despite multiple times leaking the source code!) and the open source Gemini CLI being deprecated in favor of closed source Antigravity CLI.
That’s a charity, not a business model.
Their models can't help them build it with something better?
That's the only benchmark people need, whether or not their model can raise the bar of their own product
And so far it's looking pretty sad