Starlink service set to launch in Korea this year following law revision (koreajoongangdaily.joins.com)

I wonder if they will use this model for their AI assistant on their Xiaomi 15 series phones. They most likely will. I'm not really sure what to expect from it.

keepamovin · 2h ago

I think it's funny that everything from Xiaomi is "Mi" because for me, the "mi", is "rice". Hahaha - so like all their stuff is "Rice-" this or that. Hahaha

fredwu · 32m ago

Not sure why it would be "funny" as this is literally why they named the company Xiaomi.

Source (Chinese): https://finance.sina.cn/tech/2020-11-26/detail-iiznctke33979...

keepamovin · 8m ago

[delayed]

amazingamazing · 21m ago

just as funny as an Apple, for sure.

keepamovin · 2m ago

[delayed]

cruzcampo · 1h ago

Does Xiaomi literally mean Little Rice? That's what my very limited mandarin would suggest

keepamovin · 1h ago

That is what my literally also rather limited Chinese would suggest. haha

But with many single characters in Chinese, a Chinese person will tell you, if you ask for what a single character means, something like, "Well it's not so easy to pin down the meaning of that one. Sometimes we use it like this, and sometimes like that."

Sure, some characters have an easy meaning (for me, I think the rice in Mi is one of them!) but there's plenty where you cannot get a Chinese person to easily tell you what a single character means. I guess it's a little like, but not the same as, asking an English person to tell you, what any given "morpheme" (word part, like fac-) means. Hahaha. Not a perfect analogy tho! :)

Here's this list of morphemes I found just now thinking about this: https://www.fldoe.org/core/fileparse.php/16294/urlt/morpheme...

Seems incomplete list when you consider etymology of English words are often composed of parts from ages past! :)

kzz102 · 50m ago

Xiaomi can also mean millet. I think it's a reference to this Mao quote: https://en.wikipedia.org/wiki/Millet_plus_rifles?wprov=sfla1

keepamovin · 38m ago

Wow, that's interesting. I guess that's like a US company being called "MRE". We would view that like a veteran's owned and operated company. Interesting.

And all the products would be "MRE-Phone", "MRE-Pod", hehehe :)

rs186 · 1h ago

https://en.wikipedia.org/wiki/Foxtail_millet

os2warpman · 1h ago

小米

little rice

Yes.

But it's more complicated than that.

iszomer · 1h ago

Yes.

jedisct1 · 13m ago

GGUF version (for LM Studio, Ollama, etc): https://huggingface.co/jedisct1/MiMo-7B-RL-GGUF

mobilio · 2h ago

Waiting for GGUF or MLX models.

Probably within few hours will be released.

Havoc · 2h ago

FYI making a gguf yourself isn't hard and doesn't even need a GPU.

But yeah waiting is the easier option

mobilio · 1h ago

I know - but i'm on holiday break with Chromebook.

ukuina · 1h ago

Now there's a challenge!

jedisct1 · 13m ago

https://huggingface.co/jedisct1/MiMo-7B-RL-GGUF

userbinator · 1h ago

...and searching for things related to multiple antennae just got harder.

They could've called it Xiaomimo.

arghwhat · 1h ago

multiple-input, multiple-output was horribly generic to begin with. Terms like multipath propagation and spatial multiplexing will do just fine.

CodeCompost · 1h ago

Open Source or Open Weights?

ilrwbwrkhv · 7m ago

And this point everybody will open source their models or weights. The only one which will not is open AI.

NitpickLawyer · 41m ago

MIT - so open source

Davidzheng · 21m ago

Weights

w4yai · 2h ago

Anyone tried it ?

Alifatisk · 2h ago

No, where can I try it? I saw a huggingface link but I wonder if they host it themselves somewhere to like how Alibaba does with Qwen chat.

yorwba · 2h ago

There is a HuggingFace space (probably not official) at: https://huggingface.co/spaces/orangewong/xiaomi-mimo-7b-rl You might have to wait a minute to get a response. Also, the space doesn't seem to have turn-taking implemented, so after giving the Assistant's response, it kept on generating the Human's next message and so on and so forth.

ramesh31 · 2h ago

These benchmark numbers cannot be real for a 7b model

strangescript · 1h ago

The smaller models have been creeping upward. They don't make headlines because they aren't leapfrogging the mainline models from the big companies, but they are all very capable.

I loaded up a random 12B model on ollama the other day and couldn't believe how good it competent it seemed and how fast it was given the machine I was on. A year or so ago, that would have not been the case.

apples_oranges · 1h ago

exactly, it seems to validate my assumption from some time ago, that we will mostly use local models for everyday tasks.

pzo · 1h ago

yeah especially that this simplifies e.g. doing mobile app for 3rd party developers - not extra cost, no need to setup proxy server, monitoring usage to detect abuse, don't need to make complicated subscription plan per usage.

We just need Google or Apple to provide their own equivalent of both: Ollama and OpenRouter so user either use inference for free with local models or BringYourOwnKey and pay themself for tokens/electricity bill. We then just charge smaller fee for renting or buying our cars.

jillesvangurp · 1h ago

Including figuring out which more expensive models to use when needed instead of doing that by default. Early LLMs were not great at reasoning and not great at using tools. And also not great at reproducing knowledge. Small models are too small to reliably reproduce knowledge but when trained properly they are decent enough for simple reasoning tasks. Like deciding whether to use a smarter/slower/more expensive model.

wg0 · 1h ago

But who will keep them updated and what incentive they would have? That's I can't imagine. Bit vague.

cruzcampo · 1h ago

Who keeps open source projects maintained and what incentive do they have?

jsheard · 1h ago

Most open source projects don't need the kinds of resources that ML development does. Access to huge GPU clusters is the obvious one, but it's easy to forget that the big players are also using huge amounts of soulcrushing human labor for data acquisition, cleaning, labeling and fine tuning, and begrudgingly paying for data they can't scrape. People coding in their free time won't get very far without that supporting infrastructure.

I think ML is more akin to open source hardware, in the sense that even when there are people with the relevent skills willing to donate their time for free, the cost of actually realizing their ideas is still so high that it's rarely feasible to keep up with commercial projects.

cruzcampo · 1h ago

That's a fair point. I think GPU clusters are the big one, the rest sounds like a good fit for volunteer work.

simiones · 34m ago

For the bigger open source projects, companies who use that code for making money. Such as Microsoft and Google and IBM (and many others) supporting Linux because they use it extensively. The same answer may end up applying to these models though - if they really become something that gets integrated into products and internal workflows, there will be a market for companies to collaborate on maintaining a good implementation rather than competing needlessly.

nickip · 1h ago

What model? I have been using api's mostly since ollama was too slow for me.

patates · 26m ago

I really like Gemma 3. Some quantized version of the 27B will be good enough for a lot of things. You can also take some abliterated version[0] with zero (like zero zero) guardrails and make it write you a very interesting crime story without having to deal with the infamous "sorry but I'm a friendly and safe model and cannot do that and also think about the children" response.

[0]: https://huggingface.co/mlabonne/gemma-3-12b-it-abliterated

estsauver · 1h ago

Qwen3 and some of the smaller gemma's are pretty good and fast. I have a gist with my benchmark #'s here on my m4 pro max (with a whole ton of ram, but most small models will fit on a well spec'ed dev mac.)

https://gist.github.com/estsauver/a70c929398479f3166f3d69bce...

justlikereddit · 1h ago

Last time I did that I was also impressed, for a start.

Problem was that of a top ten book recommendations only the first 3 existed and the rest was a casually blended hallucination delivered in perfect English without skipping a beat.

"You like magic? Try reading the Harlew Porthouse series by JRR Marrow, following the orphan magicians adventures in Hogwesteros"

And the further towards the context limit it goes the deeper this descent into creative derivative madness it goes.

It's entertaining but limited in usefulness.

omnimus · 1h ago

LLMs are not search engines…

Philpax · 8m ago

An interesting development to look forward to will be hooking them up to search engines. The proprietary models already do this, and the open equivalents are not far behind; the recent Qwen models are not as great at knowledge, but are some of the best at agentic functionality. Exciting times ahead!

mirekrusin · 28m ago

Exactly, I think all those base models should be weeded out from this nonsense, kardashian-like labyrinths of knowledge complexities that just makes them dumber by taking space and compute time. If you can google out some nonsense news, it should stay there in search engines for retrieval. Models should be good at using search tools, not at trying to replicate their results. They should start from logic, math, programming, physics and so on, similar to how education system is suppose to equip you with. IMHO small models can give this speed advantage (faster to experiment ie. with parallel diverging results, ability to munch through more data etc). Stripped to this bare minimum they can likely be much smaller with impressive results, tunable, allow for huge context etc.

bearjaws · 38m ago

My guess is that it is over fitted to the tests.

mirekrusin · 56m ago

Today's best models will be worse models for the rest of your life.

GaggiX · 2h ago

https://qwenlm.github.io/blog/qwen3/

Go look at the benchmark numbers of qwen3-4B if you think these are unrealistic.

andrepd · 1h ago

Every LLM is basically being trained on benchmarks so "benchmark" as applied to LLMs is a pretty meaningless term.

Show HN: VideoDB – 80 % fewer hallucinations on NFL game analysis (docs.videodb.io)

Diverting the Flood of History: Ada Palmer's "Inventing the Renaissance" (chireviewofbooks.com)

Synchronizing a post across 2 sites, including Meta Box metadata (gatographql.com)

Haut.ai introduces skincare recommendation system (beautynewsdaily.com)

Photography from the Vietnam War Changed America (nytimes.com)

Tcl Release Calendar (core.tcl-lang.org)

Offshore wind: Promises of new jobs (rte.ie)

Firefox Git Migration, the unofficial guide (glandium.org)

Solar panels between the tracks on an active railway line: a dreadful idea (jonworth.eu)

Show HN: Fermi Chain. A Wordle-style game for order-of-magnitude thinking (fermichain.com)

The Signal Chat Leak and the NSA (schneier.com)

Starlink service set to launch in Korea this year following law revision (koreajoongangdaily.joins.com)

Show HN: Malai – Share your dev server (and more) over P2P (malai.sh)

U.S. Economy Shrank in First Quarter (nytimes.com)

Direct chat with LLM from address bar in Firefox (robertdruska.com)

Eight Charts That Sum Up Trump's First 100 Days (nytimes.com)

Evaluating AI's Impact on Haskell Open Source Development (well-typed.com)

The Study of Man: Adjusting Men to Machines (commentary.org)

So you want to price your AI features (elenaverna.com)

Netmd-JS, a library to interact with MiniDisc (github.com)

Belgium wants to protect teenagers against TikTok (belganewsagency.eu)

ByteDance Proposes Faster Linux Inter-Process Communication (phoronix.com)

Why do all graphic designers use Macs? (creativebloq.com)

Uncle Bob is against SQL in programing languages (twitter.com)

A2a for Java (github.com)

Debezium to olake.io – PhysicsWallah switch for CDC

Mellum Goes Open Source (blog.jetbrains.com)

Show HN: Open-source sound effects and react library to spice up your website (reactsounds.com)

Using Vortex to accelerate Apache Iceberg queries up to 4x (spiraldb.com)

Raycast for iOS (raycast.com)

Let Me Grok for You: Accelerating Grokking via Embedding Transfer (arxiv.org)

U.S. Economy Contracts at 0.3% Rate in First Quarter (wsj.com)

Big Table of Big Tech (Alternatives) (comparisontabl.es)

An Inside Look at the Subway's Archaic Signal System (nytimes.com)

Cast AI Closes a $108M Series C Round (cast.ai)

Fourier Caterpillar (reubenmargolin.com)

OCaml's Wings for Machine Learning (github.com)

How Rolling Planning Changes the Strategy Game (mcchrystalgroup.com)

Humans Will Be DJs, Not Track Producers, in the Age of AI (singulatron.com)

Ask HN: Are there AI crawlers that crawl on the last day of the month?

Tidewave: Beyond Code Intelligence (twitter.com)

Ask HN: DAO startup with shares based on commits

No-as-a-Service (NAAS) is a simple API that returns a random rejection reason (naas.isalman.dev)

Dragonfly, a Pu-fueled drone heading to Titan, gets key NASA approval (ans.org)

The global hiring boom is here – and it's solving the talent crisis (hrdive.com)

Antithesis for Founders (antithesis.com)

How many dams India needs to deprive Pakistan of Indus waters (indiatoday.in)

We Tried to Warn You (unsafescience.substack.com)

PicoCalc Lisp Machine (ulisp.com)

I want to help clear EU skies from US clouds

Xiaomi MiMo Reasoning Model

Comments (50)