Launch HN: Morph (YC S23) – Apply AI code edits at 4,500 tokens/sec

bhaktatejas922 | 7/7/2025, 2:40:45 PM
Hey HN, I'm Tejas at Morph. We've built a crazy-fast Apply model that applies AI-generated code edits to your files. It clocks in at over 4,500 tokens per second, turning sometimes-lazy, imperfect AI-generated patches into fast, reliable edits.

Why? AI models spit out code that can't reliably be inserted into existing code. Full-file rewrites and brittle search-and-replace hacks are too slow, expensive, or error-prone.

Morph takes a different approach:

- Your agent outputs edits "lazily", referencing unmodified lines in the existing file (e.g. // ...existing code...)

- Morph instantly applies these edits to a file using our Fast Apply model + our inference engine. We do this with a slight variation on "speculative edits," using speculative decoding and the original code as a reference for blazing-fast generation
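To make the lazy-edit format concrete, here's a toy Python sketch of the merge problem an Apply model solves. It is my illustration, not Morph's implementation: it assumes each literal chunk in the update begins and ends with an unchanged anchor line from the original and aligns by string matching, whereas a real Apply model learns this alignment (and so handles ambiguous or fuzzy anchors):

```python
# Toy merge of a lazy edit into the original file.
# Assumption (mine, not Morph's): every literal chunk between markers
# starts and ends with an unchanged "anchor" line from the original.
MARKER = "// ...existing code..."

def apply_lazy_edit(original: str, update: str) -> str:
    orig = original.splitlines()
    out: list[str] = []
    pos = 0  # next unconsumed line of the original
    chunks = update.split(MARKER)
    for idx, chunk in enumerate(chunks):
        lines = [l for l in chunk.splitlines() if l.strip()]
        if not lines:
            if idx == len(chunks) - 1:
                out.extend(orig[pos:])  # trailing marker: keep the rest
            continue
        if idx > 0:
            # A marker preceded this chunk: copy the elided original
            # lines up to the chunk's opening anchor line.
            while pos < len(orig) and orig[pos] != lines[0]:
                out.append(orig[pos])
                pos += 1
        out.extend(lines)  # emit the rewritten region
        # Skip the original region this chunk replaces, using the
        # chunk's last line as the closing anchor.
        while pos < len(orig) and orig[pos] != lines[-1]:
            pos += 1
        pos += 1
    return "\n".join(out)

ORIGINAL = """function add(a, b) {
  return a + b;
}
function sub(a, b) {
  return a - b;
}"""

UPDATE = """// ...existing code...
function sub(a, b) {
  if (b === undefined) throw new Error("missing b");
  return a - b;
}"""
```

Running `apply_lazy_edit(ORIGINAL, UPDATE)` keeps `add` untouched and rewrites `sub`. The hard cases (duplicated anchors, moved code, edits without clean context lines) are exactly where string matching breaks and a trained Apply model earns its keep.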

This approach was pioneered by Cursor last year, but the models that set Cursor apart—like their Fast Apply model—aren’t available as an API. We built Morph so developers can build a similar experience into their own coding agents.

Try it (no payment or sign up): https://morphllm.com/dashboard

Docs: https://docs.morphllm.com/quickstart

We have two Fast Apply models: morph-v3-fast (4,500+ tok/sec) and morph-v3-large (2,500+ tok/sec). These models power Fast Apply at create.xyz, databutton, continue.dev, and more.
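For a sense of what a call looks like, here's a minimal sketch of building a request payload. The OpenAI-compatible chat-completions shape and the `<instruction>`/`<code>`/`<update>` tagged message format are my reading of the quickstart; check docs.morphllm.com for the authoritative format before relying on it:

```python
def build_apply_request(instruction: str, code: str, update: str,
                        model: str = "morph-v3-fast") -> dict:
    """Assemble a chat-completions payload for a Fast Apply call.

    Assumes an OpenAI-compatible endpoint and a tagged single-message
    format (instruction + original code + lazy update). Verify both
    against the official docs; this is a sketch, not the spec.
    """
    content = (
        f"<instruction>{instruction}</instruction>\n"
        f"<code>{code}</code>\n"
        f"<update>{update}</update>"
    )
    return {
        "model": model,
        "messages": [{"role": "user", "content": content}],
    }
```

You would then send this payload with any OpenAI-compatible client pointed at Morph's base URL; the completion comes back as the full merged file.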

We also have more cooking:

- Inline Edit Model (Cmd-K): extremely fast inline edits that keep you in dev flow state

- Morph Tab API: our Next Edit Prediction model guesses your next code edit + action with sub-500ms latency. It's currently in private beta, but you can request early access here: https://morphllm.com/tab

Our hot takes:

(1) Raw inference speed is very important for practical coding assistants. We've found that boosting inference speed dramatically improves dev experience, far more than chasing 0.2% accuracy gains. Curious if HN agrees or disagrees.

(2) Frontier-model full-file rewrites are legacy; incremental speculative edits are the future. Many popular tools still have frontier models rewrite whole files or emit udiffs, but we've seen huge wins in speed, reliability, user retention/conversion, and cost by ditching that approach entirely. As frontier models move upmarket, they'll leave behind tasks like this: narrow, 99%-accurate tasks that can be aggressively inference-optimized.
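The intuition behind speculative edits can be shown with a toy decoding loop. This is not Morph's engine, just my illustration of why the original file is a great draft: each loop iteration stands in for one parallel verification pass of the apply model over a block of guessed tokens, so long unchanged regions cost roughly one pass per block instead of one pass per token (the resync-after-divergence rule below assumes 1:1 token replacement, a simplification):

```python
def speculative_generate(target_next, draft, block=4, eos="<eos>"):
    """Decode the target output using `draft` tokens (the original
    file) as cheap guesses. Returns (output_tokens, verify_steps)."""
    out, steps, i = [], 0, 0
    while True:
        steps += 1  # one simulated (batched) verification pass
        guesses = draft[i:i + block]
        if not guesses:
            # Draft exhausted: fall back to one token per pass.
            t = target_next(out)
            if t == eos:
                return out, steps
            out.append(t)
            continue
        diverged = False
        for k, g in enumerate(guesses):
            t = target_next(out)  # ground truth for this position
            if t == eos:
                return out, steps
            out.append(t)
            if t != g:
                # The edit changed this token; resync the draft just
                # past the mismatch (assumes 1:1 replacement).
                i += k + 1
                diverged = True
                break
        if not diverged:
            i += len(guesses)  # whole block accepted in one pass

# Toy "target model": the true edited file, revealed one token at a time.
EDITED = "def add ( a , b ) : return a * b".split()
ORIGINAL_TOKENS = "def add ( a , b ) : return a + b".split()

def target_next(prefix):
    return EDITED[len(prefix)] if len(prefix) < len(EDITED) else "<eos>"
```

Here `speculative_generate(target_next, ORIGINAL_TOKENS)` reproduces the 12-token edited output in 5 verification passes instead of the 13 a plain token-by-token loop would take; the gap widens as the unchanged fraction of the file grows, which is exactly the full-file-rewrite case.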

(3) We will see all complexity move into models (plural), not a model (singular). As benchmarks on narrow tasks saturate to 99%+ and frontier models move upmarket, those tasks will move to inference-optimized models. Frontier-model tokens will be spent only on tasks that frontier models alone can do.

We'd love to hear your ideas and experiences with coding agents! https://youtu.be/LdT8epGHJPk

– Tejas & the Morph team
