A new visual language for content creation

Comments (4)

qantrell · 4h ago

Our new architecture analyzes and decomposes images into a code-like intermediate representation called layouts — an internal visual language that captures the composition and structure of any image.

Our intermediate image representation is designed for transparency and control. Rather than hiding the model's understanding behind a black box, we expose its internal representation, or code, to enable direct manipulation of visual elements. Users can now move, resize, add, remove or replace objects with granular control.

moorjani · 3h ago

very cool, surely it's better than nano banana

mgh83 · 3h ago

exciting to see new tools that don't just give you a one-off random answer but let you mess up with the details!

hunterloftis · 3h ago

Hey, I worked on this & totally agree - I want my tools to be editors, not slot machines.

My focus has been on the beta "Edit" feature, tucked away into the top-right when you're looking at a single image. It lets you directly manipulate the image as both a spatial canvas and a semantic structure.

We should have the ability to run any code we want on hardware we own (hugotunius.se)

Cognitive load is what matters (github.com)

NPM debug and chalk packages compromised (aikido.dev)

I didn't bring my son to a museum to look at screens (sethpurcell.com)

Show HN: A store that generates products from anything you type in search (anycrap.shop)

I ditched Docker for Podman (codesmash.dev)

Germany is not supporting ChatControl – blocking minority secured (digitalcourage.social)

30 minutes with a stranger (pudding.cool)

Charlie Kirk killed at event in Utah (nbcnews.com)

Show HN: Term.everything – Run any GUI app in the terminal (github.com)

996 (lucumr.pocoo.org)

Next.js is infuriating (blog.meca.sh)

Show HN: I recreated Windows XP as my portfolio (mitchivin.com)

EU court rules nuclear energy is clean energy (weplanet.org)

The MacBook has a sensor that knows the exact angle of the screen hinge (twitter.com)

Anthropic agrees to pay $1.5B to settle lawsuit with book authors (nytimes.com)

Signal Secure Backups (signal.org)

Using Claude Code to modernize a 25-year-old kernel driver (dmitrybrant.com)

iPhone Air (apple.com)

Pontevedra, Spain declares its entire urban area a "reduced traffic zone" (greeneuropeanjournal.eu)

I replaced Animal Crossing's dialogue with a live LLM by hacking GameCube memory (joshfonseca.com)

Google can keep its Chrome browser but will be barred from exclusive contracts (cnbc.com)

UTF-8 is a brilliant design (iamvishnu.com)

We all dodged a bullet (xeiaso.net)

Stripe Launches L1 Blockchain: Tempo (tempo.xyz)

Mistral raises 1.7B€, partners with ASML (mistral.ai)

New Mexico is first state in US to offer universal child care (governor.state.nm.us)

Chat Control Must Be Stopped (privacyguides.org)

The treasury is expanding the Patriot Act to attack Bitcoin self custody (tftc.io)

“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

Almost anything you give sustained attention to will begin to loop on itself (henrikkarlsson.xyz)

Where's the shovelware? Why AI coding claims don't add up (mikelovesrobots.substack.com)

Models of European metro stations (stations.albertguillaumes.cat)

Google AI Overview made up an elaborate story about me (bsky.app)

iPhone dumbphone (stopa.io)

Claude Code: Now in Beta in Zed (zed.dev)

Why our website looks like an operating system (posthog.com)

Eternal Struggle (yoavg.github.io)

Corporations are trying to hide job openings from US citizens (thehill.com)

KDE launches its own distribution (lwn.net)

Many hard LeetCode problems are easy constraint problems (buttondown.com)

ICE is using fake cell towers to spy on people's phones (forbes.com)

Court rejects Verizon claim that selling location data without consent is legal (arstechnica.com)

Claude now has access to a server-side container environment (anthropic.com)

I'm absolutely right (absolutelyright.lol)

Hosting a website on a disposable vape (bogdanthegeek.github.io)

LLM Visualization (bbycroft.net)

Notes on Managing ADHD (borretti.me)

Serverless Horrors (serverlesshorrors.com)

MIT Study Finds AI Use Reprograms the Brain, Leading to Cognitive Decline (publichealthpolicyjournal.com)

A new visual language for content creation

Comments (4)