Our tests with accelerated Elastic Models suggest they break the task down by identifying features line by line, then combine those features using statistical patterns common in ASCII art.
If you think about it, the attention scores end up looking like a heatmap of the original image. In other words, the transformer builds an internal representation of the image, and if it can recognize images out of the box, it's acting as a kind of image classifier that outputs a token ID as the image class. The tests are, to be honest, trivial, but fun anyway :)
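The "attention as a heatmap" intuition can be illustrated with a toy, untrained single-head self-attention over a character-level ASCII-art input. This is a minimal sketch with random projection weights (not any real model's weights): because identical characters get identical embeddings, the attention matrix correlates with where the `*` pixels sit, and averaging the attention each token receives gives a grid roughly shaped like the original art.

```python
import numpy as np

# Hypothetical sketch: one self-attention head over an ASCII-art input.
# All weights are random and untrained; this only illustrates how attention
# scores over character tokens can mirror the spatial layout of the art.

np.random.seed(0)

ascii_art = [
    "  **  ",
    " *  * ",
    "  **  ",
]
tokens = list("".join(ascii_art))           # character-level tokens
d = 8                                       # toy embedding dimension

# Toy embeddings: the same character maps to the same vector,
# so positions holding '*' correlate with each other.
vocab = {ch: np.random.randn(d) for ch in set(tokens)}
X = np.stack([vocab[ch] for ch in tokens])  # (seq_len, d)

# Random query/key projections standing in for a trained head.
Wq, Wk = np.random.randn(d, d), np.random.randn(d, d)
scores = (X @ Wq) @ (X @ Wk).T / np.sqrt(d)

# Row-wise softmax to get attention weights.
attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
attn /= attn.sum(axis=-1, keepdims=True)

# Average attention each token receives, reshaped to the art's grid --
# printed as a crude text heatmap.
received = attn.mean(axis=0).reshape(len(ascii_art), -1)
for row in received:
    print(" ".join(f"{v:.2f}" for v in row))
```

A trained model's heads are of course far more structured than these random projections, but the reshape-back-to-grid trick is the same one you'd use to visualize real attention maps.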
Try Deepseek-Qwen-14B in our tutorial - it runs at 120 tok/s on an H100 and 40 tok/s on an L40S, up to 3x faster than the original implementation! It's fully free: get your API token and start!