The Original Permissionless JPEG: From malware payload to Bitcoin blockchain (theoriginalpermissionlessjpeg.com)

I want to believe that however obsolete these old algorithms are today, at least some aspects of the underlying code and/or logic should prove useful to LLMs as they try to generate modern code.

monkeyelite · 16h ago

The idea that ML is the only way to do computer vision is a myth.

Yes, it may not make sense to use classical algorithms to try to recognize a cat in a photo.

But there are often virtual or synthetic images which are produced by other means or sensors for which classical algorithms are applicable and efficient.

sokoloff · 13h ago

I worked (as an intern) on autonomous vehicles at Daimler in 1991. My main project was the vision system, running on a network of transputer nodes programmed in Occam.

The core of the approach was “find prominent horizontal lines, which exhibit symmetry about a vertical axis, and frame-to-frame consistency”.

Finding horizontal lines was done by computing variances in value. Finding symmetry about a vertical axis was relatively easy. Ultimately, a Kalman filter worked best for frame-to-frame tracking. (We processed video in around 120x90 output from variance algorithm, which ran on a PAL video stream.)

There’s probably more computing power on a $10 ESP32 now, but I really enjoyed the experience and challenge.

This was our vehicle: https://mercedes-benz-publicarchive.com/marsClassic/en/insta...

thatcat · 14h ago

Any recommendations on background reading for classical CV for radar?

monkeyelite · 9h ago

I don’t know anything about radar. I have a book called “machine vision” (Shmuck, Jain, Kasturi) easy undergrad level, but also very useful. It’s $6 on Amazon.

klodolph · 16h ago

Maybe… some of these algorithms from the 1980s struggled to do basic OCR, so they may need a lot of modification to be useful.

PaulHoule · 16h ago

That whole approach of "find edges, convert to line drawing, process a line drawing" in the 1980s struggled to do anything at all.

Retric · 15h ago

There was a surprising amount of useful OCR happening in the 70’s.

High error rates and significant manual rescanning can be acceptable in some applications, as long as there’s no better alternative.

GuB-42 · 12h ago

I find that modern OCR, audio transcription, etc... are beginning to have the opposite problem: they are too smart.

It means that they make a lot fewer mistakes, but when they do, it can be subtle. For example, if the text is "the bat escaped by the window", a dumb OCR can write "dat" instead of "bat". When you read the resulting text, you notice it and using outside clues, recover the original word. An smart OCR will notice that "dat" isn't a word and can change it for "cat", and indeed "the cat escaped by the window" is a perfectly good sentence, unfortunately, it is wrong and confusing.

devilbunny · 8h ago

Thankfully, most speech misrecognition events are still obvious. I have seen this in OCR and, as you say, it is bad. There are enough mistakes in the sources; let us not compound them.

alightsoul · 17h ago

Amazing. Wonder how fast it would be on a modern computer

Hydration9044 · 16h ago

+1, which is faster when compare to OpenCV findContours

cyberax · 13h ago

One approach that blew my mind was the use of FFT to recognize objects.

FFT has this property that object orientation or location doesn't matter. As long as you have the signature of an object, you can recognize it anywhere!

changoplatanero · 12h ago

I believe orientation still matters but you’re right that position doesn’t.

Legend2440 · 12h ago

FFT is equivalent to convolution, which is widely used today for object recognition in CNNs.

bobmcnamara · 9h ago

> FFT is equivalent to convolution

What do you mean by that? Could you give me an example?

timewizard · 8h ago

The basic convolution theorem.

https://en.wikipedia.org/wiki/Convolution_theorem

Experimenting with no-build Web Applications • AndreGarzia.com (andregarzia.com)

50 Years of Microsoft and Developer Tools with Scott Guthrie (newsletter.pragmaticengineer.com)

The History of R2E and the Micral (abortretry.fail)

Show HN: Loregrep – In Memory RepoMap for coding assistants (github.com)

Red Hat just transformed enterprise server Linux (zdnet.com)

The Original 300B (westernelectric.com)

American Science and Surplus is fighting for its life. Why Should You Care? (arstechnica.com)

Top AI Tools

A Programming System (2023) (andreyor.st)

Advertising is coming to AI, and it's going to be product placement on steroids (sebs.website)

A Writers frustrating experience with ChatGPT's approach to lying (amandaguinzburg.substack.com)

Django, JavaScript Modules and Importmaps (406.ch)

Viralia Project (indiegogo.com)

Tesla Optimus photoshoot with influencer Anna Malygon (coeval-magazine.com)

The Original Permissionless JPEG: From malware payload to Bitcoin blockchain (theoriginalpermissionlessjpeg.com)

Detecting, Exploiting, Remediating a Path Traversal Vulnerability Across GitHub (arxiv.org)

Magnetic 3D-printed pen could help diagnose people with Parkinson's (theguardian.com)

Geojob App (indiegogo.com)

Mellon "We Are Not Alone" – A Reflection on UAP and Humanity's Cosmic Context [video] (youtube.com)

'I happened to be sitting next to Bill Joy at UCB when he wrote the first "yes"' (github.com)

The Steve Ballmer Interview (open.spotify.com)

Building a Catalytic Computer over the Weekend (leetarxiv.substack.com)

An Interview with Cursor Co-Founder and CEO Michael Truell About Coding with AI (stratechery.com)

Open Table Format Revolution: Why Hyperscalers Are Betting on Managed Iceberg (rilldata.com)

Morley Safer Award Winners (2019) (morleysaferaward.briscoecenter.org)

Manatees as Gardeners of the Amazon (worldsensorium.com)

The Mathematical Mysteries of Fireflies (nautil.us)

Tesla loses another manager to layoffs – but this one quit due to morale (electrek.co)

Nvidia ISO-26262 Spark Process (nvidia.github.io)

Inkscape UI Vision Going Forward [video] (media.ccc.de)

Yorick programming language for scientific computations (yorick.sourceforge.net)

Over $1B in federal funding got slashed for this polluting industry (technologyreview.com)

10 Years of Betting on Rust (tably.com)

Scope of Work Generator (chromewebstore.google.com)

UK tech job openings climb 21% to pre-pandemic highs (theregister.com)

GOP rage with Musk spills out privately after break with Trump (axios.com)

Consumer groups filed a complaint against SHEIN for dark patterns (beuc.eu)

Ask HN: Self Hosted Cloud Stack

What Palantir Does (twitter.com)

2010 U.S.-Russia Treaty Helped Kyiv's UAVs Destroy Russian Nuclear Bombers (eurasiantimes.com)

Synee – A Socializing App Reinventing Event Hosting

Pice: India's #1 Invoicing and Payment Collections Software (piceapp.com)

DNS4EU for Public Is Available (joindns4.eu)

GenAI-Assisted Fantasies (cacm.acm.org)

Core Database Design (andyatkinson.com)

How NASA advisory committees are navigating a new political landscape (spacenews.com)

Andrej Karpathy: Opaque, Non-Scriptable UI-Heavy Products Risk Obsolescence (twitter.com)

WhatsApp's Billion-User Database: How FreeBSD and Erlang Handled the Impossible (medium.com)

There Aren't Enough Engineers to Meet Growing Hunger for Power (bloomberg.com)

Deriod Calculator (deriod.net)

When memory was measured in kilobytes: The art of efficient vision

Comments (20)