I want to believe that however obsolete these old algorithms are today, at least some aspects of the underlying code and/or logic should prove useful to LLMs as they try to generate modern code.
monkeyelite · 1d ago
The idea that ML is the only way to do computer vision is a myth.
Yes, it may not make sense to use classical algorithms to try to recognize a cat in a photo.
But there are often virtual or synthetic images which are produced by other means or sensors for which classical algorithms are applicable and efficient.
sokoloff · 1d ago
I worked (as an intern) on autonomous vehicles at Daimler in 1991. My main project was the vision system, running on a network of transputer nodes programmed in Occam.
The core of the approach was “find prominent horizontal lines, which exhibit symmetry about a vertical axis, and frame-to-frame consistency”.
Finding horizontal lines was done by computing variances in value. Finding symmetry about a vertical axis was relatively easy. Ultimately, a Kalman filter worked best for frame-to-frame tracking. (We processed roughly 120x90 output from the variance algorithm, which ran on a PAL video stream.)
There’s probably more computing power on a $10 ESP32 now, but I really enjoyed the experience and challenge.
This was our vehicle: https://mercedes-benz-publicarchive.com/marsClassic/en/insta...
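(For illustration only: a rough NumPy sketch of the variance-plus-symmetry idea described above. The ~120x90 resolution comes from the comment; the thresholds and scoring are invented, and the frame-to-frame Kalman tracking step is omitted.)

    import numpy as np

    def find_vehicle_candidates(frame_gray):
        # frame_gray: 2D uint8 array, e.g. 90x120 as in the comment above.
        img = frame_gray.astype(np.float32)

        # "Prominent horizontal lines": rows whose pixel values vary strongly
        # across the image tend to contain horizontal structure (bumpers, shadows).
        row_var = img.var(axis=1)
        strong_rows = np.flatnonzero(row_var > row_var.mean() + row_var.std())

        # "Symmetry about a vertical axis": compare each row to its left-right mirror.
        asym = np.abs(img - img[:, ::-1]).mean(axis=1)

        # Keep rows that are both high-variance and roughly symmetric.
        return [r for r in strong_rows if asym[r] < asym.mean()]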
That's awesome! What kind of hardware was needed to pull that off? And was the size of the bus any indication of the answer?
thatcat · 1d ago
Any recommendations on background reading for classical CV for radar?
monkeyelite · 1d ago
I don’t know anything about radar. I have a book called “Machine Vision” (Schunck, Jain, Kasturi), easy undergrad level, but also very useful. It’s $6 on Amazon.
ipunchghosts · 1d ago
Kasturi was my undergraduate honors advisor!
monkeyelite · 19h ago
Small world! These are always just names on a book to me.
even though I think Simon admits that most of it is obsolete after DL computer vision came about
monkeyelite · 17h ago
> is obsolete after DL computer vision came about
I just don’t understand this. Why would new technology invalidate real understanding and useful computer algorithms?
klodolph · 1d ago
Maybe… some of these algorithms from the 1980s struggled to do basic OCR, so they may need a lot of modification to be useful.
PaulHoule · 1d ago
That whole approach of "find edges, convert to line drawing, process a line drawing" in the 1980s struggled to do anything at all.
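(For anyone curious what that pipeline looks like with today's off-the-shelf pieces, a rough OpenCV sketch; Canny and the probabilistic Hough transform stand in for the 1980s edge and line steps, and the parameters and file name are arbitrary placeholders.)

    import cv2
    import numpy as np

    img = cv2.imread("scan.png", cv2.IMREAD_GRAYSCALE)   # placeholder input

    # Step 1: find edges.
    edges = cv2.Canny(img, 50, 150)

    # Step 2: convert the edge map into a "line drawing" (a list of segments).
    lines = cv2.HoughLinesP(edges, rho=1, theta=np.pi / 180, threshold=80,
                            minLineLength=30, maxLineGap=5)

    # Step 3: process the line drawing, e.g. keep the roughly horizontal segments.
    horizontal = []
    if lines is not None:
        for (x1, y1, x2, y2) in lines[:, 0]:
            if abs(y2 - y1) <= 0.1 * max(abs(x2 - x1), 1):
                horizontal.append((x1, y1, x2, y2))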
Retric · 1d ago
There was a surprising amount of useful OCR happening in the 70’s.
High error rates and significant manual rescanning can be acceptable in some applications, as long as there’s no better alternative.
GuB-42 · 1d ago
I find that modern OCR, audio transcription, etc... are beginning to have the opposite problem: they are too smart.
It means that they make a lot fewer mistakes, but when they do, the mistakes can be subtle. For example, if the text is "the bat escaped by the window", a dumb OCR might write "dat" instead of "bat". When you read the resulting text, you notice it and, using outside clues, recover the original word. A smart OCR will notice that "dat" isn't a word and may change it to "cat", and indeed "the cat escaped by the window" is a perfectly good sentence. Unfortunately, it is wrong and confusing.
devilbunny · 1d ago
Thankfully, most speech misrecognition events are still obvious. I have seen this in OCR and, as you say, it is bad. There are enough mistakes in the sources; let us not compound them.
taeric · 1d ago
I'm not sure I can sign on to this. In particular, this sounds kind of like an indictment of many algorithms. But how many were there? And did any go on to give good results?
Consider: OCR was a very new field, such that a lot of the struggle was getting data into a place where you could even try recognition against it. It should be no surprise that they were not able to succeed that often. It would be more surprising if they had a lot of different algorithms.
alightsoul · 1d ago
Amazing. Wonder how fast it would be on a modern computer
Hydration9044 · 1d ago
+1. Which is faster compared to OpenCV's findContours?
cyberax · 1d ago
One approach that blew my mind was the use of FFT to recognize objects.
FFT has this property that object orientation or location doesn't matter. As long as you have the signature of an object, you can recognize it anywhere!
changoplatanero · 1d ago
I believe orientation still matters but you’re right that position doesn’t.
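(A quick numerical check of both claims, in case it helps: the magnitude of the 2D FFT is unchanged by a circular shift of the object, but not by a rotation. The shift has to be circular, or the object has to stay fully in frame, for the invariance to hold exactly.)

    import numpy as np

    rng = np.random.default_rng(0)
    obj = np.zeros((64, 64))
    obj[20:30, 25:40] = rng.random((10, 15))             # an arbitrary asymmetric "object"

    shifted = np.roll(obj, shift=(7, -5), axis=(0, 1))   # translated (circularly)
    rotated = np.rot90(obj)                              # rotated by 90 degrees

    mag = lambda x: np.abs(np.fft.fft2(x))

    print(np.allclose(mag(obj), mag(shifted)))   # True: position doesn't matter
    print(np.allclose(mag(obj), mag(rotated)))   # False: orientation still does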
Legend2440 · 1d ago
FFT is equivalent to convolution, which is widely used today for object recognition in CNNs.
bobmcnamara · 1d ago
> FFT is equivalent to convolution
What do you mean by that? Could you give me an example?
kragen · 1d ago
The FFT, composed with pointwise multiplication, composed with the inverse FFT, is equivalent to convolution. The FFT is not.
https://en.wikipedia.org/wiki/Convolution_theorem
The FT is _NOT_ itself a convolution; rather, a specific operation on Fourier transforms (pointwise multiplication, followed by the inverse transform) is equivalent to convolution.
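(A small numerical check of that statement: pointwise multiplication of the FFTs, followed by the inverse FFT, matches direct circular convolution.)

    import numpy as np

    rng = np.random.default_rng(1)
    x = rng.random(16)
    k = rng.random(16)

    # Direct circular convolution: (x * k)[n] = sum_m x[m] * k[(n - m) mod N]
    direct = np.array([sum(x[m] * k[(n - m) % 16] for m in range(16))
                       for n in range(16)])

    # The same thing via the FFT: transform, multiply pointwise, transform back.
    via_fft = np.fft.ifft(np.fft.fft(x) * np.fft.fft(k)).real

    print(np.allclose(direct, via_fft))   # True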