Ask HN: What are you actually using LLMs for in production?
44 Satam 55 6/28/2025, 2:46:20 PM
Beyond the obvious chatbots and coding copilots, curious what people are actually shipping with LLMs. Internal tools? Customer-facing features? Any economically useful agents out there in the wild?
I did some really basic napkin math with some Rails logs. One request with some extra junk in it was about 400 tokens according to the OpenAI tokenizer[0]. 500M/400 = ~1.25 million log lines.
Paying linearly for logs at $20 per 1.25 million lines is not reasonable for mid-to-high scale tech environments.
I think this would be sufficient if a 'firehose of data' means a bunch of news/media/content feeds that need to be summarized/parsed/guessed at.
[0] https://platform.openai.com/tokenizer
$20 could cover half a billion tokens with those models! That's a lot of firehose.
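The napkin math above can be checked with a few lines of arithmetic (the 400-tokens-per-line and $20-per-500M-tokens figures come straight from the comments; both are rough estimates):

```python
# Napkin math: how many ~400-token log lines fit into a 500M-token budget,
# and what that works out to per million lines.
TOKENS_PER_LINE = 400          # one Rails log line + extra junk, per the OpenAI tokenizer
TOKEN_BUDGET = 500_000_000     # half a billion tokens for $20
BUDGET_USD = 20.0

lines = TOKEN_BUDGET // TOKENS_PER_LINE
cost_per_million_lines = BUDGET_USD / (lines / 1_000_000)

print(lines)                   # 1250000 log lines
print(cost_per_million_lines)  # 16.0 dollars per million lines
```

So the $20 buys about 1.25M lines, i.e. roughly $16 per million log lines, which is the figure that makes it unreasonable at mid-to-high log volume.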
We scrape job sites and use that prompt to create tags which are then searchable by users in our interface.
It was a bit surprising to see how Karpathy described software 3.0 in his recent presentation because that's exactly what we're doing with that prompt.
Software 2.0: We need to parse a bunch of different job ads. We'll have a rule engine, decide based on keywords what to return, do some filtering, maybe even semantic similarity to descriptions we know match with a certain position, and so on
Software 3.0: We need to parse a bunch of different job ads. Create a system prompt that says "You are a job description parser. Based on the user message, return a JSON structure with title, description, salary-range, company, position, experience-level" and so on, pass it the JSON schema of the structure you want, and you have a parser that is slow and sometimes incorrect but (most likely) covers a much broader range than your Software 2.0 parser.
Of course, this is wildly simplified and doesn't include everything, but that's the difference Karpathy is trying to highlight. Instead of programming those rules for the parser yourself, you "program" the LLM via prompts to do that thing.
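A hypothetical sketch of that Software 3.0 parser: the "program" is just the system prompt plus a JSON schema. The field names mirror the ones listed above; the actual model call (an OpenAI-style chat API with structured output) is left out, so this only shows the payload you'd send:

```python
# "Software 3.0" job-ad parser sketch: prompt + schema instead of a rule engine.
# The client call itself is omitted; build_request() just assembles the payload.

SYSTEM_PROMPT = (
    "You are a job description parser. Based on the user message, return a "
    "JSON structure with title, description, salary-range, company, "
    "position, experience-level."
)

JOB_AD_SCHEMA = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "description": {"type": "string"},
        "salary-range": {"type": "string"},
        "company": {"type": "string"},
        "position": {"type": "string"},
        "experience-level": {"type": "string"},
    },
    "required": ["title", "company"],
}

def build_request(raw_job_ad: str) -> dict:
    """Assemble the chat payload; an OpenAI-style client would send this
    with a json_schema response format to force valid JSON back."""
    return {
        "messages": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": raw_job_ad},
        ],
        "schema": JOB_AD_SCHEMA,
    }

req = build_request("Senior Rails developer at Acme, $120k-$150k, remote")
print(sorted(req["schema"]["properties"]))
```

The point is that the entire "parser" lives in those two data structures; swapping rules means editing prose, not code.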
It processes Steam game reviews and provides a one-page summary of what people think about the game. I've been gradually improving it and adding features from community feedback. It's been good fun.
What I found interesting with Vaporlens is that it surfaces the things people think about a game - and if you find games where you like all the positives and don't mind the largest negatives (because those are very often very subjective), you're in for a pretty good time.
It's also quite amusing to me that using fairly basic vector similarity on points text resulted in a pretty decent "similar games" section :D
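That "similar games" trick can be sketched in a few lines: cosine similarity between embedding vectors of each game's summarized review points. The vectors below are toy data; in practice they'd come from an embedding model:

```python
import math

def cosine_similarity(a, b):
    # Standard cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def most_similar(target, catalog):
    """Return catalog game names ranked by similarity to the target vector."""
    return sorted(catalog, key=lambda name: cosine_similarity(target, catalog[name]), reverse=True)

# Toy "review point" embeddings, purely illustrative.
games = {
    "roguelike_a": [0.9, 0.1, 0.0],
    "farming_sim": [0.1, 0.9, 0.2],
    "roguelike_b": [0.8, 0.2, 0.1],
}
print(most_similar([1.0, 0.0, 0.0], games))  # roguelikes rank above the farming sim
```

Nothing fancy needed - nearest neighbours over the summary text embeddings already give a decent "players who liked this..." section.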
ATM I use ChatGPT Plus for everything except coding inside my Jetbrains IDEs.
I'm starting to look around at other LLMs for non-coding purposes (brainstorming, docs, being a project manager, summarizing, learning new subjects, etc.).
People use it to generate meeting notes. I don't like it and don't use it.
You used to either budget for data entry or just graft directories in a really ugly way. The forest used to know about 12000 unique access roles and now there are only around 170.
We're delivering confusion, and thanks to LLMs we're 30% more efficient at doing it.
[1] https://www.fisherloop.com/en/
I have a js-to-video service (open source sdk, WIP) [1] with the classic "editor to the left - preview on the right" scenario.
To help write the template code I have a simple prompt input + api that takes the llms-full.txt [2] + code + instructions and gives me back updated code.
It's more "write this stuff for me" than vibe-coding, as it isn't conversational for now.
I've not been bullish on ai coding so far, but this "hybrid" solution is perfect for this particular use-case IMHO.
[1] https://js2video.com/play [2] https://js2video.com/llms-full.txt
If everyone is using it now, prompts aren't a good gauge.
If all you've built is RAG apps up to this point, I highly recommend playing with some LLM-in-a-loop-with-tools reasoning agents. Totally new playing field.
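The loop itself is tiny, which is part of the appeal. Here's a bare-bones sketch of the pattern with the model stubbed out; a real agent would replace fake_model() with a chat-completions call that can emit tool requests:

```python
# Minimal LLM-in-a-loop-with-tools skeleton. The "model" here is a stub
# so the loop structure is visible; everything else is the real pattern.

def calculator(expression: str) -> str:
    # A single tool the agent may invoke. eval is sandboxed for the demo.
    return str(eval(expression, {"__builtins__": {}}))

TOOLS = {"calculator": calculator}

def fake_model(history):
    # Stand-in for the LLM: request the tool once, then give a final answer.
    if not any(m["role"] == "tool" for m in history):
        return {"tool": "calculator", "args": "6 * 7"}
    return {"answer": f"The result is {history[-1]['content']}"}

def run_agent(question, max_steps=5):
    history = [{"role": "user", "content": question}]
    for _ in range(max_steps):
        reply = fake_model(history)
        if "tool" in reply:                       # model asked for a tool call
            result = TOOLS[reply["tool"]](reply["args"])
            history.append({"role": "tool", "content": result})
        else:                                     # model produced a final answer
            return reply["answer"]

print(run_agent("What is 6 * 7?"))  # The result is 42
```

Compared to a RAG pipeline, the interesting difference is that the model decides the control flow: it keeps requesting tools until it's satisfied, and the harness just executes and feeds results back.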
That said, it requires the user to sign in with their real work email or the results are way off.
For example, I wrote a recent blog post on how I use LLMs to generate excel files with a prompt (less about the actual product and more about how to improve outcomes): https://maxirwin.com/articles/persona-enriched-prompting/
Getting those events onto a usable, sharable calendar is much easier now.
https://apps.apple.com/us/app/forceai-ai-workout-generator/i...
Pretty much 5-6 niche classification use cases.
Used it to get a deeper understanding of a complex code base, create system design architecture diagrams, and help onboard new engineers.
Summarizing large data dumps that users were frustrated with.
2. I build REPLs into any manual workflow that makes use of LLMs. Instead of just going "F@ck, it didn't work!", you can tell the LLM why it didn't work and help it get the right answer. Saves a ton of time.
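That feedback loop from point 2 can be sketched like so: run the generated code, and on failure feed the error back as context for a retry instead of discarding the attempt. The model is a stub here; a real version would call an LLM API with the error message appended:

```python
# "REPL in the loop" sketch: capture the failure and hand it back to the model.

def stub_llm(prompt, feedback=None):
    # Stand-in for the LLM: first attempt is buggy, fixed once it sees the error.
    if feedback is None:
        return "result = 10 / 0"
    return "result = 10 / 2"

def repl_loop(prompt, attempts=3):
    feedback = None
    for _ in range(attempts):
        code = stub_llm(prompt, feedback)
        scope = {}
        try:
            exec(code, scope)
            return scope["result"]
        except Exception as e:
            feedback = f"Your code raised: {e!r}"  # tell the LLM why it failed
    raise RuntimeError("LLM never produced working code")

print(repl_loop("divide 10 by something"))  # 5.0
```

The win is exactly what the comment says: one round of "here's the traceback" usually beats re-prompting from scratch.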
3. Coming up with color palettes, themes, and ideas for "content". LLMs are really good at pumping out good-looking input for whatever factory you have built.