Ask HN: Trusted Web

zhyder · 5/1/2025, 5:48:49 PM · 1 point · 2 comments
Is there any project out there that's trying to figure out how to filter the web down to the (shrinking) fraction that can be trusted? Trusted as in NOT AI slop, content spam, or intentionally poorly-researched/misleading content.

There are different thresholds for "trusted": the highest threshold that'd still be useful for search would include just the authoritative go-to sources for something, e.g. the official docs for an API or library. But that'd perhaps be too narrow: including stackoverflow links about that API or library's usage would be fine. In other domains like product reviews, you'd include only sites that have a reputation for actually testing the products.

Use-cases:

- Reliably good search results... Google et al are full of SEO spam now and will soon be full of AI slop
- Better grounding for LLMs than current search grounding... even if the LLM doesn't hallucinate, it can cite junk on the web
- Better pre-training data... I actually don't understand how LLMs will filter their own slop out of future pre-training runs

I'm not sure what form this should take if it doesn't exist yet. Maybe a github project or wiki curating links per domain (the Yahoo directory reinvented?), each of us curating our own bookmarks and sharing them (delicious reinvented?), or something else?
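One hypothetical shape for the github-project idea: a curated per-topic allowlist of trusted hosts, plus a small script that filters search results against it. Everything here (the topic names, the hosts listed, the `trusted` helper) is illustrative, not an existing project:

```python
# Sketch: filter a list of URLs against a curated per-topic allowlist.
# The allowlist contents and topic names below are made-up examples.
from urllib.parse import urlparse

# Curated "trusted" hosts per topic, as a community-maintained file might define them.
ALLOWLIST = {
    "python-api": {"docs.python.org", "stackoverflow.com"},
    "reviews": {"rtings.com"},
}

def trusted(url: str, topic: str) -> bool:
    """Return True if the URL's host is on the curated list for the topic."""
    host = urlparse(url).netloc.lower()
    # Accept exact host matches and subdomains of listed hosts.
    return any(host == h or host.endswith("." + h) for h in ALLOWLIST.get(topic, ()))

results = [
    "https://docs.python.org/3/library/json.html",
    "https://seo-spam.example/best-json-tips",
]
filtered = [u for u in results if trusted(u, "python-api")]
```

The same allowlist file could double as a grounding filter for an LLM's retrieval step, which is what makes the shared-repo form appealing over private bookmarks.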

Comments (2)

k310 · 12h ago
Hacker News. It's moderated and users Sherlock quickly and smartly.

Perhaps a "Hackerpedia" of archived articles, not just Algolia search, but organized in some {use your imagination} way.

TBH, when I search, I choose Wikipedia first, then get as close to the originator as possible.

Everyone's search results seem heavily weighted by "Shit for sale". And just as an aside, ebay seems to offer every search term for sale, including Richard Feynman and a slave ship.

zhyder · 9h ago
Yup I directly search Wikipedia often too. Hmm Hackerpedia could work, or ReddiRank.