What's the best way to benchmark neuro‑symbolic‑causal AI agents?

aytuakarlar · 9/12/2025, 9:54:59 AM · github.com

Comments (1)

aytuakarlar · 2h ago
I’m building Project Chimera, an open‑source neuro‑symbolic‑causal AI framework. The goal:

Combine LLMs (for hypothesis generation), symbolic rules (for safety & domain constraints), and causal inference (for estimating true impact) into a single decision loop.

In long‑horizon simulations, this approach seems to preserve both profit and trust better than LLM‑only or non‑symbolic agents — but I’m still refining the architecture and benchmarks.
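As a rough illustration of that decision loop, here is a minimal toy sketch: an LLM stand-in proposes candidate actions, symbolic rules filter out unsafe ones, and a causal model picks the surviving action with the best estimated effect. All names (`propose_actions`, `SafetyRules`, `CausalModel`) and the toy elasticity equation are illustrative assumptions, not Project Chimera's actual API.

```python
from dataclasses import dataclass

@dataclass
class Action:
    name: str
    price_change: float  # fractional price adjustment, e.g. -0.1 = 10% discount

def propose_actions(state: dict) -> list[Action]:
    """Neural step: stand-in for an LLM generating candidate actions."""
    return [Action("discount", -0.10), Action("raise", 0.15), Action("hold", 0.0)]

class SafetyRules:
    """Symbolic step: hard domain constraints that filter candidates."""
    MAX_PRICE_CHANGE = 0.10

    def allows(self, action: Action) -> bool:
        return abs(action.price_change) <= self.MAX_PRICE_CHANGE

class CausalModel:
    """Causal step: estimate each surviving action's effect on the objective."""
    def estimate_effect(self, state: dict, action: Action) -> float:
        # Toy structural equation, purely for illustration: demand responds
        # to price with a fixed elasticity, penalizing large swings.
        elasticity = -2.0
        return state["profit"] * (1 + elasticity * action.price_change * abs(action.price_change))

def decide(state: dict) -> Action:
    candidates = propose_actions(state)
    safe = [a for a in candidates if SafetyRules().allows(a)]
    return max(safe, key=lambda a: CausalModel().estimate_effect(state, a))

print(decide({"profit": 100.0}).name)  # "raise" is filtered out as unsafe
```

The key structural point is that the symbolic filter runs *before* the causal ranking, so the generative component can never select an action the rules forbid, no matter how attractive its estimated effect looks.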

I’d love to hear from the HN community:

• If you’ve built agents that reason about cause–effect, what design choices worked best?

• How do you benchmark reasoning quality beyond prediction accuracy?

• Any pitfalls to avoid when mixing symbolic rules with generative models?
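On the second question, one metric I've been considering is interventional regret: instead of scoring how well the agent *predicts* outcomes, score the gap between the outcome of the action it chose and the best outcome achievable in the simulator. The sketch below is a toy stand-in (the quadratic `true_effect` simulator is an assumption for illustration, not part of the framework):

```python
def true_effect(action: float, state: float) -> float:
    """Ground-truth simulator: the real causal outcome of an intervention.
    Toy form with a single optimum at action = 0.3."""
    return state - (action - 0.3) ** 2

def regret(agent_action: float, state: float, candidates: list[float]) -> float:
    """Gap between the best achievable outcome and the agent's outcome.
    Zero means the agent chose optimally; larger is worse."""
    best = max(true_effect(a, state) for a in candidates)
    return best - true_effect(agent_action, state)

candidates = [0.0, 0.1, 0.2, 0.3, 0.4]
print(regret(0.1, 1.0, candidates))  # agent chose 0.1; optimum in the set is 0.3
```

A regret-style score separates "the model predicted the number well" from "the agent actually made a good decision", which is exactly the gap that prediction accuracy hides.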

GitHub (for context): https://github.com/akarlaraytu/Project-Chimera

Thanks in advance — I’ll be around to answer questions and share results from this discussion.