What the author is trying to get at in the admittedly poorly worded question is that the trials are noisy measures of an underlying effect. Your job is to sort by effect size, while accounting for the random chance that a low sample size trial just got unlucky.

You might argue that the question is much harder than the author assumes, since your best guess at the actual effect size seems like it should still just be the success rate, even if the low sample size trials have wider error bars. You'd need to come up with some sort of heuristic that says why 7/9 deserves a lower rank than 50/70 using binomial confidence intervals.

Probably that heuristic is intended to be a Bayesian approach? Like, if you add just two successes and two failures to each scenario as a prior, that's enough to put the 50/70 option ahead.
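To make the arithmetic concrete, here is a minimal sketch of that pseudo-count idea. The function name `smoothed_rate` and its parameters are made up for illustration; only the two fractions (7/9 and 50/70) and the "two successes, two failures" prior come from the comment above.

```python
from fractions import Fraction


def smoothed_rate(successes, trials, prior_successes=2, prior_failures=2):
    """Success rate after adding pseudo-counts (hypothetical helper).

    Adding prior_successes and prior_failures pulls small samples
    toward 50% more strongly than large samples.
    """
    return Fraction(
        successes + prior_successes,
        trials + prior_successes + prior_failures,
    )


# Raw rates: 7/9 is about 0.778, 50/70 is about 0.714, so 7/9 ranks first.
raw_small = Fraction(7, 9)
raw_large = Fraction(50, 70)

# Smoothed rates: 9/13 is about 0.692, 52/74 is about 0.703, so 50/70 ranks first.
smooth_small = smoothed_rate(7, 9)
smooth_large = smoothed_rate(50, 70)

print(float(raw_small), float(raw_large))
print(float(smooth_small), float(smooth_large))
```

The ordering flips because four pseudo-observations are a large share of a 9-trial sample but a small share of a 70-trial one.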

kruffalon · 13m ago

I wrote the deleted comment you are replying to.

The essence of my comment was that this text/test is not for me (one person of the general public) but more like a few leetcode-style questions for statisticians.

Your attempt to explain what I didn't understand just proves my point as I don't really understand what you are saying either.

And that's ok: this is just not for me! (And that's why I deleted my original comment)

jldugger · 28m ago

And I guess since they answer the questions at the bottom, it seems their intent is indeed the simplistic approach:

> The lower bound of which can be used to order the fractions, and so control the risk of over-estimation.

It's not clear to me from the question whether the cost of a mistake is in over-estimating the underlying effect or in misranking the effects, and that seems like it would drive your heuristic selection.

usgroup · 21m ago

From the question:

“However, it is very important that the uncertainty in the number of trials is taken into account because over-estimating a fraction is a costly mistake.”

Seems fairly clear to me that you’re supposed to use a lower-bound estimate that accounts for the variance in the fraction due to the number of trials, in a way that bounds the chance of over-estimation.

Further, there is no need for a heuristic when there are several statistical models for this exact problem with clear properties. Some are given in the answer.
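As an illustration of ranking by a lower bound, here is a rough sketch using the Wilson score interval. This is an assumption on my part: the thread does not say which models the answer uses, and Wilson is just one common choice for a binomial proportion. The function name `wilson_lower` and the default `z=1.96` (roughly a 95% two-sided interval) are also illustrative choices.

```python
import math


def wilson_lower(successes, trials, z=1.96):
    """Lower end of the Wilson score interval for a binomial proportion.

    z=1.96 is an assumed default, not something taken from the question.
    """
    if trials == 0:
        return 0.0
    p = successes / trials
    denom = 1 + z * z / trials
    center = p + z * z / (2 * trials)
    spread = z * math.sqrt(p * (1 - p) / trials + z * z / (4 * trials * trials))
    return (center - spread) / denom


# The two fractions discussed in the thread, as (successes, trials).
candidates = [(7, 9), (50, 70)]

# Sort by lower bound, highest first, instead of by raw success rate.
ranked = sorted(candidates, key=lambda st: wilson_lower(*st), reverse=True)

for s, n in ranked:
    print(f"{s}/{n}: raw={s / n:.3f} lower={wilson_lower(s, n):.3f}")
```

With these inputs the 50/70 trial ranks above 7/9, because the wider interval on nine trials drags its lower bound well below its raw rate.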

thekoma · 6m ago

Out of context, the expression "the uncertainty in the number of trials" would refer to missing knowledge in terms of how many trials actually ran.

In the context of the post this doesn't make sense, so the reader is left to hypothesize what the writer actually meant.

taylorius · 6m ago

I think "uncertainty due to the number of trials" would be clearer.

A short statistical reasoning test

Comments (6)