The ML research team at Voxel51 just released a paper showing that foundation models rival the accuracy of human annotators in labeling large visual datasets, at several orders of magnitude less time and cost.
We also found that models trained on these labels perform about as well as those trained on human labels when evaluated on public validation sets. Interestingly, setting a relatively low confidence threshold (0.2–0.5) for the auto-generated labels maximized downstream model performance; very high confidence thresholds often produced worse results due to reduced recall.
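For example, applying such a threshold in FiftyOne is a one-liner (a minimal sketch; the dataset name and the "predictions" field are illustrative, and it assumes zero-shot detections with per-label confidences are already stored):

    import fiftyone as fo
    from fiftyone import ViewField as F

    # Load a dataset whose samples carry zero-shot predictions
    # ("my-dataset" is a hypothetical name)
    dataset = fo.load_dataset("my-dataset")

    # Keep only auto-generated labels at or above the chosen threshold
    high_recall_view = dataset.filter_labels(
        "predictions", F("confidence") >= 0.2
    )

You can then export or train directly from the filtered view.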
The upshot is that zero-shot labeling can replace human annotation for many datasets, and the massive cost savings can be redirected toward training larger models.
Happy to answer any questions about the research. You can also read the blog post we wrote, which goes into more depth on the methods and tools we used:
https://link.voxel51.com/HN-VAL-blog/