BaxBench: Can LLMs Generate Secure and Correct Back Ends?

2 chillax 1 7/2/2025, 7:22:31 PM baxbench.com ↗

Comments (1)

lucasluitjes · 11h ago
I've seen LLMs generate plenty of wildly insecure code, but the percentage of functional solutions that are insecure is even higher than I expected.

Also, I'm curious how the average coder would fare on this benchmark.