BaxBench: Can LLMs Generate Secure and Correct Back Ends?

2 chillax 1 7/2/2025, 7:22:31 PM baxbench.com โ†—

Comments (1)

lucasluitjes ยท 17h ago
I've seen LLMs generate plenty of wildly insecure code, but the percentage of insecure solutions out of the solutions that are functional, is even higher than I expected.

Also, I'm curious how the average coder would fare on this benchmark.