OpenAI and a crypto company called Paradigm just released something called EVMbench. It is a benchmark, basically a testing tool to see how good AI models are at dealing with security problems in smart contracts, the code that runs on blockchains like Ethereum and holds over $100 billion in crypto.
Smart contracts can have bugs that let hackers steal funds, so this benchmark checks whether AI can help reduce that risk. They built it using 120 real vulnerabilities pulled from 40 professional security audits, including some from a blockchain project called Tempo.
The test has three main parts:
• Detect: Can the AI spot the bugs when it looks at the code?
• Patch: Can it fix the bugs without breaking how the contract is supposed to work?
• Exploit: Can it carry out an attack to drain the funds in a safe test environment (no real money)?
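To make the three parts a bit more concrete, here is a rough sketch of how each one could be scored. This is not EVMbench's actual grading code; the function names and the scoring rules are assumptions made up for illustration, based only on the descriptions above.

```python
# Hypothetical scoring rules for the three modes (illustrative only,
# not taken from the EVMbench implementation).

def detect_score(reported: set[str], known: set[str]) -> float:
    """Detect: fraction of the known bugs that the model reported."""
    if not known:
        return 0.0
    return len(reported & known) / len(known)


def patch_ok(tests_pass: bool, exploit_blocked: bool) -> bool:
    """Patch: the fix counts only if the contract still works AND the attack now fails."""
    return tests_pass and exploit_blocked


def exploit_ok(balance_before: int, balance_after: int) -> bool:
    """Exploit: the attack counts if the attacker's sandbox balance went up."""
    return balance_after > balance_before
```

For example, a patch that stops the attack but breaks the contract's normal behavior would not count under this sketch, which matches the "without breaking how the contract is supposed to work" condition.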
They ran some top AI models on it. The newest one from OpenAI, called GPT-5.3-Codex, did really well at the exploit part: it succeeded 72.2 percent of the time. That is a big jump from the earlier GPT-5, released about six months ago, which only managed about 31.9 percent.
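To put that jump in numbers, here is a quick calculation using only the two exploit success rates quoted above. The variable names are just for illustration.

```python
# Exploit success rates quoted in the post, in percent.
old_rate = 31.9  # earlier GPT-5 result
new_rate = 72.2  # newest Codex result

gain_points = new_rate - old_rate  # absolute gain in percentage points
ratio = new_rate / old_rate        # relative improvement

summary = f"{gain_points:.1f} percentage points, {ratio:.2f}x"
print(summary)  # prints "40.3 percentage points, 2.26x"
```

So the success rate went up by about 40 percentage points, or a bit more than double.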
The idea is to keep an eye on how fast AI is getting better at finding and fixing these issues, which is good for security, and also at exploiting them, which points to new risks we need to watch out for. They say it is important to use AI more for defense, like having it audit and strengthen contracts, to stay ahead of potential problems. Overall, it is a way to measure progress in this area and make blockchain stuff safer as AI gets smarter.
$ETH #OpenAI #AI #EVMBench #Web3 $AI