Binance Square

evmbench

167 views
4 people discussing
currentupdate

AI vs. Hackers: Measuring Smart Contract Security with EVMbench

OpenAI and a crypto company called Paradigm just released something called EVMbench. It is basically a benchmark: a testing tool to see how good AI models are at dealing with security problems in smart contracts, the code that runs on blockchains like Ethereum and holds over $100 billion in crypto.

Smart contracts can have bugs that let hackers steal funds, so this benchmark checks whether AI can help reduce that risk. They built it from 120 real vulnerabilities pulled from 40 professional security audits, including some from a blockchain project called Tempo.

The test has three main parts:
• Detect: Can the AI spot the bugs when it looks at the code?
• Patch: Can it fix the bugs without breaking how the contract is supposed to work?
• Exploit: Can it carry out an attack that drains the funds, in a safe test environment with no real money at stake?
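
For readers who think in code, here is a minimal sketch of how success rates for the three parts above could be tallied. The `TaskResult` record, the task identifiers, and the sample data are all made up for illustration; this is not EVMbench's actual harness or its results.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical record: one attempt by a model at one task in one mode.
@dataclass(frozen=True)
class TaskResult:
    mode: str        # "detect", "patch", or "exploit"
    task_id: str     # made-up identifier for a vulnerability
    success: bool    # did the attempt pass that mode's check?

def success_rates(results):
    """Return the fraction of successful attempts per mode."""
    passed = defaultdict(int)
    total = defaultdict(int)
    for r in results:
        total[r.mode] += 1
        passed[r.mode] += int(r.success)
    return {mode: passed[mode] / total[mode] for mode in total}

# Made-up sample data, for illustration only.
sample = [
    TaskResult("detect", "vuln-001", True),
    TaskResult("detect", "vuln-002", False),
    TaskResult("patch", "vuln-001", True),
    TaskResult("exploit", "vuln-001", True),
    TaskResult("exploit", "vuln-002", True),
    TaskResult("exploit", "vuln-003", False),
    TaskResult("exploit", "vuln-004", False),
]
rates = success_rates(sample)
```

A headline figure like an exploit success rate is then just one entry of `rates`, reported as a percentage.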

They ran some top AI models on it. The newest one from OpenAI, called GPT-5.3-Codex, did really well at the exploit part: it succeeded 72.2 percent of the time. That is a big jump from the earlier GPT-5, which managed only about 31.9 percent six months ago.
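
To put the jump in numbers, the short calculation below uses only the two exploit rates quoted above. The variable names are just for illustration.

```python
# Exploit-mode success rates quoted in the post, in percent.
gpt5_rate = 31.9
gpt53_codex_rate = 72.2

# Absolute gain in percentage points and relative improvement factor.
gain_points = round(gpt53_codex_rate - gpt5_rate, 1)
ratio = round(gpt53_codex_rate / gpt5_rate, 2)

print(f"Gain: {gain_points} percentage points, about {ratio}x the earlier rate")
```

So the newer model gains roughly 40 percentage points and succeeds a bit more than twice as often as the earlier one.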

The idea is to keep an eye on how fast AI is getting better at finding and fixing these issues, which is good for security, and also at exploiting them, which points to new risks we need to watch out for. They say it is important to use AI more for defense, such as having it audit and strengthen contracts, to stay ahead of potential problems. Overall, it is a way to measure progress in this area and make blockchain systems safer as AI gets smarter.
$ETH

#OpenAI
#AI
#EVMBench
#Web3
$AI

OpenAI + Paradigm Drop EVMBench – AI Shield for Crypto Tokens & Smart Contracts!

Game-changer for Web3 security! OpenAI just launched EVMBench, a cutting-edge benchmarking system built with Paradigm to test AI agents on detecting, exploiting, and patching vulnerabilities in Ethereum smart contracts and tokens.

Core Features
EVMBench evaluates AI across three modes in a sandboxed EVM:
• Detection: Spot flaws in Solidity code (tokens, DeFi).
• Exploitation: Simulate real attacks.
• Patching: Auto-fix issues with explanations.
Open dataset of real/vulnerable contracts, plus a $10M OpenAI cybersecurity fund. Aims to slash the billions lost to hacks via standardized AI audits.

Impact
Paradigm calls it an "open benchmark revolution": no more scaling issues with manual audits. OpenAI's first crypto-specific tool post-EO clarity. DeFi safer, faster launches ahead!
Bullish for AI-blockchain fusion? Thoughts?
#OpenAI #EVMBench #SmartContractSecurity #CryptoAi #paradigm
$ETH
OpenAI and Paradigm to launch AI agent tool for smart contract security

#OpenAI has partnered with #Paradigm to launch #EVMbench, a benchmark measuring how well AI agents can identify, fix, and exploit critical smart contract vulnerabilities. EVMbench draws on 120 real-world vulnerabilities across 40 audits, including scenarios from the #Tempo blockchain audit focused on high-volume stablecoin payments.

EVMbench measures three core capabilities. Detect scores how well agents identify known smart contract flaws. Patch evaluates whether agents can fix issues without breaking intended behavior. Exploit tests agents’ ability to execute full fund-draining attacks in a sandboxed blockchain environment.
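
As a rough illustration of the Detect and Patch criteria described above, the sketch below scores detection as the share of known flaws an agent reports, and accepts a patch only when intended behavior survives and the attack no longer works. Both function names, their parameters, and the finding labels are assumptions made for this example, not EVMbench's published scoring rules.

```python
def detect_recall(reported, known):
    """Share of known vulnerabilities that the agent's report covers."""
    known = set(known)
    if not known:
        raise ValueError("need at least one known vulnerability")
    return len(known & set(reported)) / len(known)

def patch_accepted(tests_pass, exploit_still_works):
    """A patch counts only if behavior is preserved and the attack is blocked."""
    return tests_pass and not exploit_still_works

# Made-up finding labels: the agent reports two of four known flaws
# plus one finding ("X-99") that is not in the known set.
recall = detect_recall(
    reported={"H-01", "H-03", "X-99"},
    known={"H-01", "H-02", "H-03", "H-04"},
)
```

Here `recall` comes out to one half, and a patch that keeps the tests green but leaves the exploit working would still be rejected.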

👉 openai.com/index/introducing-evmbench/