FREE MEETING: KEY TRENDS AND RISKS IN NFT GAMES– REGISTER

Crypto Cipherium
  • Home
  • News
    Jane Road faucets former Barclays e-FX dealer
    Market

    Jane Road faucets former Barclays e-FX dealer

    Jane Road has employed George Fordham, a former director inside Barclays’ digital…

    By Editor
    June 5, 2026
    10 Jobs Hiring Instantly and 10 That Take Months to Fill
    Money
    10 Jobs Hiring Instantly and 10 That Take Months to Fill
    how InvestingPro noticed Lear’s 62% acquire one 12 months earlier than it occurred
    Business
    how InvestingPro noticed Lear’s 62% acquire one 12 months earlier than it occurred
    Bull of the Day: Marathon Petroleum (MPC)
    Market
    Bull of the Day: Marathon Petroleum (MPC)
    Is BDX Underperforming the Healthcare Sector?
    Business
    Is BDX Underperforming the Healthcare Sector?
  • Stock Market
    Stock MarketShow More
    DeFi Lending Platforms Defined: Advantages, Dangers, and Market Tendencies
    DeFi Lending Platforms Defined: Advantages, Dangers, and Market Tendencies
    June 5, 2026
    Senators Press For ‘Truthful’ Crypto Capital Guidelines In New Letter
    Senators Press For ‘Truthful’ Crypto Capital Guidelines In New Letter
    June 5, 2026
    Semiconductors proceed to wrestle forward of the Wall Avenue open
    Semiconductors proceed to wrestle forward of the Wall Avenue open
    June 5, 2026
    OpenAI to adjust to Trump AI mannequin evaluation order: Osborne
    OpenAI to adjust to Trump AI mannequin evaluation order: Osborne
    June 5, 2026
    Premu Launches Consumer-Created Leveraged Prediction Markets Simply in Time for the 2026 World Cup
    Premu Launches Consumer-Created Leveraged Prediction Markets Simply in Time for the 2026 World Cup
    June 5, 2026
  • Blockchain
    BlockchainShow More
    A Full Roadmap to Turn out to be a Crypto Auditor
    A Full Roadmap to Turn out to be a Crypto Auditor
    June 5, 2026
    SEC’s Hester Peirce Defends Open-Supply Blockchain Builders
    SEC’s Hester Peirce Defends Open-Supply Blockchain Builders
    June 5, 2026
    SEC’s Hester Peirce Defends Open-Supply Blockchain Builders
    Bitcoin ETF Possession Drops as Hedge Funds Promote, Banks Add
    June 5, 2026
    Stellar (XLM) Unveils Protocol 27: Key Options for Builders
    Stellar (XLM) Unveils Protocol 27: Key Options for Builders
    June 5, 2026
    SEC’s Hester Peirce Defends Open-Supply Blockchain Builders
    Google Pockets Expands Digital IDs and Fee Instruments in Europe
    June 5, 2026
  • Market Analysis
    Market Analysis
    Show More
    Top News
    Dubai actual property: fäm Properties launches AED3bn ultra-luxury assortment within the emirate
    Dubai actual property: fäm Properties launches AED3bn ultra-luxury assortment within the emirate
    November 10, 2025
    ISITC’s Paul Fullam on the ‘anxiousness’ over T+1 in Europe
    ISITC’s Paul Fullam on the ‘anxiousness’ over T+1 in Europe
    February 19, 2026
    Buc-ee’s earns ‘F’ grade from Higher Enterprise Bureau for ignoring complaints
    Buc-ee’s earns ‘F’ grade from Higher Enterprise Bureau for ignoring complaints
    March 10, 2026
    Latest News
    Jane Road faucets former Barclays e-FX dealer
    June 5, 2026
    10 Jobs Hiring Instantly and 10 That Take Months to Fill
    June 5, 2026
    how InvestingPro noticed Lear’s 62% acquire one 12 months earlier than it occurred
    June 5, 2026
    Bull of the Day: Marathon Petroleum (MPC)
    June 5, 2026
Reading: OpenAI and Paradigm Launch EVMbench to Check AI Good Contract Hacking
Share
Crypto CipheriumCrypto Cipherium
Font ResizerAa
Search
  • Home
  • News
    • NFT
    • Mining
  • Stock Market
    • Bitcoin
    • Ethereum
    • Forex
    • Tether
  • Blockchain
  • Market
    • Business
    • Money
Have an existing account? Sign In
Follow US
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms of Service
2025 © Crypto Cipherium. All Rights Reserved.
Blockchain

OpenAI and Paradigm Launch EVMbench to Check AI Good Contract Hacking

Editor
Last updated: March 5, 2026 8:58 am
Editor
Published: March 5, 2026
Share
OpenAI and Paradigm Launch EVMbench to Check AI Good Contract Hacking


Contents
  • Three Methods to Break Good Contracts
  • Actual Limitations Price Noting
  • $10M for Defensive Analysis


Rongchai Wang
Mar 05, 2026 00:55

New benchmark evaluates AI brokers’ skill to detect, patch, and exploit sensible contract vulnerabilities. GPT-5.3-Codex scores 72.2% on exploit duties.





OpenAI and crypto enterprise agency Paradigm have launched EVMbench, a benchmark that measures how effectively AI brokers can discover, repair, and exploit vulnerabilities in Ethereum sensible contracts. The announcement comes as AI-powered safety instruments race to guard the $100 billion-plus locked in DeFi protocols.

The benchmark attracts from 120 curated high-severity vulnerabilities pulled from 40 actual safety audits, principally from Code4rena competitions. It additionally consists of vulnerability eventualities from safety evaluations of Tempo, a Layer 1 blockchain constructed for stablecoin funds.

Three Methods to Break Good Contracts

EVMbench checks AI brokers throughout three distinct modes. In Detect mode, brokers audit contract repositories and get scored on discovering recognized vulnerabilities. Patch mode requires brokers to repair susceptible code with out breaking current performance. Exploit mode is essentially the most aggressive—brokers should execute precise fund-draining assaults towards contracts deployed on a sandboxed blockchain.

The outcomes present how rapidly AI capabilities are advancing on this area. GPT-5.3-Codex working through Codex CLI hit a 72.2% success price on exploit duties. That is greater than double the 31.9% rating from GPT-5, which launched simply six months prior.

Curiously, AI brokers carry out higher at attacking than defending. The exploit setting has a transparent goal—hold iterating till you drain the funds. Detection and patching proved more durable. Brokers typically stopped after discovering one bug as an alternative of auditing exhaustively, and sustaining full contract performance whereas eradicating delicate vulnerabilities remained difficult.

Actual Limitations Price Noting

OpenAI acknowledged EVMbench does not seize the total issue of real-world contract safety. Closely deployed protocols like Uniswap or Aave bear way more scrutiny than audit competitors code. The benchmark can also’t confirm if an agent finds legit vulnerabilities that human auditors missed—it solely checks towards recognized points.

The exploit setting runs on a clear native Anvil occasion quite than forked mainnet state, and timing-dependent assaults fall outdoors scope. Single-chain environments just for now.

$10M for Defensive Analysis

Alongside EVMbench, OpenAI dedicated $10 million in API credit particularly for defensive safety analysis. The corporate is increasing its Aardvark safety analysis agent to extra customers and partnering with open-source maintainers without spending a dime codebase scanning.

The timing issues. As AI brokers get higher at exploiting contracts, the window between vulnerability discovery and exploitation shrinks. Protocol groups that are not utilizing AI-assisted auditing will more and more discover themselves at a drawback towards attackers who’re.

OpenAI launched EVMbench’s duties, tooling, and analysis framework publicly. For DeFi builders and safety researchers, it is each a measuring stick and a warning about the place AI capabilities are headed.

Picture supply: Shutterstock


Trump Calls Peter Schiff A “Loser” And A “Jerk”
Pretend Ledger Wallets With Hidden WiFi Chips Floor on Chinese language Marketplaces
AAVE Value Prediction: Testing $240 Breakout with $280 Medium-Time period Goal Regardless of Bearish Momentum
ALGO Worth Prediction: Concentrating on $0.21 Restoration Inside 30 Days Regardless of Latest Volatility
EU Eyes SEC-Model Regulator For Inventory, Crypto Exchanges

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
[mc4wp_form]
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Email Copy Link Print
Previous Article Bitwise Channels 3K in Bitcoin ETF Features to Assist Open-Supply BTC Builders Bitwise Channels $233K in Bitcoin ETF Features to Assist Open-Supply BTC Builders
Next Article Crypto Market Invoice Hits New Impasse as Banks Reject White Home Deal Crypto Market Invoice Hits New Impasse as Banks Reject White Home Deal
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Follow US

Find US on Socials
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow
Popular News
Success Story: Charles Tyler’s Studying Journey with 101 Blockchains
Success Story: Charles Tyler’s Studying Journey with 101 Blockchains
Key Advantages, Use Circumstances, And Developments
Key Advantages, Use Circumstances, And Developments
The Innovation Hub Playbook: Constructing a Digital Ecosystem for the Recent Meals Chain
The Innovation Hub Playbook: Constructing a Digital Ecosystem for the Recent Meals Chain

Follow Us on Socials

We use social media to react to breaking news, update supporters and share information

Facebook X-twitter Youtube
Crypto Cipherium

We influence 20 million users and is the number one business blockchain and crypto news network on the planet.

Topics

  • About Us
  • Contact Us
  • Privacy Policy
  • Terms of Service
Reading: OpenAI and Paradigm Launch EVMbench to Check AI Good Contract Hacking
Share
2025 © Crypto Cipherium. All Rights Reserved.
  • bitcoinBitcoin(BTC)$62,086.00-0.49%
  • ethereumEthereum(ETH)$1,666.29-4.56%
  • tetherTether(USDT)$1.000.02%
  • binancecoinBNB(BNB)$590.07-0.56%
  • usd-coinUSDC(USDC)$1.000.01%
  • rippleXRP(XRP)$1.12-2.59%
  • solanaSolana(SOL)$66.11-3.15%
  • tronTRON(TRX)$0.324673-0.93%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.021.81%
  • HyperliquidHyperliquid(HYPE)$61.37-7.23%
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?