OpenAI GPT-4o ranked as best AI model for writing Solidity smart contract code by IQ

Published October 21, 2024

DCS Team

Credit: cryptoslate.com

SolidityBench by IQ has launched as the first leaderboard for evaluating LLMs on Solidity code generation. Available on Hugging Face, it introduces two new benchmarks, NaiveJudge and HumanEval for Solidity, designed to assess and rank the proficiency of AI models in generating smart contract code.

Developed by IQ's BrainDAO as part of the upcoming IQ Code suite, SolidityBench serves to refine IQ's proprietary EVMind LLMs and compare them against generalist and community-created models. IQ Code aims to offer AI models tailored to generating and managing smart contract code, meeting the growing need for secure and efficient blockchain applications.

As IQ told CryptoSlate, NaiveJudge offers a new approach by tasking LLMs with implementing smart contracts based on detailed specifications derived from audited OpenZeppelin contracts. These contracts provide a gold standard for correctness and efficiency. The generated code is evaluated against a reference implementation using criteria such as functional completeness, adherence to Solidity best practices and security standards, and optimization efficiency.

The evaluation process uses state-of-the-art LLMs, including several versions of OpenAI's GPT-4 and Claude 3.5 Sonnet, as impartial code reviewers. They assess the code against strict criteria, including implementation of all major functionality, handling of edge cases, error management, correct syntax usage, and overall code structure and maintainability.

Optimization considerations such as gas efficiency and storage management are also evaluated. Scores range from 0 to 100 and provide a comprehensive assessment of functionality, security, and efficiency, reflecting the complexity of professional smart contract development.
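How the individual reviewer scores are combined is not described here. As a rough illustration only, the sketch below assumes each reviewer model returns one score on the 0 to 100 scale and that the scores are combined with a plain average. The function name `aggregate_review_scores` and the reviewer labels are hypothetical, and the sample numbers are made up.

```python
from statistics import mean
from typing import Dict


def aggregate_review_scores(scores: Dict[str, float]) -> float:
    """Average per-reviewer scores, each expected on a 0 to 100 scale.

    Assumption: a plain mean. The real NaiveJudge aggregation is not
    specified in the article and may differ.
    """
    if not scores:
        raise ValueError("at least one reviewer score is required")
    for reviewer, score in scores.items():
        if not 0 <= score <= 100:
            raise ValueError(f"score from {reviewer} is outside 0-100: {score}")
    return mean(scores.values())


# Hypothetical reviewer labels and scores, for illustration only.
example_scores = {"reviewer_a": 70.0, "reviewer_b": 80.0, "reviewer_c": 75.0}
print(aggregate_review_scores(example_scores))
```

A weighted average or a per-criterion breakdown would be equally plausible; the plain mean is used only because it is the simplest choice.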

Which AI models are best for Solidity smart contract development?

Benchmark results showed that OpenAI's GPT-4o model achieved the highest overall score of 80.05, with a NaiveJudge score of 72.18 and HumanEval for Solidity pass rates of 80% at pass@1 and 92% at pass@3.

Interestingly, newer reasoning models like OpenAI's o1-preview and o1-mini were beaten to the top spot, with scores of 77.61 and 75.08 respectively. Models from Anthropic and xAI, including Claude 3.5 Sonnet and Grok-2, showed competitive performance with overall scores hovering around 74. Nvidia's Llama-3.1-Nemotron-70B scored the lowest in the top 10 with 52.54.

SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI's original HumanEval benchmark from Python to Solidity and includes 25 tasks of varying difficulty. Each task includes corresponding tests compatible with Hardhat, a popular Ethereum development environment, which allows accurate compilation and testing of the generated code. The evaluation metrics, pass@1 and pass@3, measure the model's success on the first attempt and over multiple attempts, providing insight into both accuracy and problem-solving ability.
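The two pass-rate metrics can be illustrated with a small sketch. It assumes pass@k counts a task as solved when at least one of the first k generated attempts passes that task's tests; SolidityBench's exact computation is not given here and may differ. The function name `pass_at_k` is hypothetical, and the sample outcomes are invented to show the arithmetic on 25 tasks, not taken from the leaderboard.

```python
from typing import List


def pass_at_k(results: List[List[bool]], k: int) -> float:
    """Fraction of tasks where at least one of the first k attempts passed.

    results[i][j] is True when attempt j on task i passed its tests.
    This is an assumed, simplified reading of pass@k, not SolidityBench's code.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if not results:
        raise ValueError("results must contain at least one task")
    solved = sum(1 for attempts in results if any(attempts[:k]))
    return solved / len(results)


# Invented outcomes for 25 tasks with 3 attempts each (illustrative only):
# 20 tasks pass on the first attempt, 3 pass only on a later attempt,
# and 2 never pass.
sample = (
    [[True, True, True]] * 20
    + [[False, True, False]] * 2
    + [[False, False, True]] * 1
    + [[False, False, False]] * 2
)

print(f"pass@1 = {pass_at_k(sample, 1):.0%}")  # 20 of 25 tasks
print(f"pass@3 = {pass_at_k(sample, 3):.0%}")  # 23 of 25 tasks
```

With 25 tasks, each task is worth four percentage points, so a pass@1 of 80% corresponds to 20 tasks solved on the first attempt and a pass@3 of 92% to 23 tasks solved within three attempts.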

Goals of using AI models in smart contract development

By introducing these benchmarks, SolidityBench aims to advance AI-assisted smart contract development. It encourages the creation of more sophisticated and reliable AI models and gives developers and researchers valuable insight into the current capabilities and limitations of AI in Solidity development.

The benchmarking toolkit aims to advance IQ Code's EVMind LLMs and also sets new standards for AI-assisted smart contract development across the blockchain ecosystem. The initiative hopes to address a critical need in the industry, where demand for secure and efficient smart contracts continues to grow.

Developers, researchers, and AI enthusiasts are invited to explore and contribute to SolidityBench, which aims to drive the continued refinement of AI models, promote best practices, and advance decentralized applications.

Visit the SolidityBench leaderboard on Hugging Face for more information and to start benchmarking Solidity generation models.
