Categories: Cyber Security News

OpenAI Launches GPT-5.4 With Advanced Reasoning, Coding, and Computer-Use Capabilities

OpenAI on March 5, 2026, released GPT-5.4, its most capable and efficient frontier model to date, combining advanced reasoning, coding, and agentic workflows into a single unified system.

The model is rolling out across ChatGPT (as GPT-5.4 Thinking), the API, and Codex, with a higher-performance GPT-5.4 Pro variant available for users requiring maximum compute on complex tasks.

GPT-5.4 consolidates capabilities previously spread across separate models, integrating the industry-leading coding strengths of GPT-5.3-Codex with improved general reasoning and native computer-use capabilities.

The result is a model engineered for end-to-end professional workflows from spreadsheets and presentations to complex multi-step agentic tasks with less back-and-forth interaction required from users.

In ChatGPT, GPT-5.4 Thinking introduces an upfront reasoning plan that allows users to interrupt and redirect the model mid-response without restarting, enabling more targeted, context-accurate outputs. This real-time steerability is a notable shift from prior reasoning models, where course corrections required starting over entirely.

GPT-5.4 Launched

GPT-5.4 sets new state-of-the-art scores across several critical industry benchmarks:

Benchmark	GPT-5.4	GPT-5.3-Codex	GPT-5.2
GDPval (wins or ties)	83.0%	70.9%	70.9%
SWE-Bench Pro (Public)	57.7%	56.8%	55.6%
OSWorld-Verified	75.0%	74.0%	47.3%
Toolathlon	54.6%	51.9%	46.3%
BrowseComp	82.7%	77.3%	65.8%

On GDPval, which tests agents across 44 occupations spanning the top 9 U.S. GDP industries, GPT-5.4 matches or exceeds industry professionals in 83% of comparisons, up from 70.9% with GPT-5.2.

On the BigLaw Bench evaluation for legal document work, the model scored 91%, according to Harvey’s Head of Applied Research, Niko Grupen.

GPT-5.4 is OpenAI’s first general-purpose model with native computer-use capabilities, enabling agents to interact directly with software through screenshots, mouse commands, and keyboard inputs.

On OSWorld-Verified, it achieves a 75.0% success rate, surpassing human performance benchmarked at 72.4% and far exceeding GPT-5.2’s 47.3%.

OpenAI’s new GPT-5.4 model is a big step toward autonomous agents

OpenAI is launching GPT-5.4, the latest version of its AI model that the company says combines advancements in reasoning, coding, and professional work involving spreadsheets, documents, and presentations. It's also OpenAI's first model with native computer use capabilities, meaning it can operate a computer on your behalf and complete tasks…

March 5, 2026

In "The Verge"

OpenAI Releases GPT-5.1-Codex-Max That Independently Performs Coding Tasks

OpenAI has unveiled GPT-5.1-Codex-Max, a frontier agentic coding model designed to autonomously handle complex software engineering tasks across multiple stages of the development lifecycle. Built on an updated foundational reasoning model, this iteration represents a significant advancement in AI-assisted coding capabilities, combining improved intelligence with enhanced token efficiency and multi-context…

November 20, 2025

In "Cyber Security News"

OpenAI GPT-5.2-Codex Supercharges Agentic Coding and Vulnerability Detection

OpenAI has unveiled GPT-5.2-Codex, a cutting-edge model optimized for agentic coding and enhanced cybersecurity tasks. The release highlights breakthroughs in handling complex software engineering and vulnerability detection. GPT-5.2-Codex tops SWE-Bench Pro with 56.4% accuracy, outperforming GPT-5.2 at 55.6% and GPT-5.1 at 50.8%. On Terminal-Bench 2.0, it scores 64.0%, surpassing prior…

December 19, 2025

In "Cyber Security News"

rssfeeds-admin