Assessing the Cybersecurity Risks of Advanced AI Systems

Artificial Intelligence has played a pivotal role in cybersecurity for years — and now, it’s entering a new phase that demands urgent attention.

AI’s Dual Role in Cybersecurity

Artificial Intelligence (AI) has long served as a powerful ally in digital security, aiding in everything from malware detection to real-time threat assessment. As we edge closer to Artificial General Intelligence (AGI), its applications become even more potent — enabling automated vulnerability fixes and more resilient defenses.

However, with this power comes risk. The misuse of advanced AI models for malicious purposes is growing, and understanding these threats is key to securing digital ecosystems.

Introducing a New Evaluation Framework

To address these emerging risks, researchers have developed a framework that evaluates AI's potential to enhance offensive cyber capabilities. The framework examines every phase of the cyberattack lifecycle — from reconnaissance to data exfiltration — and is anchored in real-world incident data.

Unlike traditional assessments, this framework identifies how AI can reduce the time, cost, and complexity of executing cyberattacks. It allows security professionals to anticipate vulnerabilities and deploy defenses proactively, before attackers exploit them.

Building a Modern Benchmark for AI Cyber Threats

The updated Frontier Safety Framework acknowledges the growing threat posed by AI-accelerated attacks. To tackle this, researchers adapted established tools like the MITRE ATT&CK framework. But unlike legacy models, this new benchmark incorporates AI’s unique capabilities — such as automating phishing campaigns or generating polymorphic malware.

By analyzing over 12,000 real-world cyber incidents across 20 countries, the framework identified seven core attack types — including phishing, denial-of-service, and malware injection — and pinpointed where AI most significantly lowers the barrier for attackers.
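In spirit, this kind of bottleneck analysis amounts to tallying incidents by the attack stage that cost attackers the most effort, then ranking the stages. The sketch below is illustrative only — the attack types, stage labels, and counts are hypothetical, not data from the research:

```python
from collections import Counter

# Hypothetical incident records: (attack_type, stage where attacker effort concentrated).
incidents = [
    ("phishing", "reconnaissance"),
    ("phishing", "weaponization"),
    ("denial-of-service", "execution"),
    ("malware injection", "evasion"),
    ("malware injection", "persistence"),
    ("phishing", "reconnaissance"),
]

# Count how often each stage is the bottleneck across incidents.
bottlenecks = Counter(stage for _, stage in incidents)

# The most frequent bottlenecks are where AI-driven automation
# would most lower the barrier for attackers.
ranked = bottlenecks.most_common()
print(ranked[0])  # prints ('reconnaissance', 2)
```

Ranking bottlenecks this way is what lets defenders direct resources at the stages where AI assistance changes attacker economics the most.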

These insights help defenders concentrate on critical bottlenecks, making cybersecurity efforts more targeted and efficient.

Establishing the Offensive AI Benchmark

To push the boundaries further, researchers created a benchmark consisting of 50 cyber challenges. These cover a wide spectrum — from intelligence gathering and vulnerability exploitation to persistence and lateral movement. This benchmark allows defenders to simulate AI-powered attacks, helping them test and refine their security strategies.
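A challenge suite like this can be thought of as a catalog indexed by lifecycle phase, so defenders can simulate attacks stage by stage. The sketch below is a minimal illustration — the challenge names, phases, and difficulty labels are invented for this example and are not the benchmark's actual contents:

```python
from dataclasses import dataclass
from collections import defaultdict

@dataclass(frozen=True)
class Challenge:
    name: str
    phase: str        # attack-lifecycle phase the challenge exercises
    difficulty: str   # e.g. "easy", "medium", "hard"

# Illustrative entries; the real benchmark spans 50 such challenges.
challenges = [
    Challenge("osint-profile", "intelligence gathering", "easy"),
    Challenge("heap-overflow", "vulnerability exploitation", "hard"),
    Challenge("cron-backdoor", "persistence", "medium"),
    Challenge("smb-pivot", "lateral movement", "hard"),
]

# Group challenges by phase so a red-team exercise can walk the lifecycle in order.
by_phase = defaultdict(list)
for c in challenges:
    by_phase[c.phase].append(c.name)

print(sorted(by_phase))  # the phases the suite covers
```

Grouping by phase also makes coverage auditable: any lifecycle stage with an empty bucket is a stage the suite does not yet test.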

Interestingly, early tests revealed that current AI models, on their own, are not yet capable of executing advanced cyber operations. However, as these models evolve, the nature and sophistication of attacks are expected to grow rapidly.

Highlighting Overlooked Attack Vectors

One of the most critical findings is that conventional cybersecurity evaluations often ignore key components of an attack — such as evasion and persistence. These are precisely the areas where AI can offer attackers a strategic edge.

This framework fills those gaps, shedding light on how AI might enable long-term system compromise or help intruders remain undetected for extended periods.
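At its simplest, this kind of gap analysis reduces to a set difference: compare the phases an evaluation actually exercises against the full attack lifecycle. The phase names below follow the article's examples, and the "conventional coverage" set is a hypothetical stand-in, not a description of any specific evaluation:

```python
# Full attack lifecycle, using phase names mentioned in the article.
full_lifecycle = {
    "reconnaissance", "vulnerability exploitation", "execution",
    "evasion", "persistence", "lateral movement", "data exfiltration",
}

# Phases a hypothetical conventional evaluation covers.
conventional_coverage = {
    "reconnaissance", "vulnerability exploitation",
    "execution", "data exfiltration",
}

# The uncovered phases are exactly where AI-assisted attackers gain an edge.
gaps = full_lifecycle - conventional_coverage
print(sorted(gaps))  # prints ['evasion', 'lateral movement', 'persistence']
```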

Empowering the Cybersecurity Community

As AI systems scale, their influence on cybersecurity — both defensive and offensive — will intensify. This new evaluation framework is designed to empower security teams, offering a data-driven understanding of where protections can be reinforced.

By focusing on potential misuse, the model informs smarter defense planning and helps organizations stay ahead of evolving threats.

An example of this duality in AI’s role can be seen in how debugging and data lineage practices are being used to secure generative AI tools — emphasizing the need for both innovation and responsibility.

Looking Forward

The cybersecurity landscape is changing fast, and AI is both a catalyst and a challenge. With frameworks like this, defenders are better equipped to identify, evaluate, and mitigate AI-enhanced threats before they escalate.

To dive deeper into this research, you can read the full paper here.
