Fusing the Flows: How a New Testing Method Unearthed Decades-Old Flaws in PHP’s Core

20 August 2025

Cybersecurity, Department of Computer Science, Feature, Programming Languages & Software Engineering, Research, Student

SHARE THIS ARTICLE

Fusing the Flows: How a New Testing Method Unearthed Decades-Old Flaws in PHP’s Core

The Invisible Engine of the Web

Every time you post on a blog, browse an online store, or check a university portal, there’s a good chance you’re interacting with PHP. This server-side scripting language quietly powers more than 70% of all websites, from personal pages to major platforms.

Its ubiquity is a double-edged sword. On one hand, PHP’s maturity and widespread use make it a trusted workhorse. On the other, any deep-seated flaw in the PHP interpreter – the program that executes PHP code – has enormous reach. Vulnerabilities at this level can threaten the confidentiality, integrity, and availability of millions of sites simultaneously.

Most security attention focuses higher up the stack, at the application layer: plugging SQL injection holes, patching cross-site scripting, and fixing logic errors. But the interpreter is a sprawling, million-line C codebase, susceptible to the kinds of low-level memory errors, including buffer overflows, use-after-free, null pointer dereferences, etc., that can enable severe exploits.

Finding those flaws is not trivial. They’re often buried in rarely traversed paths of code, triggered only by complex interactions that no one thought to test.

Why the Best Test Suites Still Miss the Worst Bugs

The PHP community isn’t careless about testing. Its official test suite, known as the “golden test bed,” contains more than 19,000 cases covering core features and modules. These tests are valid, thorough, and widely respected.

Yet they share a common limitation: they tend to be simple, linear, and isolated. Each test probes a specific feature with straightforward inputs. Running two tests back-to-back doesn’t create the kind of intricate state interactions that can trip up deep memory management.

It’s in these untested interaction spaces, where one feature’s internal state bleeds unexpectedly into another, that the most stubborn bugs hide.

Enter FlowFusion

A team of security researchers at NUS Computing led by CS PhD student Jiang Yuancheng, in collaboration with fellow PhD students Zhang Chuqi, Ruan Bonan, and Liu Jiahao and advised by NUS Computing faculty members Assistant Professor Manuel Rigger, Associate Professor Roland Yap, and Associate Professor Liang Zhenkai, tackled this challenge with FlowFusion, the first automated fuzzing framework designed specifically to root out memory errors in the PHP interpreter. Their paper, entitled “Fuzzing the PHP Interpreter via Dataflow Fusion,” recently won the Distinguished Paper Award at the 34th USENIX Security Symposium in Seattle, Washington, USA.

Its signature move is data flow fusion; a method for creating rich, interaction-heavy tests by intelligently merging existing ones. Instead of simply concatenating scripts, FlowFusion analyzes the way variables and data structures move through one test and interleaves them with another. By weaving together two unrelated flows, it generates entirely new execution paths that were never explicitly tested before – paths that can reveal latent defects.

But here’s where FlowFusion moves from clever to transformative:

It is now incorporated directly into PHP’s official repository, making it a permanent part of the language’s quality-assurance pipeline.
It may be the top bug reporter for PHP today, rivaling or surpassing human submissions and other automated tools.
It continues to find new bugs every week on average, proving it isn’t just a one-off sweep but an ongoing engine of discovery.

Together, these points mean that FlowFusion isn’t just a research artifact; it has become one of the most important active contributors to the security of the web itself.

How Fusing Flows Finds Fossil Bugs

One discovery illustrates the power of the approach.

Test A checked DOM object behavior; variables flowed through reference and node manipulation.
Test B checked base64 encoding; values moved through string handling and encoding functions.

Individually, they’re worlds apart. But FlowFusion linked them, feeding a DOM object into base64 encoding routines via a newly created fusion variable. This forced the interpreter into an unusual sequence of operations, including a foreach loop in a context it had never been tested for.

The result? A heap use-after-free vulnerability that had been lurking in PHP’s core since version 5.0.0 in 2004; a flaw old enough to vote.

No conventional unit test or random fuzzing run had ever triggered this chain of events.

More Than Just Merging

FlowFusion layers multiple strategies to maximize bug discovery:

Test Mutation tweaks seed tests before fusion by replacing expressions or constants, injecting special values, while keeping syntax valid.
Interface Fuzzing feeds complex, fused variables into PHP’s internal C-level functions to stress-test hidden code paths.
Environment Crossover runs tests under varied PHP configurations (modules loaded, memory limits, JIT compilation, opcache settings) to expose environment-specific faults.

This multi-pronged approach targets the three big variables in bug manifestation: input complexity, execution context, and environmental conditions.

Results: A Sweeping Code Cleanup

Deployed against PHP’s latest interpreter, FlowFusion uncovered:

158 unique, previously unknown bugs

125 fixed and 11 confirmed in the official PHP repository
39 severe enough to crash the interpreter without specialized debugging tools

The defects spanned 10 distinct weakness categories (CWE) and touched over 80 source files, prompting changes to more than 5,000 lines of core code.

Examples include:

A heap overflow in the SQLite module due to improper buffer size checks.
A null pointer dereference in the Zend engine, caused by stale data structures generating invalid instructions.
Segmentation faults in the JIT compiler triggered by opcache mismanagement.

FlowFusion didn’t just find more bugs; it covered more ground. Compared under identical conditions, it achieved 24% higher code coverage than state-of-the-art general-purpose fuzzers like AFL++ and Polyglot after 24 hours.

Adoption by PHP’s Core Team

Perhaps the clearest vote of confidence came from PHP’s maintainers themselves, who integrated FlowFusion into their official toolchain. That means its methods are now part of the ongoing quality assurance process for one of the world’s most widely used software interpreters.

Why This Matters Beyond PHP

FlowFusion’s principles aren’t tied to PHP alone. Many large, mature software systems, especially those written in C or C++, share the same risk profile: complex, performance-critical codebases maintained over decades, with extensive but siloed test suites.

Think:

- Database engines (MySQL, PostgreSQL, MongoDB)
- Language interpreters and VMs (Python, Ruby, Node.js, JVM)

Operating system kernels

Network protocol stacks

In all these cases, intelligent fusion of existing tests could surface bugs that standard fuzzers miss—particularly those triggered by feature interactions no one anticipated.

From Patchwork Testing to Interaction-Aware Testing

FlowFusion’s contribution is as much about mindset as it is about method. Traditional testing often assumes features can be validated in isolation. But real-world software rarely runs in isolation; it’s the interplay of components, under diverse configurations, that exposes fragility.

By mining existing tests for their embedded knowledge and recombining them in semantically meaningful ways, FlowFusion automates what skilled human testers might attempt—at a scale and speed no team could match manually.

Future Uses and Impacts

1. Proactive Vulnerability Discovery
  Embedding data flow fusion into CI/CD pipelines for critical infrastructure software could catch severe memory issues before they ship—protecting end-users from zero-day exploits.
2. Security Audits of Legacy Systems
  Government and enterprise systems often run on outdated but critical platforms. Applying FlowFusion-like techniques could surface vulnerabilities that have lain dormant for decades, allowing safe remediation without disruptive rewrites.

Hardening AI/ML Frameworks
Popular machine learning libraries (TensorFlow, PyTorch) have complex C/C++ backends. Their massive test suites could be fused to uncover edge-case crashes or data corruption bugs that impact reproducibility and safety.

Interoperability Testing
In ecosystems with multiple interlinked components (e.g., browser engines combining HTML parsers, JavaScript engines, and rendering pipelines), fusing tests from different modules could reveal interaction bugs that manifest only in full-stack scenarios.

A Safer Internet Starts in the Engine Room

The lesson from FlowFusion’s success is clear: even the most trusted, thoroughly tested software can harbor deep, long-lived vulnerabilities. Finding them requires moving beyond conventional test case thinking and embracing methods that probe the unpredictable ways in which features interact.

For PHP, this has meant purging dangerous flaws, some as old as the interpreter’s early releases, and fortifying the security of millions of websites in the process. For the rest of the software world, it’s a proof of concept: if you can intelligently fuse the flows, you can force even the most mature systems to reveal their hidden weaknesses—before an attacker does.

Futher Readings: Jiang, Y., Zhang, C., Ruan, B., Liu, J., Rigger, M., Yap, R.H.C. and Liang, Z. (2025) “Fuzzing the PHP Interpreter via Dataflow Fusion,” 34th USENIX Security Symposium, Seattle: WA, August 13-15, https://www.usenix.org/conference/usenixsecurity25/presentation/jiang-yuancheng

Trending Posts

1 July 2020

Wanted: Sensitive New Age…Robot

Today’s virtual assistants and smart devices have come a long way. They can tell you if you’re running low on milk, what the weather will be like tomorrow, or change ...

4 March 2020

Online gifting and why we do it

For many of us, the introduction of Facebook, WhatsApp, and other social media platforms was a game-changer. They altered the way we make and maintain friends, and transformed how we ...

25 July 2019

Building a Vibrant Innovation Ecosystem

From driverless cars to life-saving medical devices and everything in between, the technologies of the future not only promise to change the world, but also to create high-paying jobs and ...

5 March 2021

Archipelago — making sure no student is an island

Like everyone else, Yuen Jien Soo found himself struggling to adapt when Covid-19 first hit last year. Soo, who teaches operating systems, computer organisation, and software product engineering at NUS ...

4 June 2025

Bullying the Machine: What AI’s Reactions to Psychological Pressure Teach Us About Vulnerability

A new study led by Professor Mohan Kankanhalli (Provost’s Chair Professor and Director of NUS AI Institute) reveals that large language models exhibit human-like psychological vulnerabilities when subjected to AI-driven ...

26 November 2021

Built a good machine learning model? Think again

When Jungpil Hahn was appointed head of the Department of Information Systems and Analytics at NUS Computing in 2015, it changed his perspective on many things. ...

27 December 2019

Move over Alfred, there’s a new butler in town

The shiny, black robotic arm gleamed as it whirred into action and ‘waved’ at us, accompanied by Alexa’s robotic, yet (somehow) cheery, disembodied greeting, “Hello! My name is MICO.” Mohit ...

28 May 2019

What Bayesian Optimisation can teach us about baking better cookies and more

Mention “Bayesian Optimisation” to Professor Bryan Low Kian Hsiang and he begins to talk about baking cookies. That’s because to the uninitiated, concepts such as “distributed batch Gaussian process optimisation” ...

18 March 2022

A course that lets you get your hands dirty

‘EPP’ is an acronym that rolls easily off the tongue, and is something that all first-year Computer Engineering undergraduates at NUS are intimately familiar with. Short for ‘Engineering Principles and ...

8 October 2021

Empty shelves in Nairobi’s pharmacies: There’s more than meets the eye

When you’re ill, seeing the doctor is one thing. Getting your prescription filled is another. If you live in an industrialised country, you probably wouldn’t think twice about the latter ...

26 July 2023

Motivating and Sustaining Heterogenous Exercisers: No One Size Fits All Solution

If you like to dabble in exercise — whether as a weekend warrior, Ironman contender, or somewhere in between — you might remember 2015 as being an exciting year. Fitbit ...

14 May 2020

Always one step ahead: Robo-Chef predicts steps of recipes it’s never seen before

To understand the work she does, Angela Yao says to imagine a future where robot helpers are commonplace. Whether they’re workplace assistants, companions, or domestic helpers, robots need to be ...

20 July 2020

Want to better categorise your products online? Try translation tech

Confined to their homes during the circuit breaker period, Singapore’s Covid-19 lockdown, people began ordering certain products in earnest: fitness equipment, home office accessories, flour and other baking goods. If, ...

1 July 2019

Does “practice makes perfect” apply to businesses too?

Remember when your piano teacher used to insist you practise your scales every single day? Turns out she wasn’t just being a tyrannical tormentor, but a firm believer in the ...

15 May 2025

Helping AI Helps Us Too: The Surprising Mental Health Benefits of Assisting Artificial Intelligence

A study led by Assistant Professor LEE Yi-Chieh and his team at the AI 4 Social Good Lab (AI4SG) at NUS Computing has uncovered a surprising finding that assisting even ...

15 February 2021

More than Assignments: Developing Software for the Real World

In 2011, Damith Rajapakse was teaching a few modules at NUS Computing when he ran into a problem. Part of his modules comprised an aspect of project work, and he ...

26 September 2025

Reasoning with Intelligence: A New Blueprint for Controllable Generative AI

Reasoning with Intelligence: A New Blueprint for Controllable Generative AI Generative AI has already dazzled the world with its capabilities. Whether it’s crafting photorealistic images, composing music, generating dialogue, or ...

4 August 2025

Veil: Bridging the Gap Between Speed and Rigor in Verifying Complex Distributed Systems

Explore how scalable collaborative zk-SNARKs enable fast, secure zero-knowledge proofs across multiple servers. This breakthrough improves privacy and scalability for AI verification, blockchain, and data markets, making advanced cryptography more ...