Fixing Vulnerable Computer Programs with Semantic Reasoning

27 September 2023

Department of Computer Science, Faculty, Feature, Programming Languages & Software Engineering, Research, Security

Abhik Roychoudhury

Provost's Chair Professor

Computer Science

SHARE THIS ARTICLE

Debugging is the bane of many a computer programmer’s existence — a task that’s both immensely costly and time-consuming. For a start, locating the source of a software error, or bug, is “like finding a needle in a haystack,” says Abhik Roychoudhury, a Provost’s Chair Professor of Computer Science at NUS Computing.

It’s a critical problem: bugs aren’t always simple coding errors. Sometimes they entail software vulnerabilities that can be exploited by an attacker. To make matters worse, firms may not fix these errors because they are so short-staffed, leaving them incredibly exposed.

A large part of the problem lies in the fact that vulnerability fixing remains a largely manual effort today. According to a recent study, it takes an average of 52 days for a vulnerability to be found and fixed. Moreover, in many software projects, “up to 80%, or even 90%, of all the resources in a software project goes into debugging and fixing errors,” says Roychoudhury.

But in a bid to alleviate this burden, researchers have been working to develop methods that can automatically repair buggy programs, by identifying a suitable patch and applying it to the faulty code in question with little, or possibly even without, human intervention.

To that end, Roychoudhury has come up with one such automated repair method: SemFix, short for Semantic-based Program Fixing. He and his team — comprising collaborator Satish Chandra (then at IBM) and two PhD students (one, Dawei Qi, is now CTO of a large software security company) — first revealed SemFix at the International Conference on Software Engineering (ICSE) in 2013, where it made waves among the research community.

A decade on, their work continues to create impact. In May, ICSE bestowed it the ‘Most Influential Paper Award’, an accolade that recognises research with “the most influence on the theory or practice of software engineering during the 10 years since its original publication.” It also marked the first time the award was given to researchers outside of North America or Europe.

“ICSE is the top venue for software engineering research, and the ICSE Most Influential Paper Award is a well-recognized award which has been given since 1989. We are very pleased and humbled that our research has been included in such a hall of fame,” says Roychoudhury. “Our work enables a software system to heal itself with the help of semantic reasoning.”

Semantics, not syntax

SemFix was revolutionary for a simple reason: it introduced a new approach to carrying out program repairs, one that was based on semantics. By comparison, previous methods relied largely on syntactic searches — whereby the repair program sieves through the entire software code, or earlier versions of it, in order to find a suitable replacement for the defective code from existing expressions, from among billions of possible edits. It then copies the fix across and attempts to patch up the bug. “But this only goes so far,” says Roychoudhury.

To understand why, he uses the metaphor of a football player getting injured during training. “Suppose I lose some skin on my arm due to rough tackling and I try to cure myself by copying some skin from my feet and putting it on my arm. But that doesn’t work as you can’t just arbitrarily take some skin and put it there.”

That’s because the new replacement skin — or software patch — may not function in the same way as the original. “But what we want to do is retain all of the functionality,” says Roychoudhury. “And that’s where our work comes in.”

As its name suggests, SemFix takes a semantics-based approach to program repair. “Instead of trying to search for edits, it computes a property that captures the essence of what the fix is supposed to be,” he explains. “Once we have that property, we can use it to synthesise one or more fixes that satisfy that property, rather than copying it from elsewhere.”

SemFix’s novelty lies in its ability to identify the particular property that would allow the repaired program to pass given test cases. Such a specification helps in automatically generating repairs. “Trying to figure out exactly what is wrong with a program tends to be subjective because oftentimes we don’t write down what the program is supposed to do in a very precise way,” says Roychoudhury. “So we thought: if we can try to fix the program so that it meets the basic criteria, like passing the given tests, then that would be a good alternative.”

The work, however, presented a significant technical challenge: deriving a property of the program edits usually involves higher-order reasoning. But the NUS team managed to develop a new kind of symbolic execution mechanism that is capable of pinpointing the relevant properties using first order logic, which is less costly overall. “Our approach produces higher quality repairs,” says Roychoudhury.

Research in the real world

In the decade that’s passed since their groundbreaking work, Roychoudhury’s team has made other inroads in the field of automatic repair and its applications in the real world. The team has produced a dataset for vulnerability repairs to study and advance the state-of-the-art in security vulnerability repair. This helps ensure vulnerabilities in software systems are found and fixed, reducing the time systems are exposed to bugs. To carry out this work, the researchers have been actively working with Oracle Labs and other R&D partners.

Another related aspect Roychoudhury is currently exploring, via a Ministry of Education grant, is how other kinds of artifacts (such as static analysis results), rather than just test cases, can be used as warnings to trigger automatic program repairs. He’s also studying whether the two repair approaches — semantics-based and search-based — can possibly be combined to enhance their effectiveness and applicability, alongside collaborators at Microsoft Research.

Additionally, he and his team have conducted a number of field studies, including one last year which involved more than 100 software developers. “We wanted to find out what they are looking for in automatic program repair so we can tailor the technology to meet their expectations,” he explains. Pushing the envelope on automatic fixes is crucial because “currently, when security vulnerabilities are found, the manpower to fix it isn’t there, even if they are severe bugs. Automated repair can provide a solution to reduce the exposure of critical software systems to such vulnerabilities.”

Roychoudhury also hopes that one day, the technology can be applied to aid developer productivity when it comes to programming, especially with the advent of large language models like ChatGPT. “There is a very significant role for these kinds of technologies, but the reality is that companies aren’t using code that is automatically generated by ChatGPT today because there are issues of whether this code is really safe and trustworthy, and so on,” he says. “But if we have some kind of repair that can help improve this code to enhance programmers’ confidence in it,” then that could have a tremendous impact on the industry.

Within NUS, Roychoudhury’s work has been integrated into the Coursemology learning platform to help instructors provide feedback to students learning how to program. Such a tutoring system can also be used for automated grading of programming assignments, which “is really helpful because there are so many students on the course,” he says. He is also working to build a plugin for the NUS teaching platform, Canvas, which will allow the program to be used for other modules and also outside NUS.

“What we did in this paper ten years ago is very core technology,” reflects Roychoudhury. “Today, it has many different applications — in teaching programming, software engineering, and of course, very much in enhancing software security. It helps achieve the vision of autonomous cyber-defence for software systems.”

Trending Posts

5 October 2020

Watching People Walk

Life has a funny way of leading people down paths they least expect. Just ask NUS Computing lecturer Boyd Anderson. Two years ago, Anderson, then a PhD student, found himself ...

6 May 2021

Can Mobile Apps Make Us Eat Better and Be Healthier?

Every decade has an exercise trend or two that defines it. Step aerobics and the Thighmaster were popular in the ‘90s, for instance, while exer-gaming and CrossFit were all the ...

14 February 2025

DiversiNews Helps Increase News Exposure and Broaden People’s Minds

Those of us who don’t belong to Gen Z or Alpha may recall simpler times, when there were no smartphones or internet, and the only news we got was via ...

19 June 2025

Breaking the Bottleneck: Making Zero-Knowledge Proofs Practical at Scale

Explore how scalable collaborative zk-SNARKs enable fast, secure zero-knowledge proofs across multiple servers. This breakthrough improves privacy and scalability for AI verification, blockchain, and data markets, making advanced cryptography more ...

26 March 2019

Future Wearables: smaller, faster and more independent

For those who’ve taken the plunge into the world of wearable devices — 61 million of us by the year’s end, as estimates predict — the leap can be liberating. ...

8 February 2019

Sing Your Way to Language Success

Have you ever struggled to learn a new language? Maybe you should spend less time trying to speak it and start singing instead! ...

13 April 2023

Spotting concurrency bugs in software with sampling

In the summer of 1983, the government organisation Atomic Energy of Canada Limited launched its newest radiation therapy machine. The Therac-25 was highly anticipated — it boasted a revolutionary dual ...

2 November 2023

Creating mobile health apps that factor in the weather

All across the developed world, people are living increasingly sedentary lives. The average adult spends more than half their day sitting down — nearly six hours for those in Singapore, ...

29 December 2021

Watch the Action as it Happens: Towards Low-Latency Video Streaming

Roger Zimmermann has been in the business for a long time — nearly 25 years to be precise. He first started studying media streaming in the late 1990s, as a ...

24 September 2021

Making sense of messy data with ThunderGP

Choice is good, but sometimes having too much choice can be a bad thing. Just ask anyone who’s ever tried to delve into a new film on Netflix, discover new ...

4 June 2025

Designing Better Software Teams

A forthcoming study by NUS Computing’s Prof Jungpil Hahn and collaborators sheds new light on how software team structures impact product success in the digital economy. ...

1 October 2020

Beyond Paywalls and Profits

In March 2011, the New York Times introduced a policy that would later be recognised as a milestone in media history. The newspaper, deemed one of the best in the ...

11 October 2018

Online Shopping and the Science of Serendipity: NUS Computing Researcher Jack Jiang on Product Search in Social Commerce

Have you ever gone to an e-commerce website with the intention of buying one specific thing, but then ended up with something totally different? ...

4 June 2025

A New Grasp on Robotics: Teaching Robots to Hold the Future

A new framework developed by NUS Computing’s Asst Prof Shao Lin and collaborators brings robots closer to human-like dexterity, overcoming a key barrier in robotic grasping. ...

2 May 2025

Building the Right Features: Rethinking Innovation in the App Economy

A new study published in Information Systems Research by NUS Computing Assistant Professor Aditya Karanam sheds light on how feature strategy influences app adoption in the competitive app market. ...

22 April 2021

Reuse, Recycle…Recode

For an electronic device to ‘know’ what to do, computer programmers need to give it a set of instructions, called code. Writing software programmes can be an immense task — ...

13 November 2020

Quantum Physics Gets a Boost from AI

Stéphane Bressan and Christian Miniatura grew up in rival neighbourhoods of the naval garrison town of Toulon in southern France. They went to the same high school and the same ...

18 March 2022

A course that lets you get your hands dirty

‘EPP’ is an acronym that rolls easily off the tongue, and is something that all first-year Computer Engineering undergraduates at NUS are intimately familiar with. Short for ‘Engineering Principles and ...

10 June 2022

Want to make a good app? Update often and get customers involved

Modern-day learners have a wealth of “teachers” to turn to: online books, e-learning courses, YouTube tutorials, and even smartphone apps. If, for instance, you are yearning to lead a more ...

4 June 2025

Seeing Safety: How Augmented Reality Could Transform Drone Inspections Forever

Associate Professor Ooi Wei Tsang and his team at NUS Computing have developed SafeSpect, an adaptive augmented reality system that enhances drone pilots' situational awareness for drone inspections. ...

19 August 2020

The path to startup success: finding product market fit

In 2015, Shi Ying Lim was working on her Ph.D. in Austin, Texas. As part of her work, she studied a budding health IT startup that was trying to develop ...

3 January 2023

So you have a dataset? Think about the values it’s missing

Imagine that you’re a book publisher gathering feedback for a new novel that your firm has recently released. Sales figures are useful, but you’re keen to find out more about ...

15 June 2020

When cloud providers pool and throttle to win the race

When Yingda Zhai was working on his PhD in Austin, Texas, he used to stroll through the neighbourhood he lived in not too far from campus. On these walks, he ...

1 June 2020

Vanquishing smartphone zombies with EYEditor

If you have been to parts of Orchard Road or Bugis Junction, two busy shopping streets in Singapore, you might have noticed something unusual. There, familiar “traffic light men” flash ...

17 March 2025

Waterfall: A New Watermarking Method to Protect Copyright in the World of LLMs

A new watermarking technique protects copyright in the world of LLMs ...

8 October 2021

Empty shelves in Nairobi’s pharmacies: There’s more than meets the eye

When you’re ill, seeing the doctor is one thing. Getting your prescription filled is another. If you live in an industrialised country, you probably wouldn’t think twice about the latter ...

19 December 2019

Lost? Eyes in the sky can tell you where you are

No matter how many times you’ve flown, sitting at the window seat and watching the world shrink away from view as the plane takes off never seems to grow old. ...

17 December 2020

Towards personalised medicine: subtyping patients using their genomic data

Most pundits gazing into the crystal ball will likely shout two words in their prediction of healthcare’s future: precision medicine. Increasingly, there is growing recognition that tailoring treatments based on ...

15 February 2021

More than Assignments: Developing Software for the Real World

In 2011, Damith Rajapakse was teaching a few modules at NUS Computing when he ran into a problem. Part of his modules comprised an aspect of project work, and he ...

1 March 2019

Building Better IT Systems with Prof Chuan-Hoo Tan

At some point in our careers, most of us have to deal with an IT system that is clunky, unreliable, or just plain difficult to use. It might have an ...

Fixing Vulnerable Computer Programs with Semantic Reasoning

SHARE THIS ARTICLE

Trending Posts

Programmes

ADMISSIONS

RESEARCH

DEPARTMENTS

RESOURCES

Programmes

ADMISSIONS

RESEARCH

DEPARTMENTS

RESOURCES