Veil: Bridging the Gap Between Speed and Rigor in Verifying Complex Distributed Systems

4 August 2025

Algorithms, Artificial Intelligence, Department of Computer Science

Ilya Sergey

Associate Professor

Computer Science

Why Verifying Distributed Systems Is Mission-Critical

Imagine the technology that powers everything from cloud services to blockchains suddenly failing because of a hidden flaw. As our reliance on large, interconnected distributed systems grows, ensuring their correctness and safety becomes not just a technical challenge, but a societal one. A failure in a cloud-based financial system could halt businesses; an error in a blockchain could cost millions. Despite this, verifying these complex systems remains one of the most daunting tasks in computer science.

Traditional methods offer a stark trade-off. On one end, there are fully manual proofs created using powerful interactive proof assistants. These provide a high degree of certainty but require months or even years of painstaking work by specialized experts. On the other end, fully automated verification tools promise speed but often require developers to contort their descriptions of systems into unnatural forms, and even then, they may be unable to prove complex properties.

This gap between thoroughness and usability has limited the scalability of formal verification efforts. Bridging it could revolutionize the reliability of critical digital infrastructure. That’s precisely the promise behind Veil, a groundbreaking tool developed by researchers at NUS Computing’s Verified Systems Engineering (VERSE) Lab led by Associate Professor Ilya Sergey.

The Pain Points of Traditional Verification

The difficulty in verifying distributed systems stems from their sheer complexity. Multiple computers interact across unreliable networks, facing unpredictable failures and delays. To be confident that such a system behaves correctly in every possible scenario is akin to proving that a massive, chaotic ballet will never miss a single step.

Interactive proof assistants like Lean, Coq, or Isabelle can rigorously verify these systems, but at enormous human cost. It’s not uncommon for verification projects to consume thousands of hours from highly trained formal methods experts.

Meanwhile, automated tools like Ivy or mypyvy offer quicker checks but with serious limitations. They often require “contorted specifications” – rewriting what you want to verify into awkward forms that fit the tool’s limited language. Worse, when automation fails, these tools provide little support for users to manually prove what remains.

Thus, developers have faced a painful choice: aim for very high assurance but pay the price in time and expertise, or aim for speed and accept weaker, narrower guarantees.

Veil’s Big Idea: The Best of Both Worlds

Veil is designed to break this false dichotomy. It provides “push-button” verification when possible but offers seamless access to the full power of interactive proof when needed. Built on the modern proof assistant Lean 4, Veil offers a unified workflow where developers can first lean on automation and, if necessary, escalate to richer manual proofs without changing tools or losing rigor.

Importantly, the core of Veil, the part that translates system descriptions into logical puzzles for solvers, is itself formally proven sound. This foundational guarantee means that users can have greater trust that verification results reported by Veil are correct, as the risk of bugs in the verifier itself is reduced.

How Veil Works: A Closer Look

Veil structures system verification around three components: the system’s state, its possible actions, and the invariants that must always hold.

State captures all mutable information, like which node holds a token in a mutual exclusion protocol.
Actions describe events that can change the state, like sending a message or granting access.
Invariants are the safety properties you want to guarantee, such as “no two nodes can be in the critical section simultaneously.”
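To make the three components concrete, here is a minimal sketch in plain Python of a token-based mutual exclusion protocol. It is not Veil syntax: the names State, enter, leave, pass_token and mutual_exclusion are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional

# Hypothetical toy model (not Veil syntax): token-based mutual exclusion.

@dataclass(frozen=True)
class State:
    holder: int                 # node that currently holds the token
    critical: FrozenSet[int]    # nodes currently inside the critical section

def initial(holder: int = 0) -> State:
    """Initial state: one node holds the token, nobody is critical."""
    return State(holder=holder, critical=frozenset())

# Actions: each returns the next state, or None when its guard does not hold.
def enter(s: State, n: int) -> Optional[State]:
    if s.holder == n and n not in s.critical:
        return State(s.holder, s.critical | {n})
    return None

def leave(s: State, n: int) -> Optional[State]:
    if n in s.critical:
        return State(s.holder, s.critical - {n})
    return None

def pass_token(s: State, src: int, dst: int) -> Optional[State]:
    if s.holder == src and src not in s.critical:
        return State(dst, s.critical)
    return None

# Invariant: no two nodes are in the critical section simultaneously.
def mutual_exclusion(s: State) -> bool:
    return len(s.critical) <= 1

# Usage: node 0 enters; node 1 cannot, because it does not hold the token.
s = enter(initial(), 0)
assert s is not None and mutual_exclusion(s)
assert enter(s, 1) is None
```

Running a few actions like this only tests individual executions. Verification asks for more: that the invariant holds after every possible sequence of actions.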

Developers write these components in a high-level, human-readable way. Veil then automatically generates verification conditions, logical formulas whose validity implies that the invariants hold in every reachable state of the system, and passes them to powerful solvers like cvc5 or Z3.
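A common shape for such verification conditions is an inductiveness check: the invariant must hold in every initial state, and every action must preserve it. The sketch below illustrates the idea on a toy token-based mutual exclusion protocol. It makes a strong simplifying assumption: instead of asking an SMT solver about all system sizes, it enumerates every state over three nodes. All names are hypothetical, and this is not how Veil itself encodes its conditions.

```python
from itertools import combinations

NODES = range(3)  # assumption: a tiny fixed universe stands in for an SMT solver

def all_states():
    """Every (holder, critical-set) pair, whether reachable or not."""
    for holder in NODES:
        for k in range(len(NODES) + 1):
            for crit in combinations(NODES, k):
                yield (holder, frozenset(crit))

def is_initial(state):
    return state[1] == frozenset()

def successors(state):
    holder, crit = state
    for n in NODES:
        if holder == n and n not in crit:
            yield (holder, crit | {n})      # enter the critical section
            for dst in NODES:
                yield (dst, crit)           # pass the token on
        if n in crit:
            yield (holder, crit - {n})      # leave the critical section

def check_inductive(inv):
    """Return a counterexample (pre, post), or None if inv is inductive."""
    for s in all_states():
        if is_initial(s) and not inv(s):
            return (None, s)                # fails in an initial state
        if inv(s):
            for t in successors(s):
                if not inv(t):
                    return (s, t)           # some action breaks the invariant
    return None

def mutex(s):
    return len(s[1]) <= 1                   # at most one node is critical

def strengthened(s):
    return s[1] <= {s[0]}                   # only the token holder may be critical

print(check_inductive(mutex))               # a (pre, post) pair: not inductive alone
print(check_inductive(strengthened))        # None: both conditions hold
```

The first check fails from a state that is unreachable but still satisfies mutual exclusion: one node is critical while a different node holds the token. Strengthening the invariant to say that only the token holder may be critical rules that state out, and the stronger property implies the original one. Finding such strengthenings is a routine part of this style of verification.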

If the solvers succeed, great; the invariant is verified. If not, Veil lets users step into Lean’s interactive proof mode, where they can craft detailed, human-guided proofs with strong automation support to finish the job.

Veil also supports bounded model checking for quick sanity checks and takes care to produce solver-friendly verification conditions. When counterexamples are found, it employs model minimization to simplify them, making debugging easier.
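The idea behind bounded checking and small counterexamples can be sketched with an explicit search. The code below is only an analogy under stated assumptions: it explores states breadth-first up to a step bound rather than calling a solver, and it tries the smallest number of nodes first as a crude stand-in for model minimization. The buggy flag, which drops the token check from the enter action, and all function names are hypothetical.

```python
from collections import deque

def successors(state, n_nodes, buggy):
    holder, crit = state
    for n in range(n_nodes):
        # enter: the buggy variant forgets to check who holds the token
        if n not in crit and (buggy or holder == n):
            yield (f"enter({n})", (holder, crit | {n}))
        if n in crit:
            yield (f"leave({n})", (holder, crit - {n}))
        if holder == n and n not in crit:
            for dst in range(n_nodes):
                if dst != n:
                    yield (f"pass({n},{dst})", (dst, crit))

def bounded_check(n_nodes, depth, buggy=False):
    """Search up to `depth` steps; return the shortest violating trace or None."""
    start = (0, frozenset())
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        state, trace = queue.popleft()
        if len(state[1]) > 1:               # two nodes critical at once
            return trace
        if len(trace) == depth:
            continue                        # bound reached on this path
        for label, nxt in successors(state, n_nodes, buggy):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, trace + [label]))
    return None

def smallest_counterexample(max_nodes, depth, buggy):
    """Try the smallest systems first so any counterexample stays small."""
    for n in range(1, max_nodes + 1):
        trace = bounded_check(n, depth, buggy)
        if trace is not None:
            return n, trace
    return None

print(smallest_counterexample(4, 5, buggy=True))   # (2, ['enter(0)', 'enter(1)'])
print(bounded_check(3, 6))                         # None: no violation within 6 steps
```

A two-node, two-step trace is far easier to read than a violation buried in a large system, which is the practical reason for minimizing counterexamples. Note that a clean bounded check is only a sanity check: it says nothing about longer executions or larger systems.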

Real-World Testing: Veil in Action

To demonstrate its effectiveness, Veil was tested on a battery of 16 benchmark distributed protocols, including complex algorithms like Vertical Paxos. It successfully verified all of them, something not all other tools could achieve. Moreover, verification times were practical. Many case studies completed within 10 to 15 seconds, making it realistic for developers to integrate formal verification into everyday development cycles.

Perhaps even more impressively, Veil could handle specifications and properties that fell outside the “effectively propositional” fragment that limits older tools. This means it can tackle richer, more expressive properties, which is key for real-world distributed systems.

While porting formal proofs from other systems, such as those for the Stellar Consensus Protocol (SCP) and Rabia, Veil even helped uncover inconsistencies in proofs that had been carried out with a mixture of verification tools. This shows how a unified framework can help avoid the human errors to which earlier, multi-tool approaches are prone.

Why This Matters: A Future of Safer Systems

The stakes for reliable distributed systems are only rising. Financial systems, healthcare infrastructures, public blockchains, and future metaverse platforms all rely on distributed coordination among computers that must handle failures gracefully.

If we can significantly lower the cost and expertise needed to formally verify these systems, it could unlock a new era of safer, more trustworthy digital infrastructure. Developers could routinely verify complex protocols without requiring years of specialized training. Companies could ship safer products without prohibitive costs. Regulators could demand higher verification standards, knowing that practical tools exist to meet them.

For example, in the context of public blockchains, which manage billions in assets, more accessible formal verification could prevent vulnerabilities like double-spend attacks or smart contract bugs. In cloud computing, companies could more confidently offer fault-tolerant services knowing their distributed algorithms are mathematically guaranteed to behave correctly.

Conclusion: Enabling a New Era of Verified Systems

The development of Veil is more than an academic exercise; it could fundamentally change what is possible in the real-world deployment of complex distributed systems.

Today, many industries hesitate to fully embrace formal verification because of the steep costs and technical barriers. But with Veil’s blend of automation, expressiveness, and efficiency, this equation shifts. It brings us closer to a future where formally verified protocols are not exceptional luxuries, but standard practice.

In cloud computing, for example, services like distributed storage, consensus systems, and fault-tolerant microservices could now be designed with formally verified safety properties, dramatically reducing the risk of silent data loss, split-brain errors, or service outages. Teams could verify key properties of their systems incrementally during development, rather than months after deployment, or worse, after a critical failure.

In blockchain and decentralized finance (DeFi), where billions of dollars flow through public, open networks, Veil’s approach could enable developers to formally verify the safety, liveness, and consistency of smart contracts, consensus protocols, and cross-chain bridges. Rather than reacting to catastrophic vulnerabilities only after they are exploited, blockchain systems could ship with machine-checked guarantees of behavior.

Similarly, in emerging fields like autonomous vehicles, drones, and smart city infrastructure, where distributed coordination among independent agents is crucial, Veil’s framework could be applied to verify that safety-critical invariants, such as collision avoidance or priority rules, are always maintained, even in the face of partial failures or network delays.

Beyond these immediate applications, the impact on software engineering culture could be profound. With tools like Veil making formal verification more accessible and faster, new generations of engineers could be trained to think about proofs and invariants as a routine part of system design – just as they are already trained to think about testing or security today. Verified-by-construction systems could become the norm, dramatically improving the reliability and trustworthiness of the digital world.

Veil’s modularity and portability mean it could inspire a broader ecosystem of verification tools tailored to different domains, from IoT networks to distributed AI training platforms. The long-standing tension between moving fast and building safe, reliable distributed systems could finally start to ease.

Veil could also prompt a shift in how we think about regulating critical software systems. Rather than relying solely on best practices and ad hoc testing, regulators and standards bodies could begin requiring formal verification for systems that meet certain criticality thresholds. For instance, financial regulators could mandate that blockchain-based financial infrastructure be backed by machine-checked proofs of key properties such as consistency and liveness. Cloud service providers could be asked to produce formal proofs of their consensus protocols before being certified for high-availability claims. Importantly, regulators should also balance rigor against practicality: mandates should evolve alongside tools like Veil, so that compliance is achievable without imposing unrealistic costs on developers.

In short, Veil doesn’t just offer a better way to verify today’s systems. It opens the door to building the next generation of distributed technologies – technologies we can trust from the ground up.

Veil is available as open source under the Apache 2.0 license at https://github.com/verse-lab/veil

Further Reading: Pîrlea, G., Gladshtein, V., Kinsbruner, E., Zhao, Q. and Sergey, I. (2025) “Veil: A Framework for Automated and Interactive Verification of Transition Systems,” 37th International Conference on Computer Aided Verification (CAV 2025), July 21-25, Zagreb, Croatia.
