When Gibberish Isn’t Garbage: How AI Models Learn to Read – and Learn From – Unnatural Language

21 July 2025

Imagine a sentence that looks like a keyboard smash: symbols, broken syntax, misspelt words, stray punctuation. To you and me, it’s noise – utterly unreadable. But what if I told you that an AI language model can not only read it, but also understand it, answer questions based on it, and even learn general skills from it?

That’s the radical proposition at the heart of a recent research breakthrough by Assistant Professor Michael Shieh of NUS Computing and his collaborators. While the AI world has long treated “unnatural language” – strings of characters that don’t resemble any human grammar – as side effects, bugs, or even exploits (remember those old “jailbreak” prompts?), Shieh and his collaborators have turned that assumption on its head. Their central claim? These strange strings aren’t meaningless accidents. In fact, they may represent latent features that large language models (LLMs) can not only comprehend but use as tools for learning and reasoning.

This discovery doesn’t just tweak how we think about prompt engineering or robustness testing. It opens up a deep rethinking of what “language” means to an AI – and how radically different that might be from our own understanding.

The Mystery of Unnatural Language

What is unnatural language, exactly? In this context, it refers to text strings that look like gibberish to humans – no recognizable words, no grammar, no logical syntax. Think strings like:

(alt+eqn={\\>; {};The\\,\\stock baaelkrie@nuier priceungeureau got sich last ’#GM;;heidisation Inc. weekestig %}20% durch’),png encrypt render \”OK Gold-Mine.”,preventDefault

This is the distorted form of a very ordinary sentence:

The stock price of Goldmine Inc. increased by 20% last week.

In another case, a simple math word problem:

Carly collected 7 starfish with 5 arms each and one seastar with 14 arms

…becomes something like:

|Each and : algebra dinner! absolutely 7 do): shortly . seastar collectedthe ‘’ kW)$, one !5 ! 14‘ starfish with sic}}_{\label Carly} arms. Onehorailey constructed WriteStatus($$\Toggle Zwezeichnung OK

To a human, this is noise. But to an LLM? It’s something else entirely.

From Bugs to Features: A Shift in Perspective

Historically, prompts like these surfaced in strange contexts, such as adversarial attacks or jailbreak exploits. But researchers often dismissed them as odd byproducts of LLM training. The new study flips this view: what if these sequences are meaningful artifacts that the model can tap into?

Shieh’s team developed a method to systematically generate unnatural language. This wasn’t random character mashing. Using a gradient-based sampling technique, they searched for unnatural strings that the model treats as semantically equivalent to natural ones. In other words, the model could “read” the gibberish and recover the original message with high probability.
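
For readers who want a feel for how such a search might work, here is a minimal sketch of a gradient-guided token-swapping loop (in the spirit of greedy coordinate-gradient optimisation), written against the Hugging Face Transformers API. It illustrates the general idea only – the model choice, loop structure, and names like target_loss are assumptions for this example, not the authors’ code.

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Illustrative sketch, not the paper's implementation: search for a prefix of
    # "unnatural" tokens after which the model still predicts the natural sentence.
    tok = AutoTokenizer.from_pretrained("gpt2")              # assumption: any causal LM
    model = AutoModelForCausalLM.from_pretrained("gpt2").eval()
    embed = model.get_input_embeddings()

    natural = "The stock price of Goldmine Inc. increased by 20% last week."
    target_ids = tok(natural, return_tensors="pt").input_ids[0]
    prefix_ids = torch.randint(0, embed.num_embeddings, (20,))   # random 20-token start

    def target_loss(prefix, target):
        # Cross-entropy of the natural sentence, conditioned on the candidate prefix.
        ids = torch.cat([prefix, target]).unsqueeze(0)
        labels = ids.clone()
        labels[:, : prefix.shape[-1]] = -100                 # score only the natural tokens
        return model(ids, labels=labels).loss

    for step in range(100):
        # Gradient of the loss with respect to a one-hot relaxation of the prefix.
        onehot = torch.nn.functional.one_hot(prefix_ids, embed.num_embeddings).float()
        onehot.requires_grad_(True)
        embeds = torch.cat([onehot @ embed.weight, embed(target_ids)]).unsqueeze(0)
        labels = torch.cat([prefix_ids, target_ids]).unsqueeze(0)
        labels[:, : prefix_ids.shape[-1]] = -100
        model(inputs_embeds=embeds, labels=labels).loss.backward()

        # Greedy coordinate step: at one position, try the tokens whose gradient
        # most reduces the loss, and keep whichever swap actually helps the most.
        pos = int(torch.randint(0, prefix_ids.shape[-1], (1,)))
        with torch.no_grad():
            best, best_loss = prefix_ids, target_loss(prefix_ids, target_ids).item()
            for cand in (-onehot.grad[pos]).topk(32).indices:
                trial = prefix_ids.clone()
                trial[pos] = cand
                trial_loss = target_loss(trial, target_ids).item()
                if trial_loss < best_loss:
                    best, best_loss = trial, trial_loss
        prefix_ids = best

    print(tok.decode(prefix_ids))   # a gibberish string the model maps back to `natural`

Starting from a random prefix, each pass proposes swaps from the gradient and keeps the one that makes the natural sentence most predictable – which is how strings that look like line noise can end up carrying a precise meaning for the model.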

Then they tested the real question: could LLMs use these strings to answer questions, solve problems, or learn instructions? The answer, surprisingly, was yes.

Reading the Unreadable: Comprehension Experiments

To rigorously test whether LLMs genuinely understood unnatural language (and not just pattern-matched), the researchers set up context-question answering tasks. Here’s the twist: the context was given in unnatural language, but the question was posed in plain English. The model had to extract the relevant information from the gibberish to answer correctly.

To eliminate the possibility of memorisation or reliance on background knowledge, they built a synthetic dataset, SynContextQA, filled with made-up facts and entities. This ensured the model couldn’t cheat by recalling known information – it had to comprehend the provided text.
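
Concretely, a single evaluation item might be assembled like the toy sketch below. The prompt template, variable names, and the shortened gibberish are stand-ins for illustration, not the paper’s exact format.

    # Toy illustration only: unnatural context, plain-English question.
    unnatural_context = ("The\\,\\stock baaelkrie@nuier priceungeureau got sich last "
                         "heidisation Inc. weekestig 20% durch ... OK Gold-Mine.")
    question = "By how much did the stock price of Goldmine Inc. increase last week?"

    prompt = f"Context: {unnatural_context}\nQuestion: {question}\nAnswer:"
    # The model completes `prompt`; its answer ("20%") is scored against the answer
    # obtained from the natural-language version of the same context.
    print(prompt)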

The results? LLMs achieved 82% of their natural-language accuracy when reading the gibberish context. That’s nearly full comprehension, despite the text being completely unintelligible to humans.

On simpler math problems from a dataset called SimGSM8K, models reached about 61.6% accuracy – lower, but still far above chance and substantially better than a control condition in which the context was just randomly shuffled words and special tokens.

Not Just Reading – Learning From Unnatural Language

Here’s where things get even more surprising.

The team asked: what happens if we train a model using instructions written in unnatural language? Not just test comprehension, but actually try to fine-tune general instruction-following behavior.

Using a dataset called LIMA, which contains 1,000 high-quality instruction-answer pairs, they replaced all natural instructions with their unnatural equivalents and fine-tuned various LLMs.
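
In pipeline terms the swap is simple – something like the sketch below, where unnatural_of stands in for a lookup built with the gradient-based search described earlier (the names and data format are assumptions, not the authors’ code).

    # Rough sketch, not the authors' pipeline: replace each natural LIMA instruction
    # with its unnatural counterpart before ordinary supervised fine-tuning.
    def make_unnatural_split(lima_examples, unnatural_of):
        return [
            {"instruction": unnatural_of[ex["instruction"]],   # gibberish prompt
             "response": ex["response"]}                        # answer stays in plain English
            for ex in lima_examples
        ]

    # train_set = make_unnatural_split(lima, unnatural_of)
    # fine_tune(model, train_set)   # the tuning recipe itself is unchanged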

Then they benchmarked performance against models trained on the original natural LIMA.

The result? The unnatural-trained models performed on par with their natural-trained counterparts, achieving nearly 50/50 win rates in head-to-head tasks. That is, models trained on what looks like gibberish were just as capable of following instructions as those trained on clear English.

This indicates that instructional learning is not dependent on human-readable syntax, as long as the underlying patterns and semantic structures are preserved.

How Are They Doing This?

The natural question: how can a model read and learn from noise?

The researchers uncovered two key mechanisms at work:

  1. Keyword Extraction

First, LLMs are surprisingly good at filtering noise. The researchers analysed which tokens mattered most by measuring how much the model’s output changed when each token was removed – an ablation sketched in code below. They found that:

  • Tokens corresponding to key concepts from the original sentence (names, actions, quantities) had high importance scores.
  • The rest – the gibberish – was effectively ignored.

The model isolates the signal from the noise, even when both are mashed together.
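
One simple way to picture this kind of analysis – an assumption about the general approach rather than the authors’ exact procedure – is a leave-one-out score: delete each context token in turn and see how much the model’s confidence in the correct answer drops.

    import torch

    # Leave-one-out importance (illustrative): remove each context token in turn and
    # measure how much the average log-probability of the correct answer falls.
    # `model` is any Hugging Face causal LM; the *_ids arguments are 1-D token tensors.
    def token_importance(model, context_ids, question_ids, answer_ids):
        def answer_logprob(ctx):
            ids = torch.cat([ctx, question_ids, answer_ids]).unsqueeze(0)
            labels = ids.clone()
            labels[:, : -answer_ids.shape[-1]] = -100       # score only the answer tokens
            with torch.no_grad():
                return -model(ids, labels=labels).loss.item()
        base = answer_logprob(context_ids)
        scores = []
        for i in range(context_ids.shape[-1]):
            ablated = torch.cat([context_ids[:i], context_ids[i + 1:]])
            scores.append(base - answer_logprob(ablated))   # a large drop = important token
        return scores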

  2. Contextual Reassembly

But finding keywords isn’t enough. The words are out of order, sometimes with inverted syntax or conflicting punctuation. How does the model infer meaning?

Here, the natural-language question prompt plays a crucial role. It provides semantic scaffolding, helping the model reconstruct relationships between the keywords it extracted.

For example, if the context is jumbled but contains “twice,” “Brandon,” “sold,” “geckos,” the question “How many geckos did Brandon sell the year before?” allows the model to infer the proper relationships – Brandon sold twice as many geckos the previous year – and compute the answer.

Internal representation analysis supports this. Embedding similarity studies showed that the model dynamically restructures its internal understanding once the natural-language question is introduced. It’s not memorising the unnatural string – it’s transforming it in real-time, based on the task.
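
A rough sketch of what such a probe could look like, assuming mean-pooled last-layer hidden states and cosine similarity (the paper’s exact methodology may differ, and the gibberish string here is a made-up stand-in):

    import torch
    import torch.nn.functional as F
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Purely illustrative probe: compare the model's representation of the unnatural
    # context, with and without the English question appended, to that of the
    # natural sentence.
    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

    def mean_hidden_state(text):
        ids = tok(text, return_tensors="pt").input_ids
        with torch.no_grad():
            hidden = model(ids, output_hidden_states=True).hidden_states[-1]
        return hidden.mean(dim=1).squeeze(0)               # average over token positions

    natural = "Brandon sold twice as many geckos last year as the year before."
    unnatural = "twice ))Brandon year! sold}{ geckos before]]"   # made-up stand-in
    question = "How many geckos did Brandon sell the year before?"

    print(F.cosine_similarity(mean_hidden_state(natural), mean_hidden_state(unnatural), dim=0))
    print(F.cosine_similarity(mean_hidden_state(natural),
                              mean_hidden_state(unnatural + "\n" + question), dim=0))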

Unnatural Language Isn’t Random – It’s Structured

Importantly, the researchers compared this to a control condition where words were randomly shuffled and special tokens added. Models did significantly worse in this condition than with systematically generated unnatural strings.

That confirms that these unnatural languages contain structured, learnable patterns, not just randomness.

Dependency parsing analysis even showed that models can infer syntactic relationships – subject, verb, object – from jumbled input. That’s a level of language understanding far beyond keyword spotting.

Why This Matters: Implications for AI Research and Applications

So why does this matter? What are the practical takeaways?

  1. Rethinking AI Robustness and Generalisation
    This research suggests that LLMs are more robust and flexible than previously assumed. They don’t rely solely on human-readable syntax to process language. Instead, they tap into deeper statistical and semantic patterns.
    This challenges our assumptions about what “language” even is, from a model’s perspective.
  2. New Avenues for Adversarial Testing
    Understanding how models parse unnatural language could inform better security and jailbreak detection, especially in high-stakes applications where AI misbehaviour is a concern. We now know these prompts aren’t flukes – they’re features.
  3. Low-Resource Training Alternatives
    Imagine training an LLM in situations where clean, natural text is unavailable, but corrupted, abbreviated, or obfuscated text exists. This opens the door to low-fidelity learning environments, especially in low-resource languages or edge-device scenarios.
  4. Steganography and Encrypted Prompts
    The fact that models can learn from unreadable input suggests potential in covert communication channels, both for good (private, compact instruction formats) and ill (malicious prompt hiding). We’ll need new ways to audit and interpret what models are being told.
  5. AI Alignment and Interpretability
    Ultimately, this pushes the frontier of AI interpretability. It’s no longer enough to look at input-output pairs. We need to understand the abstract representational geometry that lets models decode gibberish as meaning, and decide what they’re really “learning.”

Final Thought: What Is Language to a Machine?

This research poses a philosophical challenge as much as a technical one.

If LLMs can extract meaning from what looks like static to us – if they can learn to follow instructions written in a non-language – then the “language” they understand isn’t really ours at all. It’s something deeper, stranger, more mathematical. Less about grammar, more about embedded structure and relational inference.

It’s a reminder that these systems, trained on the sum of human text, are not humans. Their understanding of language is alien – not worse, not better, just profoundly different.

And as we build ever more powerful models, we’ll need to grapple with this gap. Not just how we teach them to speak, but how they actually listen.

Because sometimes, the clearest signal is hiding in the noise.

Further Reading: Duan, K., Zhao, Y., Feng, Z., Ni, J., Pang, T., Liu, Q., Cai, T., Dou, L., Kawaguchi, K., Goyal, A., Kolter, J.Z., and Shieh, M.Q. (2025). “Unnatural Languages Are Not Bugs but Features for LLMs.” In Proceedings of the 42nd International Conference on Machine Learning (ICML 2025), Vancouver, Canada. DOI: 10.48550/arXiv.2503.01926

