Lost? Eyes in the sky can tell you where you are

19 December 2019

No matter how many times you’ve flown, sitting in the window seat and watching the world shrink away from view as the plane takes off never seems to grow old. Towering trees and skyscrapers become mere pixels, roads and rivers now thin winding ribbons, and vast tracts of land appear as tiny thumbnails below.

The familiar can become unrecognizable as we’re transported from the ground up into the air. People sometimes struggle with this change in perspective, and it turns out machines do too — especially those tasked with helping to make navigation easier.

Striving to create more accurate geolocation systems, researchers have in recent years been making use of satellite imagery. The underlying idea is simple: take the image in question and compare it with those from a database of geotagged satellite images. Find a match and you’ll be able to pinpoint your location. The snag, however, is that such ground-to-aerial matching — with its potential for use in navigation, autonomous vehicles, augmented reality and other applications — is incredibly challenging.
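In code, the underlying idea boils down to a nearest-neighbour search: each satellite image in the reference database carries a geotag and a numeric summary (a descriptor), and localising a query image means finding the reference whose descriptor is most similar. The sketch below is purely illustrative rather than any particular system’s implementation: the variable names, the cosine similarity measure and the assumption that every satellite tile already has a pre-computed descriptor are placeholders, and the hard part, producing descriptors that survive the ground-to-aerial viewpoint change, is what the rest of this article is about.

```python
import numpy as np

def localise(query_desc: np.ndarray,
             db_descs: np.ndarray,      # (N, D): one descriptor per satellite tile
             db_geotags: np.ndarray     # (N, 2): latitude/longitude of each tile
             ) -> tuple[float, float]:
    """Return the geotag of the satellite tile most similar to the query image."""
    # Cosine similarity between the query descriptor and every reference descriptor.
    sims = db_descs @ query_desc / (
        np.linalg.norm(db_descs, axis=1) * np.linalg.norm(query_desc) + 1e-12)
    best = int(np.argmax(sims))          # the closest-matching satellite tile
    lat, lon = db_geotags[best]
    return float(lat), float(lon)        # its geotag is the estimated location
```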

“It’s difficult because of a drastic change in viewpoints,” says Assistant Professor Gim Hee Lee, who studies computer vision and robotic perception at the National University of Singapore’s (NUS) School of Computing. “When you compare two images from satellite and street views, they’re hardly recognisable.”

Cross-view matching, as it’s formally called, has gained increasing attention in recent years. Traditionally, geo-localisation involves comparing two images — a query one against a reference one — both taken from the ground view. This approach is relatively easy to implement but suffers from two main drawbacks. “Your reference map needs to be well-covered,” says Lee. “But it’s impossible to access every part of the world no matter how much money or manpower you have.”

Furthermore, reference images, often crowdsourced from sites such as Flickr, tend to be very biased. Images of popular places are abundantly available, while those of more isolated areas are lacking. “For example, in Singapore, you see a lot of images that are focused on Gardens by the Bay, Marina Bay Sands, or the Merlion,” says Lee. “But if you want to navigate to NUS, then there will be very few images. Not to mention the heartlands like Clementi or Ang Mo Kio.”

Employing satellite images can help overcome these issues. “We can easily access them, and they have worldwide coverage,” says Lee. This explains why ground-to-aerial matching systems have become increasingly popular for geo-localisation.

Still, one big hurdle remains: how to overcome the drastic change in viewpoint when comparing an image taken on the ground to one taken up above.

Aggregating features

Spurred on by this challenge, Lee and his PhD student Sixing Hu began working on a possible solution in early 2017. What they came up with was the Cross-View Matching Network, or CVM-Net, a machine learning-based algorithm that makes ground-to-aerial geo-localisation possible.

“We exploit the very popular deep learning approach because it can extract features from images in a very powerful way,” says Lee. Feature extraction — the identification of features in a given image — is the first step of CVM-Net.
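For readers curious about what that first step looks like in practice, here is a minimal sketch. The published network pairs a convolutional backbone with a learned aggregation layer trained on ground and satellite image pairs; the snippet below instead uses an off-the-shelf pretrained VGG16 from torchvision, purely to show how a photograph becomes a grid of local feature vectors. The function name and image path are illustrative, not the authors’ code.

```python
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

# Off-the-shelf convolutional backbone, used here purely as a local feature extractor.
backbone = models.vgg16(weights=models.VGG16_Weights.DEFAULT).features.eval()

preprocess = T.Compose([
    T.Resize((224, 224)),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def local_features(image_path: str) -> torch.Tensor:
    """Map one image to a grid of local feature vectors."""
    x = preprocess(Image.open(image_path).convert("RGB")).unsqueeze(0)
    with torch.no_grad():
        fmap = backbone(x)                 # shape (1, 512, 7, 7)
    # Flatten the 7x7 spatial grid into 49 local 512-dimensional descriptors.
    return fmap.squeeze(0).flatten(1).T    # shape (49, 512)
```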

The second stage involves aggregating these features to form a unique signature for each image. “Just like how your thumbprint is unique to you, the signature is unique to the image,” he explains. The resulting signature, recorded as a string of numbers, can then be compared against pre-computed, geotagged ones in the database of satellite images to determine the location in question.

Crucially, it’s the creation of this distinctive thumbprint that has made ground-to-aerial localisation possible. “This particular step actually makes the whole process more robust and rotationally invariant,” says Lee. In other words, the signature formed by aggregating the features within an image can be used to pinpoint its location, regardless of the picture’s illumination or orientation.
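A toy example makes that intuition concrete. If the signature is built by pooling local features without regard to where they sit in the image grid, then rotating the grid leaves the signature unchanged. Plain mean pooling is used below as a stand-in for CVM-Net’s learned aggregation, and the idealisation that rotating the photograph simply rotates its feature grid is an assumption made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
features = rng.normal(size=(7, 7, 512))    # a 7x7 grid of 512-dimensional local features

def signature(feature_grid: np.ndarray) -> np.ndarray:
    """Orderless aggregation: pool the whole grid into one normalised vector."""
    pooled = feature_grid.reshape(-1, feature_grid.shape[-1]).mean(axis=0)
    return pooled / np.linalg.norm(pooled)

# Rotating the grid shuffles where each feature sits, but the pooled signature is identical.
rotated = np.rot90(features, k=1, axes=(0, 1))
print(np.allclose(signature(features), signature(rotated)))   # True
```

The learned aggregation in the actual network is richer than a simple mean, but it shares this orderless character, which is where the robustness to orientation comes from.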

A moonshot

After training the CVM-Net model, the researchers tested its effectiveness using two large datasets. One contained nearly 9,000 image pairs, while the other held close to a million. In both instances, CVM-Net outperformed all other geo-localisation approaches in terms of identification accuracy.

The researchers then proceeded to real-world testing. Using a car fitted with 12 infrared cameras offering views in four directions, the team drove around two test sites (one urban and the other rural) in Singapore. The tests demonstrated, for the first time, that by simply providing images or videos of your surroundings while in a moving vehicle, CVM-Net can tell you where you are in real time.

The impact of Lee and Hu’s work has been far-reaching. “All the subsequent research has followed what we are doing,” says Lee. “We became a benchmark that everybody has to follow in order to reach this kind of performance in ground-to-aerial geo-localisation.”

Work in the field is, however, far from over. “I don’t claim that we have solved the problem,” says Lee. “There are still a lot of other problems that remain.”

One thing he and other researchers are looking into is how to do semantic labeling. “Let’s say I show you a map, can you show me where all the road networks are? Or which ones are buildings?” he says.

Generalisation is another big issue in the field. “If you train your network on a dataset from one geographic location, will it also work when you bring your car to another part of the world?” says Lee.

Despite the challenges that remain, Lee is proud of how far his team has come. “When we first began, I was quite skeptical. This was like a moonshot thing because it sounded almost impossible to do in reality,” he recalls. “But then we showed a proof-of-concept and CVM-Net actually worked on a real vehicle.”

Paper:
Image-Based Geo-Localization Using Satellite Imagery
