Learning Maps from Geospatial Data Captured by Logistics Operations
Logistics operations produce a huge amount of geospatial data. This talk shows how that data can be used to build a mapping service similar to Google Maps and Here Maps.
E-commerce and logistics operations produce a vast amount of geospatial data while moving and delivering packages. As a logistics company supporting e-commerce operations in multiple Asian countries, Delhivery produces over 50 million geo-coordinates daily. These geo-coordinates represent either the movement of trucks and bikes or delivery events at given postal addresses. The data has great potential for mining geospatial knowledge, and we demonstrate that a mapping service similar to Google Maps and Here Maps can be built from it automatically. Specifically, we describe how regional maps (localities, cities, etc.) are learned from addresses labeled with geo-coordinates, and how roads are learned from the geo-coordinates associated with movement.
We propose an algorithm that constructs the polygons and polylines of map entities from a set of geo-coordinates. The algorithm first builds a non-parametric spatial probability model of each map entity and then classifies every cell of a hexagonal grid to one of the map entities. We show that the algorithm can handle noise, which is significantly high in our setting for reasons such as scale and device issues. We present a property relating the noise to the correct information under which the algorithm infers the correct map entity. We quantitatively measure the accuracy of our system by comparing its output with the available ground truth. We will showcase some localities whose polygons are incorrect in Google Maps but for which our data and algorithm learn the correct version. We also discuss multiple applications of the generated maps in the context of e-commerce and logistics operations.
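The two steps described above (density modelling, then grid-cell classification) can be illustrated with a minimal sketch. This is not the implementation from the paper: the Gaussian kernel, the fixed bandwidth, the count-based prior, the `min_score` cut-off, the planar (already projected) coordinates, and all function names are assumptions made for illustration only.

```python
import math

def gaussian_kde(points, bandwidth):
    """Return a function estimating the 2-D density of `points` at (x, y).
    Assumption: isotropic Gaussian kernel with one fixed bandwidth."""
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2 * len(points))

    def density(x, y):
        total = 0.0
        for px, py in points:
            d2 = (x - px) ** 2 + (y - py) ** 2
            total += math.exp(-d2 / (2.0 * bandwidth ** 2))
        return norm * total

    return density

def hex_centers(xmin, xmax, ymin, ymax, size):
    """Centers of pointy-top hexagons (circumradius `size`) covering a box.
    Odd rows are shifted by half a cell width."""
    dx = math.sqrt(3) * size   # horizontal spacing between centers
    dy = 1.5 * size            # vertical spacing between rows
    centers, row, y = [], 0, ymin
    while y <= ymax:
        x = xmin + (dx / 2 if row % 2 else 0.0)
        while x <= xmax:
            centers.append((x, y))
            x += dx
        y += dy
        row += 1
    return centers

def classify_cells(labeled_points, centers, bandwidth, min_score):
    """Assign each cell center to the entity with the highest
    prior * density; cells with too little support stay unassigned."""
    total = sum(len(pts) for pts in labeled_points.values())
    models = {
        name: (len(pts) / total, gaussian_kde(pts, bandwidth))
        for name, pts in labeled_points.items()
    }
    assignment = {}
    for cx, cy in centers:
        scores = {name: prior * kde(cx, cy)
                  for name, (prior, kde) in models.items()}
        best = max(scores, key=scores.get)
        if scores[best] >= min_score:
            assignment[(cx, cy)] = best
    return assignment

# Toy data (hypothetical): two localities, each a 5x5 block of delivery
# points, plus one noisy point labeled "A" in the middle of locality "B".
block = [(i * 0.5, j * 0.5) for i in range(-2, 3) for j in range(-2, 3)]
points = {
    "A": block + [(10.0, 0.0)],               # last point is label noise
    "B": [(x + 10.0, y) for x, y in block],
}

cells = hex_centers(-2.0, 12.0, -2.0, 2.0, 0.5)
assignment = classify_cells(points, cells, bandwidth=1.0, min_score=1e-3)
probe = classify_cells(points, [(0.0, 0.0), (10.0, 0.0), (5.0, 0.0)],
                       bandwidth=1.0, min_score=1e-3)
print(probe)
```

In this toy setup the single mislabeled point is outweighed by the many correctly labeled points around it, so the cell at its location is still assigned to locality "B", and the cell midway between the two localities stays unassigned. The union of the cells assigned to one entity would then give that entity's polygon.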
A part of this work was accepted for publication at ACM/SIGAPP Symposium On Applied Computing 2020:
"Learning Locality Maps from Noisy Geospatial Labels", in SAC 2020, Brno, Czech Republic.
Outline/Structure of the Talk
- Introduction [5 min]
- Data captured by logistics operations
- Map entities that can potentially be learned
- Motivation: why maps of our own? [2 min]
- Problem statement [1 min]
- Solution [6 min]
- Challenges: noise, scale
- Related work
- Results: Demonstration of generated maps [3 min]
- Applications and Conclusion [3 min]
- Logistics industry and its operations
- AI problems in the logistics industry
- Geospatial data that is captured in logistics operations
- Generative modeling: Kernel density estimation
- Handling noise in geospatial data
- Learning maps from scattered points
The talk is suitable for a general audience; only a small part (~20%) is technical. I also plan to use visualisations to explain the intuition behind the algorithm so that even this part is easy to follow.
Prerequisites for Attendees
- Geospatial data
- GPS coordinates
- A bit of probability (optional)
- Density estimation (optional)
Slides [work in progress]:
Research paper, a part of this work: