Learning Maps from Geospatial Data Captured by Logistics Operations


Logistics operations produce a huge amount of geospatial data. This talk shows how that data can be used to build a mapping service similar to Google Maps or Here Maps.


E-commerce and logistics operations produce a vast amount of geospatial data while moving and delivering packages. As a logistics company supporting e-commerce operations in multiple Asian countries, Delhivery produces over 50 million geo-coordinates daily. These geo-coordinates represent either the movement of trucks and bikes or delivery events at the given postal addresses. The data has great potential for mining geospatial knowledge, and we demonstrate that a mapping service similar to Google Maps and Here Maps can be built automatically from it. Specifically, we describe the learning of regional maps (localities, cities, etc.) from addresses labeled with geo-coordinates, and the learning of roads from the geo-coordinates associated with movement.

We propose an algorithm to construct polygons and polylines of the map entities given a set of geo-coordinates. The algorithm involves non-parametric spatial probability modelling of the map entities, followed by classification of the cells of a hexagonal grid to their respective map entities. We show that our algorithm can handle noise, which is significantly high in our setting for reasons such as scale and device issues. We present a property relating the noise to the correct information under which our algorithm infers the correct map entity. We quantitatively measure the accuracy of our system by comparing its output with the available ground truth. We will showcase some localities whose polygons are incorrect in Google Maps but for which we can learn the correct version from our data and algorithm. We also discuss multiple applications of the generated maps in the context of e-commerce and logistics operations.
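
To make the two stages above concrete, here is a minimal Python sketch of the general idea, not the algorithm from the paper. It assumes the points are already projected to planar coordinates (e.g. kilometres), uses a Gaussian kernel as the non-parametric density model, and assigns each hexagonal cell centre to the label with the largest kernel mass. The function names, the bandwidth, the cell size and the `min_mass` cut-off are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict


def hex_centers(x_min, x_max, y_min, y_max, size):
    """Centres of a pointy-top hexagonal grid covering a bounding box."""
    dx = math.sqrt(3) * size   # horizontal spacing between centres
    dy = 1.5 * size            # vertical spacing between rows
    centers = []
    row, y = 0, y_min
    while y <= y_max:
        # Odd rows are shifted by half a cell to form the hexagonal layout.
        x = x_min + (dx / 2 if row % 2 else 0.0)
        while x <= x_max:
            centers.append((x, y))
            x += dx
        y += dy
        row += 1
    return centers


def kernel_mass(point, samples, bandwidth):
    """Sum of Gaussian kernels centred on the samples (unnormalised KDE).

    Summing rather than averaging keeps the number of samples in the score,
    so a label backed by many points outweighs a few mislabelled ones.
    """
    px, py = point
    total = 0.0
    for sx, sy in samples:
        d2 = (px - sx) ** 2 + (py - sy) ** 2
        total += math.exp(-d2 / (2 * bandwidth ** 2))
    return total


def classify_cells(labelled_points, size, bandwidth, min_mass):
    """Map each hexagon centre to its best label, or None if support is weak."""
    by_label = defaultdict(list)
    for x, y, label in labelled_points:
        by_label[label].append((x, y))
    xs = [p[0] for p in labelled_points]
    ys = [p[1] for p in labelled_points]
    cells = {}
    for center in hex_centers(min(xs), max(xs), min(ys), max(ys), size):
        masses = {lab: kernel_mass(center, pts, bandwidth)
                  for lab, pts in by_label.items()}
        best = max(masses, key=masses.get)
        cells[center] = best if masses[best] >= min_mass else None
    return cells


def label_near(cells, target):
    """Label of the cell whose centre is closest to the target point."""
    center = min(cells, key=lambda c: (c[0] - target[0]) ** 2
                                      + (c[1] - target[1]) ** 2)
    return cells[center]


# Toy data: locality "A" around (0, 0), locality "B" around (10, 0),
# plus one mislabelled "B" point placed inside locality "A".
offsets = [(i * 0.5, j * 0.5) for i in range(-2, 3) for j in range(-2, 3)]
points = [(x, y, "A") for x, y in offsets]
points += [(x + 10.0, y, "B") for x, y in offsets]
points.append((0.2, 0.1, "B"))  # noisy label

cells = classify_cells(points, size=0.5, bandwidth=0.7, min_mass=1.0)
print(label_near(cells, (0.0, 0.0)))
print(label_near(cells, (10.0, 0.0)))
print(label_near(cells, (5.0, 0.0)))
```

In this toy setup the single mislabelled point is outvoted by the surrounding correctly labelled points, and cells far from any data stay unassigned. Merging the cells that share a label would then give an approximate polygon for each locality; that step is omitted here.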

A part of this work was accepted for publication at ACM/SIGAPP Symposium On Applied Computing 2020:

"Learning Locality Maps from Noisy Geospatial Labels. In SAC 2020 at Brno, Czech Republic"


Outline/Structure of the Talk

  • Introduction [5 min]
    • Data captured by logistics operations
    • Map entities that can potentially be learned
  • Motivation: why maps of our own? [2 min]
  • Problem statement [1 min]
  • Solution [6 min]
    • Challenges: noise, scale
    • Related work
    • Algorithm
  • Results: Demonstration of generated maps [3 min]
  • Applications and Conclusion [3 min]

Learning Outcome

  • Logistics industry and its operations
  • AI problems in the logistics industry
  • Geospatial data that is captured in logistics operations
  • Generative modeling: Kernel density estimation
  • Handling noise in geospatial data
  • Learning maps from scattered points

Target Audience

The talk is suitable for a general audience, except for a small part (~20%) that is technical. I also plan to use visualisations to explain the intuition behind the algorithm so that even this part is easier to follow.

Prerequisites for Attendees

  • Geospatial data
  • GPS coordinates
  • A bit of probability (optional)
  • Density estimation (optional)
Submitted 7 months ago

Public Feedback

  • Ashay Tamhane
    By Ashay Tamhane  ~  7 months ago

    Thanks, Manjeet, for an interesting proposal. I recall there was a talk on a very similar topic from Delhivery last year. Could you please elaborate on how this talk differs from last year's?

    • Dr. Manjeet Dahiya
      By Dr. Manjeet Dahiya  ~  7 months ago

      Hi Ashay,

      In last year's talk, we discussed multiple problems related to logistics and discussed their high-level solutions. A related problem was discussed in a narrow setting along with many others. In this talk, we focus on a single problem (Learning Maps) and a new solution in detail.

      Note that the solution and the generalized problem have never been presented before, except at ACM-SAC 2020, where the work was accepted for publication.

      We think little work has been done in the area of using logistics data to create maps, and we hope that disseminating our learnings would be helpful for the data science community.


  • Natasha Rodrigues
    By Natasha Rodrigues  ~  7 months ago

    Hi Dr. Manjeet,

    Thanks for your proposal! Could you please update the Outline/Structure section with a time-wise breakup of how you plan to use the 20 mins across the topics you've highlighted?

    To help the program committee understand your presentation style, can you provide a link to your past recording or record a small 1-2 mins trailer of your talk and share the link to the same?

    Also, in order to ensure the completeness of your proposal, we suggest you go through the review process requirements.



    • Dr. Manjeet Dahiya
      By Dr. Manjeet Dahiya  ~  7 months ago

      I have updated the proposal as per the suggestions. I added the related slides. I have also added the video of a panel discussion where I participated as a panelist and I discussed AI problems related to logistics. I hope it is OK.

      • Natasha Rodrigues
        By Natasha Rodrigues  ~  7 months ago

        Hi Dr. Manjeet,

        Thanks a ton for this.