The Map is not the Territory

newsletter, data, data science

Seek ground truth whenever possible to accelerate learning.

TJ Palanca https://www.twitter.com/tjpalanca
09-18-2021

The map is not the territory; seek ground truth whenever possible to accelerate learning.

uberHOP is a little example from my experience. The product was a point-to-point (a.k.a. UV Express) service that Uber launched in Manila, along with Seattle and Toronto.

The way it worked was simple: you would make a request to take a specific route during peak hours, and we would batch you in with up to 6 people to take a high-occupancy vehicle along the route.

uberHOP needed high occupancy to become profitable

The pricing was at a 70% discount to uberX (the traditional ride product), and drivers were guaranteed earnings, so there was a minimum average occupancy needed to hit profitability. To get to that high occupancy, we needed to ensure that the routes selected were of high quality.
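To make the occupancy constraint concrete, here is a minimal sketch of the break-even arithmetic. It assumes a deliberately simplified model in which the only revenue is rider fares and the only cost is the driver's guaranteed earnings per trip. The function name and the peso figures are hypothetical, not actual uberHOP numbers; only the 70% discount comes from the text above.

```python
def break_even_occupancy(uberx_fare, discount, guaranteed_earnings):
    """Average riders per trip needed for fares to cover the driver guarantee.

    Simplified model (an assumption): revenue = riders * discounted fare,
    cost = guaranteed driver earnings for the trip.
    """
    fare_per_rider = uberx_fare * (1 - discount)
    if fare_per_rider <= 0:
        raise ValueError("fare per rider must be positive")
    return guaranteed_earnings / fare_per_rider


# Hypothetical figures, for illustration only.
needed = break_even_occupancy(
    uberx_fare=200.0,           # made-up uberX fare for the same route
    discount=0.70,              # the 70% discount mentioned above
    guaranteed_earnings=300.0,  # made-up driver guarantee per trip
)
print(f"Break-even occupancy: {needed:.1f} riders per trip")
```

With these made-up inputs the break-even point is about five riders per trip, which leaves very little slack on a vehicle that seats six. That is why route quality mattered so much.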

Initial approach: Clustering!

My first instinct as a data person was clustering. We needed to find pairs of longitude and latitude that had enough pickup and dropoff density in them to have a decent chance of becoming profitable.
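As a rough illustration of the idea, the sketch below bins pickups and dropoffs into a coarse grid and counts trips per pickup-cell and dropoff-cell pair. This is a simple grid-binning stand-in for density clustering, not the algorithms I actually used, and the function names, cell size, and coordinates are all made up for the example.

```python
import math
from collections import Counter


def grid_cell(lat, lon, cell_size=0.005):
    """Snap a coordinate to a grid cell index.

    A cell_size of 0.005 degrees is roughly half a kilometre of latitude.
    """
    return (math.floor(lat / cell_size), math.floor(lon / cell_size))


def top_routes(trips, cell_size=0.005, n=3):
    """Count trips per (pickup cell, dropoff cell) pair; return the densest pairs."""
    counts = Counter(
        (grid_cell(p_lat, p_lon, cell_size), grid_cell(d_lat, d_lon, cell_size))
        for (p_lat, p_lon, d_lat, d_lon) in trips
    )
    return counts.most_common(n)


# Made-up trips: (pickup_lat, pickup_lon, dropoff_lat, dropoff_lon)
trips = [
    (14.5512, 121.0493, 14.5866, 121.0571),
    (14.5514, 121.0491, 14.5868, 121.0574),
    (14.5511, 121.0494, 14.5869, 121.0572),
    (14.6091, 121.0223, 14.5547, 121.0244),
]

for (pickup_cell, dropoff_cell), count in top_routes(trips):
    print(pickup_cell, "->", dropoff_cell, ":", count, "trips")
```

A ranking like this only says where demand is dense on the map. It says nothing about what the pickup point is like on the ground.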

The launch routes were selected using this method, but we had limited success: even after a novelty period, cancellation rates remained high.

I tried different algorithms, distance metrics, map features, and dispatch radii, all for very incremental gains.

Seeking ground truth

What did help was to actually seek ground truth, and the solution was embarrassingly obvious.

When we physically went to the most successful route’s pickup point, we found two key factors: (a) high-density residential buildings (as opposed to commercial), and (b) a driveway, so drivers weren’t a moving target.

We were able to turn the product profitable in a few weeks! This was easy to do because I was physically located in the market. However, this is a perennial challenge for distributed teams, so it’s even more important to consciously seek ground truth in those situations.

Here’s an abridged version in Twitter thread form:

The map is not the territory; seek ground truth whenever possible to accelerate learning. 🧵

Here's a little example from my experience: uberHOP was a point-to-point (a.k.a UV express) service we launched in Manila. We needed high occupancy so route selection was critical. pic.twitter.com/dfGdDlfE1j

— TJ Palanca (@tjpalanca) September 17, 2021

Corrections

If you see mistakes or want to suggest changes, please create an issue on the source repository.

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY-NC-ND 4.0. Source code is available at https://www.github.com/tjpalanca/tjpalanca.github.io, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Palanca (2021, Sept. 18). TJ Palanca: The Map is not the Territory. Retrieved from https://www.tjpalanca.com/posts/2021-09-18-the-map-is-not-the-territory/

BibTeX citation

@misc{palanca2021the,
  author = {Palanca, TJ},
  title = {TJ Palanca: The Map is not the Territory},
  url = {https://www.tjpalanca.com/posts/2021-09-18-the-map-is-not-the-territory/},
  year = {2021}
}