Asking good questions to avoid the data breadline

Published on 05-02-2020 | By Jasdeep Garcha

Great questions are the crux of data analysis and building compelling insights. As I wrote about in my post on data retrieval, bad questions tank even the best intentions. Most people aren’t good at asking good questions, and it’s not because they lack the motivation to do so — it’s because to get really good at it the asker needs to understand where data is coming from.

The best questions tend to have the following qualities:

All of these qualities are due to how data is stored (in databases), how it is queried (in languages like SQL), and the cost of querying (ambiguity requires more data, which is more expensive).

If we view data retrieval/analysis as a rate limiting step, which it often is (e.g. the trope “the data breadline”), then there are two ways to approach it: 1) make data retrieval much cheaper (e.g. don’t penalize bad questions). Right now, this requires super expensive tools like Looker or Tableau to enable business users to be their own analysts, or 2) increase the quality of questions so the work that’s queued is much simpler and more people are enabled to find data themselves.

I tend to lean towards empowering people. That often requires education in both data infrastructure and data retrieval tools such as SQL. Once someone has familiarity with both, they are much more equipped to wrap their head around what a good question is and is not .. and to find it on their own.

Examples of good questions:

A couple of other thoughts: