You should summarize data with the geometric mean

Illustration by the author

The Median

Extreme Values

The Arithmetic Mean

Make Sure it Makes Sense

The Geometric Mean

Skewed Data

  1. Plot the distribution of your data, after applying a logarithm to them (any will do).
  2. If the curve appears bell-shaped, i.e. “normal” or “gaussian,” then the original distribution was approximately log-normal.
  1. Compare the range of your data (minimum and maximum) with the mean: Find differences between them and the mean, and also the quotients.
  2. If the differences are about the same, it means the data are fairly symmetric, and normal. But if the quotients are similar, the data are more likely log-symmetric, and skewed to the left lognormally.

Equal Ratios

Compound rates

Different Scales

Evenness

Logarithms

Removing Zeros

Some other Tidbits

Parting Thought

--

--

--

Data, graphics, games. So You Need to Learn R.

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

DataOps Testing, AirBnBs Quality Initiative, Testing with dbt; ThDPTh #3

Boost your efficiency and process Excel-files with Python

COVID 19 — Explanatory Data Analysis

Principal Components: Leveraging AI to Unlock Unstructured Data

How to create a Poll in WhatsApp

Introduction to PowerBI and Get started with PowerBI, Prepare data for analysis and Model data in…

DataViz Case Study: Seven Subtly Different Ways to Plot College Enrollment

Statistics 101 — Part 1 (Basics)

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Jasper McChesney

Jasper McChesney

Data, graphics, games. So You Need to Learn R.

More from Medium

R Programming: Non-deterministic testing

Paired Sample T-test in R

PCA (Almost) From Scratch in R

The Data Sandbox | Fitness Tracker Modelling: ML