Robust estimation of inequality from binned incomes
Abstract
Researchers must often estimate income inequality using data that give only the number of cases (e.g., families or households) whose incomes fall in "bins" such as $0-9,999, $10,000-14,999,..., $200,000+. We find that popular methods for estimating inequality from binned incomes are not robust in small samples, where popular methods can produce infinite, undefined, or arbitrarily large estimates. To solve these and other problems, we develop two improved estimators: the robust Pareto midpoint estimator (RPME) and the multimodel generalized beta estimator (MGBE). In a broad evaluation using US national, state, and county data from 1970 to 2009, we find that both estimators produce very good estimates of the mean and Gini, but less accurate estimates of the Theil and mean log deviation. Neither estimator is uniformly more accurate, but the RPME is much faster, which may be a consideration when many estimates must be obtained from many datasets. We have made the methods available as the rpme and mgbe commands for Stata and the binequality package for R.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2014
- DOI:
- 10.48550/arXiv.1402.4061
- arXiv:
- arXiv:1402.4061
- Bibcode:
- 2014arXiv1402.4061V
- Keywords:
-
- Statistics - Methodology
- E-Print:
- 39 pages, 7 tables, 7 figures