A Further (Itakura-Saito/beta=0) Bi-stochaticization and Associated Clustering/Regionalization of the 3,107-County 1995-2000 U. S. Migration Network
Abstract
We extend to the Itakura-Saito (beta = 0) case of the beta-divergence the comparative bi-stochasticization analyses of the 3,107-county 1995-2000 U. S. migration network that were previously conducted (arXiv:1208.3428) for the Kullback-Leibler (beta = 1) and squared-Euclidean (beta = 2) cases. A heuristic, "greedy" algorithm is devised. In the beta = 1 case, the largest 25,329 of the 735,531 non-zero entries of the bi-stochasticized table are required to complete the widely applied two-stage procedure (double-standardization followed by strong-component hierarchical clustering); in the beta = 0 case, 105,363 of the 735,531 are needed, reflecting greater uniformity of the entries. The North Carolina counties of Mecklenburg (Charlotte) and Wake (Raleigh) are, in relative terms, considerably more cosmopolitan in the beta = 0 study. The Colorado county of El Paso (Colorado Springs) replaces the Florida Atlantic county of Brevard (the "Space Coast") as the most cosmopolitan, with Brevard becoming the second-most. Honolulu County splinters away from the other four (still-grouped) Hawaiian counties, becoming the fifth most cosmopolitan county nationwide. The five counties of Rhode Island remain intact as a regional entity, but the eight counties of Connecticut fragment, leaving only five counties clustered.
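The two-stage procedure named in the abstract can be illustrated on a toy table. The sketch below is not the paper's greedy beta = 0 algorithm: for the first stage it assumes plain iterative proportional fitting (alternating row and column scaling), which corresponds to the beta = 1 (Kullback-Leibler) setting. For the second stage it admits entries in descending order and counts how many are needed before the directed graph becomes a single strong component. The function names and the 4 x 4 `flows` table are hypothetical, chosen only for illustration.

```python
def double_standardize(table, iters=1000, tol=1e-12):
    """Alternately scale rows and columns so that all sums approach 1.

    Assumption: plain iterative proportional fitting (beta = 1 setting),
    not the greedy Itakura-Saito (beta = 0) procedure of the paper.
    """
    n = len(table)
    m = [[float(v) for v in row] for row in table]
    for _ in range(iters):
        for i in range(n):                      # row scaling
            s = sum(m[i])
            m[i] = [v / s for v in m[i]]
        col = [sum(m[i][j] for i in range(n)) for j in range(n)]
        for i in range(n):                      # column scaling
            m[i] = [m[i][j] / col[j] for j in range(n)]
        if max(abs(sum(row) - 1.0) for row in m) < tol:
            break
    return m


def reachable(adj, start):
    """Set of nodes reachable from `start` by depth-first search."""
    seen, stack = {start}, [start]
    while stack:
        u = stack.pop()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                stack.append(v)
    return seen


def strong_components(n, edges):
    """Strong components as forward-reach intersected with backward-reach."""
    fwd = [[] for _ in range(n)]
    bwd = [[] for _ in range(n)]
    for i, j in edges:
        fwd[i].append(j)
        bwd[j].append(i)
    comps, assigned = [], set()
    for s in range(n):
        if s not in assigned:
            comp = reachable(fwd, s) & reachable(bwd, s)
            comps.append(frozenset(comp))
            assigned |= comp
    return comps


def entries_needed(m):
    """Admit off-diagonal entries from largest to smallest; return how many
    are needed before all nodes form one strong component (None if never)."""
    n = len(m)
    ranked = sorted(
        ((m[i][j], i, j) for i in range(n) for j in range(n)
         if i != j and m[i][j] > 0),
        reverse=True,
    )
    edges = []
    for k, (_, i, j) in enumerate(ranked, start=1):
        edges.append((i, j))
        if len(strong_components(n, edges)) == 1:
            return k
    return None


# Hypothetical 4-area flow table (zero diagonal: no within-area moves).
flows = [
    [0, 90, 5, 5],
    [80, 0, 10, 10],
    [5, 10, 0, 70],
    [10, 5, 60, 0],
]
ds = double_standardize(flows)
k = entries_needed(ds)
print(k, "of", sum(v > 0 for row in ds for v in row), "non-zero entries needed")
```

In this reading, a table with more uniform entries tends to need more of them before the hierarchy completes, which is consistent with the abstract's comparison of 105,363 entries (beta = 0) against 25,329 (beta = 1).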
- Publication: arXiv e-prints
- Pub Date: October 2012
- DOI: 10.48550/arXiv.1210.1840
- arXiv: arXiv:1210.1840
- Bibcode: 2012arXiv1210.1840S
- Keywords: Physics - Physics and Society; Computer Science - Social and Information Networks; Statistics - Applications; 91C20; 62H30; 05C82; J.4
- E-Print: 39 pages, one 34-page dendrogram. Through further iterations of our heuristic, greedy algorithm, we are able to reduce the (Burg-entropy-based) objective function from the previously-reported 1.60316 x 10^{11} to 1.59538 x 10^{11}. Some significant clustering changes (in the ordering of cosmopolitan counties) are noted.
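For reference, the three cases compared in the abstract are members of the standard beta-divergence family, written here in its usual textbook form for scalars x, y > 0. The symbols x and y are assumptions of this note. The record does not state the order of the arguments or how entries are summed in the paper's objective function, so the block should be read as background rather than as the paper's exact formulation.

```latex
% Standard beta-divergence between positive scalars x and y.
d_\beta(x \mid y) =
\begin{cases}
  \dfrac{x^{\beta} + (\beta - 1)\, y^{\beta} - \beta\, x\, y^{\beta - 1}}{\beta (\beta - 1)},
    & \beta \notin \{0, 1\}, \\[2ex]
  x \log \dfrac{x}{y} - x + y,
    & \beta = 1 \quad \text{(Kullback-Leibler)}, \\[2ex]
  \dfrac{x}{y} - \log \dfrac{x}{y} - 1,
    & \beta = 0 \quad \text{(Itakura-Saito)}.
\end{cases}
% At beta = 2 the first line reduces to (x - y)^2 / 2 (squared Euclidean).
```

The beta = 0 member is the Bregman divergence generated by the Burg entropy, -log x, which matches the "Burg-entropy-based" wording of the e-print comment.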