Generating Shapefiles for Principal Cities in U.S. Core-Based Statistical Areas

A New York Times reporter whose work I enjoy recently posted the following request on Twitter:

I don’t think that particular GIS shapefile exists anywhere on the web. At least, I don’t think there is a ready-made shapefile of census places (e.g., cities, boroughs, towns, etc.) that is restricted only to principal cities in a CBSA. So I made one myself!

I grabbed the delineation files from here and used a few handy GIS operations to generate a shapefile of all principal cities in US core-based statistical areas. You can find my R script here.

For reference, according to the Office of Management and Budget:

The Principal City (or Cities) of a CBSA will include:

  • The largest incorporated place with a 2010 Census population of at least 10,000 in the CBSA or, if no incorporated place of at least 10,000 population is present in the CBSA, the largest incorporated place or census designated place in the CBSA; and
  • Any additional incorporated place or census designated place with a 2010 Census population of at least 250,000 or in which 100,000 or more persons work; and
  • Any additional incorporated place or census designated place with a 2010 Census population of at least 50,000, but less than 250,000, and in which the number of workers working in the place meets or exceeds the number of workers living in the place; and
  • Any additional incorporated place or census designated place with a 2010 Census population of at least 10,000, but less than 50,000, and at least one-third the population size of the largest place, and in which the number of workers working in the place meets or exceeds the number of workers living in the place.

There are several possible ways to generate this shapefile. First, you can use the entire polygon—i.e., the boundary for the city’s administrative jurisdictional—for every principal city in the United States. Here is a map1 that shows all principal cities as polygons:

Principal Cities in U.S. CBSAs, Polygons

However, that might not be very legible, as some boundaries are tiny and others are enormous. Thus, the data can also be transformed into points, generated by calculating the polygon’s centroid:

Principal Cities in U.S. CBSAs, Points

One last option is whether to keep all principal cities in the U.S. or only those that are the largest within their CBSA. Let’s look at the Albany-Schenectady-Troy, NY MSA. The largest city in the metro area is obviously Albany, but Schenectady and Troy do technically qualify as principal cities under the OMB definition above. To remove some clutter from the map, it may be useful only to plot the largest2 principal cities for each CBSA:

Largest Principal Cities in U.S. CBSAs, Points

I hope this is useful to someone down the road.

Data

PolygonsCentroids
All Central Cities.zip.zip
Largest Central City Only.zip.zip

  1. In every map displayed in this post, the polygons or points that represent CBSA principal cities are shaded according to their 2010 population, mostly just for aesthetic purposes. I also plotted the lower/contiguous 48 states in the maps to keep it simple…sorry Alaska and Hawaii! ↩︎

  2. Not pictured in this post is the polygon version of the largest principal cities map. A shapefile .zip is still available above. ↩︎

Andrew J. Van Leuven
Andrew J. Van Leuven
Assistant Professor of Community and Rural Development