Exploring the Seats-Votes Curve

A historical primer on Congressional seat-votes curves and partisan bias, symmetry, and responsiveness, from 2000 - 2016.


By Jeffrey Shen

Image courtesy of Wikipedia

What is the Seats-Votes Curve?

The seats-votes curve is a computer generated graph that plots the portion of seats a party will win for a certain vote share. The shape of the seats-votes curve provides information on the partisan characteristics of a particular state, such as how election outcomes are effected by changes in voting and whether elections are fair to both parties. In particular, there are three main statistics that can be derived from it:


  1. Partisan Bias (a.k.a, Gelman-King Bias): The difference between a party’s seat share at 50% of the vote and 50%. For example, if Republicans won 60% of the seats at 50% of the vote, the partisan bias would be 10%. The partisan bias calculated from the Republican curve should be the same as that calculated from the Democrat curve. Partisan bias is indicative of how much a districting plan favors one particular party, with 0% being the least biased, and 50% being the most biased. A positive partisan bias score indicates that it favors Republicans. Intuitively, it makes sense that both parties should win 50% of the seats at 50% of the vote; After all, this is both proportional and symmetric.

    However, partisan bias only captures the characteristics of a seats-votes curve at the 50% vote share. Furthermore, since seats-votes curves are generated using computer algorithms—not real elections results—partisan bias is less accurate for non-competitive elections, where the vote share is far from 50%.


  2. Partisan Symmetry: The average distance between the seats-votes curve of one party and that of the other party, over a range of vote shares (most commonly between 45% and 55% of the vote). It is widely agreed upon that both parties should receive the same percentage of seats at the same vote share in a "fair" districting plan. In other words, in a fair plan, Republicans can get 50% of the seats at 40% of the vote so long as Democrats also get 50% of the seats at 40% of the votes. Partisan symmetry measures how similar the seats-votes curve for Republicans is to the seats-votes curve for Democrats, thereby measuring the aforementioned standard. An ideal score would be 0%, meaning that the seats-votes curve for Republicans and Democrats are identical (symmetric) and the worse score is 50%. A positive symmetry score indicates that it favors Republicans.


  3. Responsiveness: The average slope of the seats-votes curve, over a range of vote shares (most commonly between 45% and 55% of the vote). Unlike the previous two measures, there is no ideal responsiveness score; Rather, responsiveness describes how votes translate into seats in a state. A low responsiveness score (less than 1) would indicate an incumbent protection gerrymander, meaning that seat shares are unresponsive to changes in voting, and therefore the districting plan helps preserve the seats of incumbents, whether they be Democrat or Republican. A responsiveness score of approximately 1 would indicate a proportional districting plan, meaning a 1% vote increase translates into a 1% seat increase. A high responsiveness score (greater than 1) would indicate a heavy "winner-take-all" characteristic, meaning the majority party tends to win more seats than votes. Winner-take-all plans are characteristic of most American elections. Lastly, a responsiveness score around 2 would indicate a high level of compliance with the "Efficiency Gap" (EG), a recently developed measure of partisan gerrymandering that you can read about here.

    It's important to recognize that responsiveness does not favor one particular party; Low responsiveness protects incumbents of both parties (although there still may be a partisan bias, for example) and high responsiveness will always benefit the majority party. In this regard, responsiveness can be tweaked to benefit specific parties—say, a majority party wants to increase responsiveness to secure more seats—but there is no set standard for responsiveness.

Methodology

How I generate the seats-votes curve

The seats-votes curve is generated using mostly hypothetical, computer generated election results:


  1. Start with the actual voting result for the state. Say (as seen in the table below), Democrats won 50% of the vote but 60% of the seats in the states overall. Then, we could plot the point (50, 60) on the Democrat curve, and (50, 40) on the Republican curve (as seen in the graph below).

  2. DistrictVotes for DemocratsVotes for Republicans
    16040
    25545
    37030
    43070
    53565
    Total250250


  3. Add endpoints. For every state, 0% of the vote for one party will always result in 0% of the seats, and 100% of the vote for one party will always result in 100% of the seats. Therefore, the points (0,0) and (100,100) can be added to both the Democrat and Republican curve.

  4. Fill in the rest of the curve using an assumption known as uniform partisan swing, which assumes that each district is representative of the state as a whole (Clearly, this assumption is flawed, as districts are often not microcosms of states, but it is convenient for the purposes of generating a hypothetical seats-votes curve. The nature of this assumption means that seats-votes curves generally lose validity towards the extremes). Essentially, starting from the initial election results, add 1% of the vote share to one party, and decrease 1% of the vote share for the other party in each district. The change in vote share will eventually result in a change of seat shares state-wide, and the seat-votes curve will gradually be filled in from (0,0) to (100, 100) for both parties.

  5. The resulting seats-votes curve will be a step function, as under the uniform partisan swing model, votes don't smoothly translate into seats; There will be plateaus, followed by a vertical jump / fall when a district changes party hands. In order to smooth the curve out, apply a 5% (as per the findings of Gerrymandering in America) variation to the voting results in each district. The goal of adding a 5% random variation is to simulate the small factor of randomness present in each election. So, instead of adding 1% to each district, add 1% plus a random number between -5% and 5%. This variation should be simulated 1000 times for each vote level and averaged out, resulting in a smoother curve.

Source code for the algorithm to generate seats-votes curves is available here.


Assumptions Made / Special Notes

  • Uncontested districts (district where only one party produced candidates) technically have a vote share of 100% to 0%. However, treating uncontested districts as such in the seats-votes algorithm would skew data on a statewide scale. Therefore, we assume that in uncontested districts, the non-represented party would have garnered 25% of the vote had it run in the district. So, instead of a 100-0 vote split, uncontested districts have a 75-25 vote split (25% was chosen-albeit arbitrarily-because it was assumed that uncontested districts are by nature uncompetitive, but not so uncompetitive as to garner 0%).
  • Under the uniform partisan swing algorithm described above it is possible for the vote share in a district to go over 100%, which is clearly unrealistic. The seats-votes curves generated below cap off the vote share in any district at 100%, with the excess being distributed evenly across the other districts.
  • Write-in candidates are not calculated for either party.
  • Third party candidates are not included in the calculations, as America lacks major third-parties on a national scale. Therefore, potential generation of a seats-votes curve for third parties would likely be skewed.
  • For Louisiana, where there can be more than one Democrat or Republican running in the general election, the number of votes for each party was tallied as the total number of votes cast for candidates of that party.
  • States with less than 5 districts have been excluded from the graphs below, as they lack a large enough sample size for an accurate curve.
  • Theoretically, the seat share for Republicans and Democrats at 50% of the vote should add up to 100%. However, with the seats-votes curves generated below, the seat shares don't always add up to exactly 100% due to factors such as computational inaccuracies, third party candidates, highly uncompetitive or uncontested elections, etc.

Explore Seats-Votes Curves from 2000-2016

Bibliography

Datasets used

Election Results [csv]. (n.d.). Retrieved from https://transition.fec.gov/pubrec/
  electionresults.shtml


Sources referenced

McGann, A. J., Smith, C. A., Latner, M., & Keenan, A. (2016). Gerrymandering in
  America: The House of Representatives, the Supreme Court, and the future of
  popular sovereignty
. Cambridge: Cambridge University Press.


Petry, E. (n.d.). How the Efficiency Gap Works. Retrieved May 24, 2018, from
  Brennan Center for Justice website: https://www.brennancenter.org/sites/
  default/files/legal-work/How_the_Efficiency_Gap_Standard_Works.pdf


People referenced

I received help and advice from Jason Yung, Herbert Turner, and Kevin Sun.