├── Takeaway.png
├── .gitignore
├── LICENSE
├── cpi.csv
└── README.md
/Takeaway.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/brianckeegan/Bechdel/HEAD/Takeaway.png
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
1 | # Byte-compiled / optimized / DLL files
2 | __pycache__/
3 | *.py[cod]
4 |
5 | # C extensions
6 | *.so
7 |
8 | # Distribution / packaging
9 | .Python
10 | env/
11 | bin/
12 | build/
13 | develop-eggs/
14 | dist/
15 | eggs/
16 | lib/
17 | lib64/
18 | parts/
19 | sdist/
20 | var/
21 | *.egg-info/
22 | .installed.cfg
23 | *.egg
24 |
25 | # Installer logs
26 | pip-log.txt
27 | pip-delete-this-directory.txt
28 |
29 | # Unit test / coverage reports
30 | htmlcov/
31 | .tox/
32 | .coverage
33 | .cache
34 | nosetests.xml
35 | coverage.xml
36 |
37 | # Translations
38 | *.mo
39 |
40 | # Mr Developer
41 | .mr.developer.cfg
42 | .project
43 | .pydevproject
44 |
45 | # Rope
46 | .ropeproject
47 |
48 | # Django stuff:
49 | *.log
50 | *.pot
51 |
52 | # Sphinx documentation
53 | docs/_build/
54 |
55 |
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
1 | The MIT License (MIT)
2 |
3 | Copyright (c) 2014 Brian Keegan
4 |
5 | Permission is hereby granted, free of charge, to any person obtaining a copy
6 | of this software and associated documentation files (the "Software"), to deal
7 | in the Software without restriction, including without limitation the rights
8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 |
12 | The above copyright notice and this permission notice shall be included in
all
13 | copies or substantial portions of the Software.
14 |
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
--------------------------------------------------------------------------------
/cpi.csv:
--------------------------------------------------------------------------------
1 | Year,Annual,
2 | 1913,9.8
3 | 1914,10.0
4 | 1915,10.1
5 | 1916,10.4
6 | 1917,11.7
7 | 1918,14.0
8 | 1919,16.5
9 | 1920,19.3
10 | 1921,19.0
11 | 1922,16.9
12 | 1923,16.8
13 | 1924,17.3
14 | 1925,17.3
15 | 1926,17.9
16 | 1927,17.5
17 | 1928,17.3
18 | 1929,17.1
19 | 1930,17.1
20 | 1931,15.9
21 | 1932,14.3
22 | 1933,12.9
23 | 1934,13.2
24 | 1935,13.6
25 | 1936,13.8
26 | 1937,14.1
27 | 1938,14.2
28 | 1939,14.0
29 | 1940,13.9
30 | 1941,14.1
31 | 1942,15.7
32 | 1943,16.9
33 | 1944,17.4
34 | 1945,17.8
35 | 1946,18.2
36 | 1947,21.5
37 | 1948,23.7
38 | 1949,24.0
39 | 1950,23.5
40 | 1951,25.4
41 | 1952,26.5
42 | 1953,26.6
43 | 1954,26.9
44 | 1955,26.7
45 | 1956,26.8
46 | 1957,27.6
47 | 1958,28.6
48 | 1959,29.0
49 | 1960,29.3
50 | 1961,29.8
51 | 1962,30.0
52 | 1963,30.4
53 | 1964,30.9
54 | 1965,31.2
55 | 1966,31.8
56 | 1967,32.9
57 | 1968,34.1
58 | 1969,35.6
59 | 1970,37.8
60 | 1971,39.8
61 | 1972,41.1
62 | 1973,42.6
63 | 1974,46.6
64 | 1975,52.1
65 | 1976,55.6
66 | 1977,58.5
67 | 1978,62.5
68 | 1979,68.3
69 | 1980,77.8
70 | 1981,87.0
71 | 1982,94.3
72 | 1983,97.8
73 | 1984,101.9
74 | 1985,105.5
75 | 1986,109.6
76 | 1987,111.2
77 | 1988,115.7
78 | 1989,121.1
79 | 1990,127.4
80 | 1991,134.6
81 | 1992,138.1
82 | 1993,142.6
83 | 1994,146.2
84 | 1995,150.3
85 |
1996,154.4
86 | 1997,159.1
87 | 1998,161.6
88 | 1999,164.3
89 | 2000,168.8
90 | 2001,175.1
91 | 2002,177.1
92 | 2003,181.7
93 | 2004,185.2
94 | 2005,190.7
95 | 2006,198.3
96 | 2007,202.416
97 | 2008,211.080
98 | 2009,211.143
99 | 2010,216.687
100 | 2011,220.223
101 | 2012,226.665
102 | 2013,230.280
103 | 2014,233.916
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
1 | Bechdel
2 | =======
3 |
4 | I tried to retrieve and re-analyze the data that Hickey described in his article, but came to some conclusions that were the same, others that were very different, and still others that I hope are new.
5 |
6 | Without knowing the precise methods used, but making reasonable assumptions about what was done, I was able to replicate some of his findings but not others, because specific decisions about the data or modeling dramatically change the results of the statistical models. The article provides no specifics, so we're left to wonder when and where these findings hold, which points to the need for openness in sharing data and code. Specifically, while Hickey found that women's representation in movies had no significant relationship with revenue, I found a positive and significant relationship.
7 |
8 | But the questions and hypotheses Hickey posed about systematic biases in Hollywood were also the right ones. With a reanalysis using different methods and new data, I found that statistically significant differences also exist in popular ratings. These differences persist after controlling for each other and for other potential explanations such as genre, MPAA rating, time, and other effects.
9 |
10 | In the image below, we see that movies with non-trivial women's roles get 24% lower budgets, make 55% more revenue, get better reviews from critics, and face harsher criticism from IMDB users. Faded bars mean my models are less confident that a finding is non-random (higher p-values), while darker bars mean my models are more confident that the finding is significant (lower p-values).
11 |
12 | Movies passing the Bechdel test (the red bars):
13 |
14 | * ...receive budgets that are 24% *smaller*
15 |
16 | * ...make 55% *more* revenue
17 |
18 | * ...are awarded 1.8 *more* Metacritic points by professional reviewers
19 |
20 | * ...are awarded 0.12 *fewer* stars by IMDB's amateur reviewers
21 |
22 | ![Summary plot of Bechdel statistics](https://raw.githubusercontent.com/brianckeegan/Bechdel/master/Takeaway.png)
23 |
--------------------------------------------------------------------------------
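The repository ships `cpi.csv`, a table of annual CPI values from 1913 to 2014, but the README does not say how it is used. One plausible use is converting nominal budgets and revenues into constant dollars before comparing movies across decades. The following is a minimal sketch of that conversion, not the author's confirmed method. It assumes 2014 as the base year, `load_cpi` and `to_real_dollars` are hypothetical helper names that do not come from this repository, and the embedded excerpt holds three rows copied from `cpi.csv`.

```python
import csv
import io

# Three rows copied from cpi.csv; the header keeps the file's trailing comma.
CPI_EXCERPT = """Year,Annual,
1980,77.8
2000,168.8
2014,233.916
"""


def load_cpi(handle):
    """Map year -> annual CPI from a file-like object shaped like cpi.csv."""
    reader = csv.DictReader(handle)
    # The trailing comma in the header yields an extra empty column; it is ignored.
    return {int(row["Year"]): float(row["Annual"]) for row in reader}


def to_real_dollars(amount, year, cpi, base_year=2014):
    """Rescale a nominal amount from `year` into `base_year` dollars.

    The 2014 default is an assumption made for this sketch.
    """
    return amount * cpi[base_year] / cpi[year]


cpi = load_cpi(io.StringIO(CPI_EXCERPT))
# A hypothetical $100 spent in 1980, expressed in 2014 dollars.
adjusted = to_real_dollars(100.0, 1980, cpi)
print(round(adjusted, 2))
```

To run this against the full table, pass an open file handle for `cpi.csv` to `load_cpi` instead of the `io.StringIO` excerpt.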