Back to [Archive] Post-review discussions

1
[OQSPS] The Political Attitudes of British Academics
Admin
[]Journal:
[]Open Quantitative Sociology & Political Science

[]Authors:
[]Noah Carl

[]Title:
[]The Political Attitudes of British Academics

[]Abstract:
[]Carl (2017) recently published a report claiming that individuals with left-wing and liberal views are overrepresented in British academia. One weakness of this report was that it relied almost exclusively on party support data. Using data from the 2015 wave of the British Election Study Panel, the present study confirms that the political attitudes of British academics are somewhat more economically left-wing (0.38sd), and are substantially more socially liberal (0.84sd), than those of the general population. It also documents that British academics are substantially more likely to read The Guardian newspaper (the UK’s most left-liberal newspaper) than members of the general population (31 ppts). Adjusting for demographic characteristics, education and openness to experience reduces the difference on social liberalism by 0.20sd, and reduces the difference on Guardian readership by 5 ppts, but increases the difference on economic leftism by 0.07sd.

[]Key words:
[]Academics; political attitudes; left-wing; liberal; education; openness

Length:
[]~3,000[] words, 13[] pages.
[]
Files:

https://osf.io/q9e79/

[]Data can be accessed on the website for the BES:
[]http://www.britishelectionstudy.com [b] [/b]
Admin
I will review this, but I haven't had time yet. Very stressed at the job.
Admin
Noah,

I read your study with interest. In general I think it is promising but needs some work. Find my comments below.

Data
The data are not available. I take it that this is due them being secret/apply-only. Please note that if so. If not, please upload them to OSF (just the relevant variables if necessary due to size limitations). The paper notes that the data are available. This is not true.

---


This report was subjected to several criticisms


Subject?


--

Using data from the 2015 wave of the British Election Study Panel, the present study confirms that the political attitudes of British academics are indeed both more left-wing and more liberal than those of the general population


I note that this uses the economist tradition of stating the findings in the introduction. Seems redundant as they were already mentioned in the abstract above, meaning they get repeated 3 times: abstract, intro, conclusion.

--

(n = ~30,000)



Use ≈

--

the reference category for this variable can be considered to be the general population


It would be more proper to call this "non-academics". For some analyses, it will matter whether one excludes 107 persons or not. This happens in when academics are highly discrepant from the non-academics in some variable, especially if it concerns a discrete variable.

The general population is, well, generally taken to refer to everybody.

--

(respondents who said that they did not read a daily newspaper were coded as missing)



Careful. Now you are contrasting Guardian readers vs. newspaper readers who are not Guardian readers. This is quite different from Guardian readers vs. non-Guardian readers.

--

The following covariates were utilized: age, gender, ethnicity, region, education, self-rated openness to experience



Was O a single item, or based on multiple OCEAN/Big five items? Presumably single-item ratings have strong measurement error, and quite likely substantial systematic error related to self-concept.

--

An important caveat is that two of the results in Table 1 (specifically, those in the first and second columns) were not robust to applying sampling weights; indeed, they were rendered non-significant by doing so. (Full weighted results are given in Appendix B.)



Judging from the table data, it seems likely that the p values are just calculated incorrectly. Perhaps it uses the sampling weights as df.

E.g. a predictor in original model with beta = .38 had a p value of <.001, but p > .05 when beta = .28.

Under the most pessimistic situation, the p value of the first is .00099, which gives a z of -3.093. This implies the SE is 0.123. To find the cutoff for p =.05, thus, we find z = 1.96, which is at about .24. Thus, baring changes in predictor relations, the beta of .28 should have p  somewhat below .05, even under the worst possible assumption.

--

This is somewhat surprising, since one would have expected that, if the difference observed in the first column were attributable to non-random sampling, then it would have
disappeared after controlling for demographic characteristics such as age, gender, ethnicity and region (as in the second column).



Yes. It has to result from non-random sampling with some variable not among these or strongly related to them. Candidates?

--

Appendix C shows that the distribution of party support among academics in the BES is more similar to the distribution of party support among academics in Understanding Society



Please quantify this, e.g. Pearson correlation.

--

Table 1. Estimates from OLS models of economic leftism



Please report correlation matrix of all primary variables. Remember that some readers may be more interested in other predictors, e.g. for meta-analysis.

Did you do the coding correctly this time? Dummy variables cannot be entered as numeric variables. You must set them to dummy status, otherwise the model treats them as numeric variables and underestimates the effect sizes.

It would be useful to show all the betas (in appendix if you want). Readers cannot compare the relative importance of variables when they are grouped and betas not reported, aside indirectly via delta R2.


--

Age, age squared,



Age^2 is not a good way to control for non-linear effects. Please use a spline or similar flexible method. You have sufficient sample size for this.

--

Education



Note that including education as a co-predictor is problematic. It is likely that there is political preference-based self-selection into higher education. Thus, including it as a predictor controls for a mediator to some degree.

The same is true for gender and region.

If you used a path/sem model you could model this appropriately.


--

Openness to experience



What about the overlap of measurement criticism? Some O items concern political stuff very similar or identical to your social items.


--

R2



Are these R2 adjusted or not?


--


Table 3. Estimates from OLS models of Guardian readership.



You cannot use OLS for a binary outcome! Please use a logistic model.

--



social homophily and political typing, individual conformity, status inconsistency, and discrimination (see Carl, 2015b; Carl, 2017a)



It's probably wise to cite something not your own research, e.g. Duarte et al, or those studies of discrimination admissions.

--

Table A.1. Correlation matrix for measures of economic leftism.



The last question is arguably misincluded. It concerns the environment, not economics. By my eye-balling, it also has the weakest relationship to the other variables.

Some questions are hard to interpret. You should include full texts, maybe in a separate table.

You should add factor loadings as a special last column. Same for the second matrix.

--

Were the distributions of politics normal? Please include distribution plots. You can color the two subgroups differently, same as done in e.g. Bell Curve.

--

PDF attached with comments in places.
Admin
Many thanks for the review, Emil. Replies below.

The data are not available. I take it that this is due them being secret/apply-only.


The BES data should be available for download. Please check this link.

Subject?


'Subjected to' means 'received' whereas 'subject to' means 'susceptible to', so the former is correct here.

I note that this uses the economist tradition of stating the findings in the introduction. Seems redundant as they were already mentioned in the abstract above, meaning they get repeated 3 times: abstract, intro, conclusion.


I have altered the relevant sentence so that it now says:

"the present study explores whether the political attitudes of British academics are indeed both more left-wing and more liberal than those of the general population."

Use ≈


This has been changed.

It would be more proper to call this "non-academics".


I have altered the relevant sentence so that it now says:

"Insofar as academics comprise such a small share of the sample (0.3%), the reference category for this variable can be considered to be the general population, although strictly speaking it represents all nonacademics (99.7%)."

Careful. Now you are contrasting Guardian readers vs. newspaper readers who are not Guardian readers.


I have added the following sentence:

"The reference category for this variable is therefore the population of individuals who read some other daily newspaper."

Was O a single item, or based on multiple OCEAN/Big five items?


I have added the following sentence:

"The latter measure is based on the Ten Item Personality Test (TIPI; Gosling et al., 2003), and is included in the dataset as a single variable scaled from 0–10."

Judging from the table data, it seems likely that the p values are just calculated incorrectly.


The p-values for weighted estimates were computed by Stata, and I believe they are correct. Weighting affects the standard errors, as well as the point estimates.

Yes. It has to result from non-random sampling with some variable not among these or strongly related to them. Candidates?


Not sure, unfortunately.

Please quantify this, e.g. Pearson correlation.


I have added a footnote on p. 5, which states the following:

"The correlation between the unweighted distribution from the BES and the average of the two distributions from Understanding society is r = .93 for both the broad and narrow definitions of party identity. By contrast, the correlation between the weighted distribution from the BES and the average of the two distributions from Understanding society is r = .65 for the broad definition of party identity and r = .64 for the narrow definition."

Please report correlation matrix of all primary variables. Remember that some readers may be more interested in other predictors, e.g. for meta-analysis.


I would prefer not to report this. If readers want to find out the bivariate correlations, they can download the data, and run my Stata code.

Age^2 is not a good way to control for non-linear effects.


I have now included dummies for age quintiles in the models instead, but it made essentially no difference.

Note that including education as a co-predictor is problematic... What about the overlap of measurement criticism? Some O items concern political stuff very similar or identical to your social items.


I have added the following statements on p. 4:

"Note that the reason for utilizing education and openness to experience is that each has been posited to at least partially account for the left-liberal skew of academia (see Gross, 2013; Duarte et al., 2014; Carl, 2017). I.e., it has been asserted that academics tend to be have more left-liberal attitudes due to their higher education and greater openness to experience. Including these variables as covariates in a multiple regression analysis allows one to estimate how much of the skew they do in fact account for."

Are these R2 adjusted or not?


Non-adjusted, but it makes very little difference.

You cannot use OLS for a binary outcome! Please use a logistic model.


I got this criticism from another reviewer recently, and I disagree. So I will repeat what I said to that reviewer:

The linear probability model (LPM; i.e., OLS with a binary dependent variable) is widely used in the economics literature, and is now preferred to logit and probit by many econometricians. The two main reasons are: greater interpretability, and lack of small sample bias that afflicts maximum likelihood estimation when specifying fixed effects. 

The conventional criticism of the LPM, namely that predicted probabilities may fall outside the interval 0–1, is not relevant if one’s purpose is simply to estimate the marginal effect of an independent variable. As Wooldridge (2002) notes in his seminal textbook on econometrics (Econometric Analysis of Cross-Section and Panel Data):

“If the main purpose is to estimate the partial effect of [the independent variable] on the response probability, averaged across the distribution of [the independent variable], then the fact that some predicted values are outside the unit interval may not be very important.”

Similarly, in the blog for their own econometrics textbook (Mostly Harmless Econometrics), Angrist and Pischke (2012) write:

“If the conditional expectation function (CEF) is linear, as it is for a saturated model, regression gives the CEF – even for LPM. If the CEF is non-linear, regression approximates the CEF. Usually it does it pretty well. Obviously, the LPM won’t give the true marginal effects from the right nonlinear model. But then, the same is true for the “wrong” nonlinear model! The fact that we have a probit, a logit, and the LPM is just a statement to the fact that we don’t know what the “right” model is. Hence, there is a lot to be said for sticking to a linear regression function as compared to a fairly arbitrary choice of a non-linear one! Nonlinearity per se is a red herring.”

Moreover, as the economist Marc Bellmere (2013) notes on his blog (please see also Allison, 2012; Smart, 2013): “The probit and logit are not well-suited to the use of fixed effects because of the incidental parameters problem.” 

Allison, P. (2012). Logistic Regression for Rare Events. Statistical Horizons, available online.
Smart, F. (2013). Incidental Parameters Problem with Binary Response Data and Unobserved Individual Effects. Econometrics By Simulation, available online.

It's probably wise to cite something not your own research


I have cited Duarte et al. (2014).

The last question is arguably misincluded. It concerns the environment, not economics.


I have replaced this item with an item pertaining to deficit reduction.

Some questions are hard to interpret. You should include full texts


Full wording for all items have been included in Appendix A.

Were the distributions of politics normal? Please include distribution plots.


The distributions have been included in the new Appendix B.

Latest files available here.
Admin
Noah,

Sorry for the delayed review.

Just two suggestions.

1. Consider including the text of the two O questions from TIPI, so that readers can judge the issue of construct overlap.
2. Consider examining the fitted values when you for your OLS model for binary outcome. Are they outside the possible range? If not, I guess you don't have problems. If they are, then it may be a problem.

I approve of this paper.
Admin
I asked Bryan Pesta to review this, pending reply.

Edited: Bryan says he will review it.
I asked Bryan Pesta to review this, pending reply.

Edited: Bryan says he will review it.


I'm close to done, please bear with me, and sorry for the delay!

B
[align=center]Review of “the Political Attitudes of British Academics,” by Noah Carl[/align]
 
In general, I think the manuscript is worthy of publication here. I see it as a brief report, and I suggest that the appendices be separated from the “main” manuscript and be included as a supplementary materials file.
 
1. I can perhaps see why education and openness are control variables in the study, but the rest of the control variables are somewhat arbitrary. I don’t know why age, gender, region, etc., should co-vary with Academic, nor do you tell us why. I suggest dropping all of these other control variables (or relegating analyses that use them to the SM file).
 
2. Can you calculate / report the reliability of the Openness scale?
 
3. A simple correlation matrix is needed as a Table 1, which includes all variables used in the study. You should also report Mean / SD for each.
 
4. You should report mean difference tests, specifically (in addition to the regressions you have) on both education, and Openness by Academic. The correlation matrix in (3) answers the significance question, but I’d like to see how the means on these two variables differ for Academics and non-.
 
3. Table 1 needs to be redrawn as a hierarchical regression. If you decide to drop the variables mentioned in (1) above, then Step 1 should be “Academic,” Step 2 should be Education (or Openness), and Step 3 should be Openness (or Education). Combining Steps 2 and 3 is fine, but the standardized and un-standardized betas / errors should be included in every row.
 
4. The sample size between Academic and non- is massively lopsided. I do not know how / if this affects regression results and / or hypothesis testing.
 
5. Except for maybe the last sentence, the paper is rather a-theoretical. This is not a huge problem for me, given what you are trying to show here (and that this is a Brief Report), but reviewers have frequently dinged my papers on the same issue.
 
6. Please exclude all “**” and “***” indicators of whether your effects are significant at this or that level. I don’t think they are appropriate for regression in general (I think the Beta weights speak for themselves), and especially when the sample size is 20+ thousand. Note that only one value in Table 1 is significant at something other than “***,” p < .001.
Admin
Many thanks to Bryan Pesta for the review. New files have been uploaded. Please find my responses below:


In general, I think the manuscript is worthy of publication here. I see it as a brief report, and I suggest that the appendices be separated from the “main” manuscript and be included as a supplementary materials file.

 
I would prefer not to separate the appendices from the main text. Given that the journal is online-only, there is no space constraint. And I imagine that being able to simply scroll-down to the appendices will make life easier for the reader. 


1. I can perhaps see why education and openness are control variables in the study, but the rest of the control variables are somewhat arbitrary. I don’t know why age, gender, region, etc., should co-vary with Academic, nor do you tell us why. I suggest dropping all of these other control variables (or relegating analyses that use them to the SM file).

 
I have added a paragraph on page 4 explaining why the demographic covariates were included.


2. Can you calculate / report the reliability of the Openness scale?

 


3. A simple correlation matrix is needed as a Table 1, which includes all variables used in the study. You should also report Mean / SD for each.
 
4. You should report mean difference tests, specifically (in addition to the regressions you have) on both education, and Openness by Academic. The correlation matrix in (3) answers the significance question, but I’d like to see how the means on these two variables differ for Academics and non-.
 
3. Table 1 needs to be redrawn as a hierarchical regression. If you decide to drop the variables mentioned in (1) above, then Step 1 should be “Academic,” Step 2 should be Education (or Openness), and Step 3 should be Openness (or Education). Combining Steps 2 and 3 is fine, but the standardized and un-standardized betas / errors should be included in every row.
 
4. The sample size between Academic and non- is massively lopsided. I do not know how / if this affects regression results and / or hypothesis testing.
 
5. Except for maybe the last sentence, the paper is rather a-theoretical. This is not a huge problem for me, given what you are trying to show here (and that this is a Brief Report), but reviewers have frequently dinged my papers on the same issue.
 
6. Please exclude all “**” and “***” indicators of whether your effects are significant at this or that level. I don’t think they are appropriate for regression in general (I think the Beta weights speak for themselves), and especially when the sample size is 20+ thousand. Note that only one value in Table 1 is significant at something other than “***,” p < .001.
Admin
Many thanks to Bryan Pesta for the helpful review. New files have been uploaded. Please find my responses below:

In general, I think the manuscript is worthy of publication here. I see it as a brief report, and I suggest that the appendices be separated from the “main” manuscript and be included as a supplementary materials file.


I would prefer not to separate the appendices from the main text. Since the journal is online-only, there are no constraints on space, and the paper is only 17 pages in total. Moreover, I imagine that being able to simply scroll-down to the appendices would make life easier for the reader.

1. I can perhaps see why education and openness are control variables in the study, but the rest of the control variables are somewhat arbitrary. I don’t know why age, gender, region, etc., should co-vary with Academic, nor do you tell us why.


I have added a new paragraph on p. 4 explaining why these demographic covariates were included. 

2. Can you calculate / report the reliability of the Openness scale?


I have reported the reliability of the scale in a footnote on p. 4. 

3. A simple correlation matrix is needed as a Table 1, which includes all variables used in the study. You should also report Mean / SD for each.


It is inconvenient (and not particularly informative) to report a correlation matrix for all the variables used in the study, since many of them comprise sets of dummy variables (e.g., age groups, education categories). However, I have added a correlation matrix for the key variables utilised, which can be found in the new Appendix C. 

4. You should report mean difference tests, specifically (in addition to the regressions you have) on both education, and Openness by Academic. The correlation matrix in (3) answers the significance question, but I’d like to see how the means on these two variables differ for Academics and non-.


On p 4, I have reported differences between academics and non-academics with respect to both education and openness to experience.

3. Table 1 needs to be redrawn as a hierarchical regression. If you decide to drop the variables mentioned in (1) above, then Step 1 should be “Academic,” Step 2 should be Education (or Openness), and Step 3 should be Openness (or Education).


I would prefer not to run hierarchical regressions. In my opinion, the way I have reported the regression results is the most informative so far as the main topic of the paper is concerned. That is to say, the tables demonstrate that academics are substantially more socially liberal and economically left-wing than the general population, and that these differences are relatively robust. Hierarchical regression is very rarely if ever used in sociology (perhaps it is more common in psychology journals.)

However, I have noted on p. 5 that "both education and openness to experience had independent, statistically significant effects on all three dependent variables".

4. The sample size between Academic and non- is massively lopsided. I do not know how / if this affects regression results and / or hypothesis testing.


As far as I'm aware, this shouldn't impact the regressions. I have entered academic as a dummy variable, so the coefficient is just the difference between the means for academics and non-academics.

5. Except for maybe the last sentence, the paper is rather a-theoretical. This is not a huge problem for me, given what you are trying to show here (and that this is a Brief Report), but reviewers have frequently dinged my papers on the same issue.


The aim of the paper is to show critics of the earlier report (see citations therein) that academics are indeed more socially liberal and economically left-wing than the general population. Hence I would prefer not to introduce an extended theory section. But I have added the following footnote on p. 3: 

"For a discussion of the theoretical mechanisms by which the academy’s left-liberal skew arose, see Gross (2013), Duarte et al. (2014) and Carl (2015b, 2017)."

6. Please exclude all “**” and “***” indicators of whether your effects are significant at this or that level. I don’t think they are appropriate for regression in general (I think the Beta weights speak for themselves), and especially when the sample size is 20+ thousand.

 
I would prefer not to remove the asterisks. Including them is standard practice in sociology, political science and empirical economics. They provide a simple, clear way for the reader to check the significance of results. And although the overall sample size is >20,000, the number of academics is relatively small (n = 107), so one shouldn't take the significance of results for granted.
Noah, I didn't notice till just recently that you had replied. Sorry for the extreme delay. I'm fine with your revisions, and for the reasons you list for not including some of my suggestions. I recommend accept. Bryan
Admin
Author has submitted the final version, and it has been published.

https://openpsych.net/paper/56
1