Google Scholar Metrics and Scholarly Productivity in International Relations

28 Comments

Jeff Colgan on 6 August 2015 at 14.42 EDT

Cullen, thanks for offering a fact base for us to think about! Interesting stuff here. I would only caution you to take more seriously the caveat you add at the end about treating GS citations as only one tool among many. You undermine it earlier in your post when you imply that published reviews give us little insight on quality, whereas GS citations do. I have a working paper that highlights some of the key limitations to GS citations, especially in regard to how we think about scholarly influence. I find that many IR pieces with lots of GS citations are rarely taught (at least in the US), and conversely some of the “canonical”, widely-taught articles have relatively few GS citations. I’m reluctant to say more because the working paper is *almost* through the review process, but not yet.
- Cullen Hendrix on 6 August 2015 at 14.52 EDT
  
  Jeff, definitely looking forward to reading it when it eventually comes out. I’d only point out the following: whether something shows up on graduate and undergraduate syllabi is another imperfect measure of impact, based on specific assumptions about what matters to the discipline. GS-based metrics are no different, but I’d argue they make weaker assumptions about what matters.
Michael C. Horowitz on 6 August 2015 at 15.09 EDT

Awesome article. Really, really useful and interesting. Just to push on one thing for further discussion – I can think of 2 places where someone might have a pretty low citation
count, but I’m not sure it would be fair to call their work a dud: 1) If you wrote *the* book about something, and it’s a great book, but most scholars are chasing a kickball elsewhere on the field. That is to say, if the book is actually really good, but everyone is obsessed with whatever the thing is of the moment, I’m not sure it means your work is a dud. It might mean it’s not at the core of where the field is at a given point (and over an entire career, perhaps that evens out) 2) If you wrote *the* book or article that basically ends a research agenda or debate on a topic by resolving things to the general satisfaction of the field, and then people
move on to something else.
Steven T. Zech on 6 August 2015 at 15.23 EDT

Great piece Cullen, extremely interesting! A quick point about why we may cite certain work. Many see citations in terms of a network approach (there’s actually a large academic literature on this in several disciplines). Empirically, citation data seem to generate a “scale-free network” with a power-law distribution — i.e. preferential attachment in which newcomers form ties with existing nodes with a probability proportional to the degree count of those existing nodes. Network “hubs” that are already heavily cited continue to get cited because they are often first on the scene in an emerging research agenda, though sometimes not the best work. I’m not sure the resulting citation data metrics actually capture scholarly “productivity” (which you recognize in the piece). If the goal is objective, data-driven evaluation, these network dynamics seem to suggest an issue with measurement validity. Citation metrics might indicate “innovation,” “agenda-setting,” “academic creativity,” “fad-capture,” or many other concepts in addition to or aside from “productivity.” Just a few things to think about… Better metrics certainly could compliment subjective evaluations and I look forward to seeing how this develops!
- Cullen Hendrix on 6 August 2015 at 16.52 EDT
  
  Steve, this is really thoughtful stuff. I am in full agreement that reputation and human networks go a long way in explaining the scholarly networks that arise. My quick-and-dirty response is simply that the same issues bedevil non-metrics based evaluations, including what you mean when you say “the best work” (hence the part about being well liked, of having an influential advisor, etc.). There is no equivalent of AP exam grading (anonymous graders grading anonymous writers with no specific stake in the outcome) for academic work. It exists in a social space. I’m just more worried about the small-sample properties of that social space (i.e., recommendation/tenure letter writers and reviewers) than those that might arise from analyzing citations.
  
  I think you are definitely on to something in that citations are probably capturing a lot of other stuff, although I’d like to think innovation, agenda-setting, creativity, and even fad capture are net positives in terms of evaluating scholarly profiles.
Sverrir on 6 August 2015 at 15.59 EDT

Could you provide a list of the 1,000 most cited IR scholars?
paul on 6 August 2015 at 19.49 EDT

The Moneyball reference is interesting. Sabremetrics and quantitative reasoning has actually been downplayed in recent years. It was something of a fad. The qualitative scouts are back in demand, actually.
- Cullen Hendrix on 9 August 2015 at 11.16 EDT
  
  Nowadays, virtually all teams use both.
David Samuels on 7 August 2015 at 17.55 EDT

Cullen – excellent post, really enjoyed reading it. A couple of things to add, basd on my 2 articles on citation patterns in PS. You wonder about selection bias, which implies the counts may be higher than IRL. True, but GS also under-counts scholarly impact because it does not count citations articles receive in books – and comparativists and IR scholars publish lots of books. (The only way to find cites of an article in a book is to enter the article title as a quote in Google Books. At least for the books Google has scanned/digitized, this will give you an iudea of how many times your article has been cited in a book.) Second, GS doesn’t accurately count citations OF books, again because it can’t pick up the citations that a book receives in other books. So GS is really unfriendly to scholars whose “big hits” are books, and also under-counts scholarship that has a big impact among other scholars who publish books.
- Cullen Hendrix on 9 August 2015 at 11.16 EDT
  
  David,
  
  Thanks for your response. Do we have any idea how large the undercounting effect is? I searched Activists Beyond Borders and it certainly seems highly cited: https://scholar.google.com/scholar?hl=en&q=activists+beyond+borders&btnG=&as_sdt=1%2C6&as_sdtp=
  - David Samuels on 9 August 2015 at 11.43 EDT
    
    Cullen – Let me reiterate that there are two issues at stake. The first is that GS undercounts citations that ARTICLES receive, because it undercounts citations that articles receive IN BOOKS. Counting cites articles receive in books reveals GS undercounts cites by about 50%, depending on the subfield (undercounting is worse in IR and CP). The second issue is that GS doesn’t count citations that BOOKS receive very well, either in articles or in other books. Activists Beyond Borders is probably not an optimal example, as it is in the top .000001% of whatever distribution you choose to use. Somewhat solipsistically in the 2013 PS article I used my own 2003 book as an example. GS only returned 6 citations IN BOOKS in the 1st five years, but GB showed many more. It’s not quite ‘fair’ to compare books against articles (in my dept we resolve the apples/oranges argument by counting a Univ press book as equal to 3 top-tier articles, FWIW) but after a slog of counting by hand (web scraping wouldn’t work b/c of lots of garbage/repeats in GB) I could confirm that that approach actually makes some sense b/c the average university-press book receives about 3X the citations that an SSCI-indexed article does. About 1/3 of those cites come in articles, 2/3 in other books.
    - Cullen Hendrix on 10 August 2015 at 13.54 EDT
      
      David – thanks. I hadn’t seen your PS piece, so this is helpful. Knowing something about undercounting would presumably allow for adjustment.
    - Cullen Hendrix on 10 August 2015 at 14.13 EDT
      
      The average uni-press book receives 3X the citations as an SSCI-indexed article? What’s three times zero? ;-)
      - David Samuels on 10 August 2015 at 14.19 EDT
        
        Haha, but Gary King’s famous line (which I think he cribbed from someone else) vastly overstates the incidence of zero-cited work, especially (this was the point of the article, after all) when you count citations that articles receive in books.
- Andy Hom on 20 August 2015 at 18.18 EDT
  
  Really interesting discussion all around, thanks Cullen et al. Just a quick question from David’s point here, I just checked my own GS profile and 5 of the few cites an article of mine has received are from books–some edited, some single-authored. Is it possible that GS has updated this very recently, or (more likely) that I’m a Luddite who can’t parse the citation count?
  - David Samuels on 20 August 2015 at 20.19 EDT
    
    GS has gotten better but it still undercounts books, for reasons beyond me. Type the full title of your book into GB, you’ll get some similar and some different results…
Alex Montgomery on 9 August 2015 at 16.35 EDT

Did your intrepid RAs capture anything else other than year of PhD? There are probably a lot of other factors that contribute to cite counts (and h-indexes), including institutional incentives for publishing. I wonder if you also could hook this into TRIP’s article citation datasets in some useful way; they would also provide a few useful cross-checks.
- Cullen Hendrix on 10 August 2015 at 14.12 EDT
  
  Alex, I think a lot of the institutional incentives stuff is rolled up into the self-selection process; that’s not as useful as being able to directly model it, but I do think it’s somewhat accounted for. I do have current institution and PhD-granting institution, so there is some stuff that could be done with that. As you might imagine, number of current institutions > number of PhD-granting institutions.
Megan MacKenzie on 10 August 2015 at 01.01 EDT

Thanks for this great post- I’ve been wanting to read more about the h-index and some of the pros and cons for the move in social sciences to readily refer to it. One thing I’ve always been unclear on: how does self citation impact one’s h-index? As in, if a scholar cites themselves as much as possible in each of their publications do they skew their h-index? Is this part of the Moneyball game? Of course it is perfectly acceptable to cite oneself on occasion, but I’ve noticed some scholars that seem to manage to include every publication in each subsequent article. Just curious.
- Dan Nexon on 10 August 2015 at 11.40 EDT
  
  GS doesn’t exclude self-citations.
  - Cullen Hendrix on 10 August 2015 at 13.58 EDT
    
    One way to know if this is a problem: I could pull WoS citation metrics for a subset of these scholars and see how highly correlated their scores are with and without self-citations. My prior is that the self-citation bias is real but declining in total citations.
    - David Samuels on 10 August 2015 at 14.24 EDT
      
      My impression from 1) staring at these data so long and 2) editing CPS is that self-citing is overblown as a concern, at the end of the scale that matters, ie the top 10-15% (or less). It’s just not possible to cite your own article dozens or hundreds of times. At CPS we’ve implemented a “no more than 5” self-cites in any initial submission, partly to preserve anonymity as much as possible and partly because, well, that’s plenty. The issue here isn’t really whether doing so does or does not preserve anonymity, but that we rarely have to return a ms for resubmission because of “over-self-citing” – it happens, but <5%, probably more like 2% of the time.
      - Megan MacKenzie on 10 August 2015 at 18.25 EDT
        
        Good to know David- I like the ‘no more than 5 self cites’ rule! If only it were standard. :)
josh busby on 10 August 2015 at 12.44 EDT

One other issue is the some scholars’ most cited pieces may not be peer-reviewed. If you look at my GS page, the second highest item is a 2007 report I wrote for the Council on Foreign Relations. Is that appropriate? I don’t know. I teach in an inter-disciplinary policy school. Given the other problems David Samuels flagged about under-counting of citations in books, the lessons I draw is that the metric may be problematic if we adopt as a signal for tenure, promotion, raises, etc. Ideally, we would have something more appropriate that reflects idiosyncrasies in our field.
- Cullen Hendrix on 10 August 2015 at 13.56 EDT
  
  I think it could be useful as a part of assessment, but not the whole enchilada. What we have now strikes me as unacceptably idiosyncratic. I’m glad people are offering useful potential additions/correctives to the basic framework.
- Cullen Hendrix on 10 August 2015 at 14.00 EDT
  
  In re. the CFR piece: I suppose it depends on what kind of place you work. That these citations (and this kind of work) would be heavily/completely discounted strikes me as absolutely maddening. Peer-reviewed vs. non-peer-reviewed would seem to be a bright line, but in my experience I’ve had easier times getting PR stuff accepted than some policy reports!
- David Samuels on 10 August 2015 at 14.34 EDT
  
  From my research I think your case is fairly rare, may have a lot to do with your work and audience than most political scientist. It’s far more common among economists, where SSRN and esp NBER papers have made journal publishing almost an afterthought. The reason I wrote those 2 pieces was b/c I was tired of sitting in faculty mtngs where Americanists went on and on about “APSR, JOP, AJPS yada yada” without any clue about publishing OR citing patterns in the other subfields. But after all that grubbing around in the data, what did I find? Highly-cited articles are highly-cited no matter what index you use, and the correlation b/w citations an article receives in articles and what it receives in books is, not surprisingly, high. Bottom line: GS does give us a useful proxy for overall impact. It dramatically undercounts. But an article with 3 cites after 10 years on GS is not a high-impact article, and one with 300 is. End of discussion. And although we do spend a lot of time arguing about these things, I don’t think we’ll ever have an unproblematic metric. I like that GS gives total counts, the H-index, and the I10-index – we should have multiple ways of looking at impact, not just one.
Roxanne on 16 August 2015 at 02.06 EDT

Utterly, utterly fascinating – thank you for running the numbers on this, and for doing it in such a thoughtful way. I think an added parameter that would be interesting would be seeing a breakdown by sex of the people who made the list of most cited/most influential. Institutional affiliation would also provide an additional parameter of interest. Thank you again!

Trackbacks/Pingbacks

Weekly Links | Political Violence @ a Glance - […] scholars measure their contribution to the field of International Relations? Cullin Hendrix reviews how the publication citation count came…
Academia isn’t Baseball | Duck of Minerva - […] Hendrix’s guest post is a must read for anyone interested in citation metrics and international-relations scholarship. […]
New Evidence on Gender Bias in IR Syllabi | Duck of Minerva - […] what it can teach us about the decline of IR theory, the value of Google Scholar (a subject of…
Editorial der zib 1/2016: Die Bedeutung der Deutung, bedeutend zu sein, oder: die Impact-Manie – Zeitschrift für Internationale Beziehungen - […] »H-Werte«, die auch deutschsprachige Wissenschaftler_innen locker erreichen können. Vgl. https://www.duckofminerva.com/2015/08/google-scholar-metrics-and-scholarly-productivity-in-international-…; […]

Variable	Obs	Mean	Std. Dev.	Min	Max
GS citations	713	915.2	2804.9	0	40978
ln GS citations	713	4.8	2.2	0	10.6
h-index	713	8.5	8.9	0	73
i10 index	713	10.6	18.3	0	188
ln i10 index	713	1.6	1.3	0	5.2
Most Cited	713	184.9	567.7	0	9429
Ln Most Cited	713	3.4	2.1	0	9.2
Most Cited Solo	713	121.3	361.9	0	4620
Ln Most Cited Solo	713	3.2	1.9	0	8.4
PhD Year	713	2003.5	9.4	1961	2015

PhD Year	Predicted GS Cites	Predicted h-index	Predicted i10
1990	2685.4	17.2	26.9
1991	2510.9	16.6	25.5
1992	2339.8	15.9	24.2
1993	2174.3	15.3	22.9
1994	2012.4	14.7	21.5
1995	1852.2	14.0	20.2
1996	1698.1	13.3	18.9
1997	1549.4	12.7	17.6
1998	1399.8	12.0	16.3
1999	1260.7	11.3	15.0
2000	1132.8	10.7	13.8
2001	1006.5	10.1	12.7
2002	880.4	9.4	11.4
2003	765.2	8.7	10.2
2004	640.6	8.1	9.0
2005	506.3	7.4	7.8
2006	393.7	6.7	6.5
2007	305.5	6.0	5.5
2008	223.3	5.3	4.5
2009	170.8	4.8	3.6
2010	135.4	4.3	3.0
2011	108.9	3.8	2.4
2012	87.0	3.3	1.9
2013	64.9	2.9	1.4
2014	47.2	2.6	1.1
2015	46.2	2.5	1.0

PhD Year	Median GS Cites	Median h-index	Median i10	N
1990	1786.0	18.0	24.0	9
1991	2160.0	19.0	27.0	9
1992	1491.5	19.0	29.5	10
1993	1654.0	18.0	22.5	16
1994	1643.0	15.0	19.0	9
1995	1983.0	16.0	17.0	7
1996	583.5	9.5	9.5	20
1997	396.0	10.0	10.0	15
1998	376.0	10.0	11.0	11
1999	755.0	12.5	14.5	24
2000	701.0	11.0	12.0	19
2001	301.0	9.5	9.5	18
2002	153.5	6.0	4.0	28
2003	220.0	8.0	7.0	28
2004	213.0	7.0	5.0	25
2005	144.0	6.0	4.0	15
2006	105.0	5.0	3.5	38
2007	98.0	5.0	4.0	46
2008	78.0	5.0	2.0	54
2009	76.0	5.0	3.0	29
2010	34.5	3.0	1.0	42
2011	22.0	3.0	1.0	46
2012	21.0	2.0	0.0	41
2013	19.0	2.0	1.0	42
2014	17.0	2.0	0.0	33
2015	8.0	1.0	0.0	17

The Duck of Minerva

Google Scholar Metrics and Scholarly Productivity in International Relations

Amanda Murdie

Amanda Murdie

28 Comments

Trackbacks/Pingbacks

Google Scholar Metrics and Scholarly Productivity in International Relations

Amanda Murdie

share this post

Amanda Murdie

28 Comments

Trackbacks/Pingbacks