My literature review process

After I published my review of the literature on donation matching, some people asked me, basically, how I did it.

Being able to do a thorough review of the academic literature is a pretty useful skill, and there are lots of topics I’d like to see a review of. So I’ve written up some notes about my process in the hope that they inspire you to do some literature reviewing of your own.

Are there parts you’re still wondering about? Crucial steps I missed? Things you think could improve? Let me know in the comments!

Set up your system

It’s important to maintain your set of papers in some sort of organized format, so that you can easily keep track of the big picture, and quickly add new papers to your analysis. For instance, I kept a spreadsheet of all the papers I found on donation matching so that I could just copy-paste some cells to run calculations on new papers.

In addition to a spreadsheet, I also kept PDFs and notes on the papers in Evernote. When I published the analysis, I hosted my own copy of all the papers I relied on, and linked to those in addition to external versions (see the bibliography). I strongly recommend this, since links to papers on other sites may break without your knowledge and make your analysis hard to verify or reproduce.1

If you know a programming language like R or Python and you want to create plots, it might be worth writing the entire analysis using something like knitr. Knitr makes it easy to interweave code output (from R, Python or numerous other languages) with rich text in Markdown or LaTeX format, so you have a single file that stores and explains the entire analysis and can be updated automatically when your code changes. I haven’t written a full lit review in them yet, but I probably will for my next one. For my donation-matching survey analysis, I used a separate Python script that produced a bunch of images as output.

Finally, if your literature review involves code or calculations that you perform yourself, you should make these available for review as well. If you’re willing to learn a new program, I’d recommend using Git and putting everything up open-source on Github. Git and Github are great general-purpose tools for managing different versions and updates to files: it’s easy to make changes, incorporate other people’s suggestions, and see how the analysis has changed over time.2 This makes your analysis much more transparent.

Search for papers

In formal, peer-reviewed meta-analyses, people will pre-specify a set of keywords, search on those keywords in curated databases, read the abstract of every result and decide whether to include it based on that.

Don’t do that. It’s incredibly time-consuming.3

I generally take a less systematic approach. I’ll look through the first few pages of Google Scholar for a bunch of relevant searches, taking note of any papers that look relevant, until I think I’m hitting diminishing returns. Then as I read those papers I’ll take note of any citations that seem like they might also be worth including. This seems to find most or all of the available papers. It’s somewhat more prone to bias, since it’s less well-defined than the keyword-search approach, but I don’t think that’s a huge issue if you’re diligent about it.

However, it does make you more likely to miss an important paper on the first time through. So if you take this approach, it’s important that you use spreadsheets or programs to make it easy to add more papers after your initial search.

Interpret each study

The next step is to extract the important information from each study you’re including. I don’t have super good advice for doing this systematically, but here are some important considerations:

Evaluate reliability

After you have a handle on what each study says, the next (and more difficult) step is to figure out how strongly to believe it.

This is a process you can pay basically endless amounts of attention to, and I can’t get into all of details here. But here are some important points to watch out for:

For more details on assessing study reliability, check out the Cochrane Collaboration handbook of review guidelines. Part 2, chapter 8 is an excellent overview of how to assess studies (particularly randomized trials) for bias. Cochrane’s guidelines are quite strict—maybe a bit too strict in some cases—but it’s a good overview of the different risks to study validity.

Synthesize

Hopefully, at this point you have some idea what “the big picture” is. Try to describe it clearly. It’s kind to your readers if this section can stand alone so that readers can skip the analysis and spot-check it later.

The big picture might be that we don’t have enough evidence to know much for sure! Heck, that was the main conclusion of my donation-matching post. But even still, you can probably draw some conjectures or potential questions for future research.

Examples of literature reviews

This is nowhere near a complete description of what makes a good literature review. You can probably get a better sense by reading some high-quality ones. For instance:

Conclusion

Obviously these notes are incomplete—it would take far longer to get into the weeds of study review or statistical meta-analysis techniques. Nevertheless, I hope they’re at least enough to get started. Go forth and review some literature!

And as I mentioned, please hit the comment section if you’re still wondering about anything, or if you have any suggestions for improvements.


  1. Also, make sure you back up your hosted copies of the papers; I didn’t, and almost lost them when I accidentally erased the Amazon S3 bucket in question. ↩︎

  2. The disadvantage is that Git can be somewhat cumbersome to learn, although in my opinion it’s worth the effort. ↩︎

  3. It also requires access to the databases in question, which is hard if you’re not currently at a university. Google Scholar, by contrast, often has convenient links to PDFs of preprints even when the final paper isn’t open-access. ↩︎

Comments

email me replies

format comments in markdown.

Your comment has been submitted! It should appear here within 30 minutes.
Greg

This is really nice. What are some of the topics you’d like to see reviewed?

You can also email authors, for example to ask about methodological details, if you don’t have enough information to make good comparisons.

email me replies

format comments in markdown.

Your comment has been submitted! It should appear here within 30 minutes.

Ben

@Greg: good question! Here are some questions off the top of my head (I don’t know how feasible they are):

Good point about emailing the authors. I probably should have emailed Dean Karlan to ask about the bizarre randomization failure in that one study in the matching review.

email me replies

format comments in markdown.

Your comment has been submitted! It should appear here within 30 minutes.