Although there is a strong push within the academic community to make research freely accessible (“open access”) to everyone, many research papers are unfortunately still published and available “behind the paywall”. That means that the publisher of the journal, book or a conference proceedings requires someone to subscribe the content. Typically that payer is a university library, who then can make the content accessible to the university students and employees.

But making use of this subscription can be complicated. Even if the seeker for papers – such as a student or a researcher – would have an affiliation with a university, it is not self-evident how one can get past the paywall and download the paper.

This post presents ways to get through the paywall, but requires that you have a user account in a university.

The focus of this post builds on an assumption that the seeker’s main literature search tool is Google Scholar, due to its simple use and excellent coverage of different research papers.

Google Scholar with basic internet connection: limited access to “full texts”

It is important to notice that Google Scholar provides access to articles differently depending on the type of internet connection. The difference shows in the links that Scholar displays in its search results.

When a user searches papers with Google Scholar with an ordinary internet connection, Google Scholar helps find papers, but cannot always provide the user with the “full text”. By full text, publishing companies mean the PDF that contains the article in its entirety. If access is not full text, then the user can only see the title, abstract, references, and maybe the first page of the paper.

It is sometimes possible to get an access to a full text even with this internet setup. This happens if:

The text has been published as Open Access (OA) or with a permission that the authors are allowed to upload a copy of the text to their personal public repository of papers.
Someone has uploaded the text somewhere in the Internet even if it is is not OA, thus possibly breaking the copyright of that content. Google has then found that paper.
The paper’s preprint or so-called “accepted manuscript” version is available from the authors of the paper. A pre-print is often an earlier version of the paper: similar or even identical in its content with the final one, but not copy-edited to the final layout, and therefore lacks correct page numbers, and may have minor typing errors.

The screenshot below shows an example of what a Google Scholar search can produce as an outcome.

From the screenshot, different kinds of links can be seen. They provide different levels of access to the content:

[PDF]: full text can be downloaded from the linked web page
[HTML]: full text can be usually downloaded, but not always
Getit@Grifols: this is a link whose purpose is not clear to me. No full texts available, anyway.
The link of the paper title: takes the user to the publisher’s website. Full text may or may not be downloadable from it.

As a base rule, if the text does not have any links in the right-side column, the full text PDF is not available from any source.

Google Scholar with university’s VPN connection

The access to papers can dramatically improve if you tunnel your Internet connection through a university’s VPN service. With VPN, all the Internet traffic from a computer will travel (i.e., is “tunneled”) through a designated server. Both Google Scholar and the publishing companies then recognize that the user is connecting them from an Internet address that has a right to access also subscribed contents.

It therefore makes sense to use VPN in using Google Scholar. Here is how the same search results look like with VPN. Note that the two previously inaccessible papers are now downloadable via a “sfx@Aalto” link. Sadly, Klein & Weitzenfeld’s text in Educational Psychologist still remains unaccessible: my university does not have a subscription to its contents:

I cannot be sure, but I believe all the universities provide a similar service as Aalto University does, and therefore VPN opens doors to papers (for those who have a university account, of course). Here are the instructions for Aalto users for how you install the VPN client on your computer.

Accessing papers without VPN

It is not necessary to use VPN to access texts, however. An alternative is to navigate to a university library’s search interface and download the paper through it. Every library works slightly differently in service its users. At Aalto University, it is possible to enter e.g., the paper’s title, and the service tries to find a matching piece of content from its digitally subscribed sources. Aalto University’s article search interface is here.

A good idea is to use the text’s DOI (digital object identifier) as the search term instead of a paper’s name, because otherwise you may get lots of unnecessary search hits too. DOI uniquely identifies the paper, and can be always found somewhere from the publisher’s website.

When you find the desired search result, click on the links that promises to take you to the electronic full text. At that stage you will need to provide your user name and password, to prove that you are entitled to access the content.

Another Aalto-specific tip is to use the list of digital paper libraries in libproxy.aalto.fi. If you know the publisher of a paper your are interested in, you can go to that library via a link in the libproxy page. As long as you browse papers in that library, you are surfing within the paywall, and will be able to download full texts.

Paper request from ResearchGate

As the last resort, you may go to ResearchGate or Academia.edu. These are services where researchers can upload their works and create profile pages. If a researcher has uploaded the paper to the service, ResearchGate/Academia.edu provides a feature where you can ask a researcher to send a copy of the paper to you privately.

A word of advice: It is best to use this feature only after you have tried the other possibilities above and they have failed. By my personal experience, it is always irritating to react to ResearchGate’s paper requests if the requested paper is also available as Open Access. A request in such a situation only shows that the requester has not spent even a minimal bit of effort to find it. I get paper requests from ResearchGate approximately once a week. Much more cited researchers therefore probably get several requests each day.

Final words

As can be seen, retrieval of a text can require quite a bit of work. It is better to minimize the amount of work that you need to spend in downloading. Therefore, always download and store the paper on your computer! Also make sure that you can easily find that paper later on from your computer. For example, use a systematic file-naming principle (e.g., authorname year paper title.pdf) for all the papers, or start using reference manager programs such as Mendeley or Zotero. That will save you time next time, and lets you annotate the texts that you read with your own observations.

Acknowledgments

Thanks to Markku Reunanen for informing about the digital library listing at libproxy.aalto.fi.

Introduction

Knowing and following what others have written about (and around) your own research topic is the basic requirement for any academic project. But how should this be done? Some things are obvious: this task involves, at least 1) searching for possible texts to read; 2) scanning them to filter the good ones out from the mass of less relevant ones; 3) reading the most promising ones; and 4) building syntheses about this mass of texts.

The list is rather simple, but it is clear a lot of complexity underlies these steps. Some of the questions are:

What tools should be used for finding texts?
What types of texts are out there?
What indicators reveal what texts are more credible than others?
How these texts can be downloaded?
How should the article collection be maintained?
How can the synthesis be generated out of a collection of texts?

This blog post focuses only on questions 1–3 and a bit on question 4. I have answered to question 4 also separately in a different post. The others may be looked at later.

1. What tools should be used for finding texts?

At least the following ways exist for finding literature:

1. Standard Google search helps you find a very eclectic mix of texts that have a varying quality and underlying intentions. Very few of the papers found this way are academic papers. Instead they can be, for example, memos written by thinktanks and lobbyist groups, governmental bodies’ reports, press releases, essays written by students. Occasionally, also academic papers can be found with standard Google search, but what one finds is very unpredictable.

While what one finds using this method may be useful, it is usually best to regard contents found this way more as “data” rather than as research knowledge.

2. Google Scholar. This is the best tool for exploratory search for literature: for those situations where you want to find out “what is out there”. Using Google Scholar is like using standard search, but the results are different: they are from academic sources, such as journals, conferences, and books. The search results also contain additional information that help you interpret which papers are better to investigate in more detail

Because Google Scholar is a great tool, its use is covered in more detail below.

3. Snowballing. Academic papers always contain a section called References that lists all the other research that has been cited. By reading the paper and finding out what earlier works are cited, and what the writers say about them, it is possible to get to the sources of knowledge. This helps you find the “must-reads” of the research topic. The problem is that snowballing works only backwards in time: it does not help you find the most recent research.

4. Content alerts. It is possible to ask journals and conferences to send email to you every time they publish a new issue. This is a good way for staying up to date on the most recent research. The problem is that this brings a lot of email to your mailbox, and every issue does not contain articles that you would be interested about. Google Scholar, however, lets the user create keyword-based alerts: send you email every time it finds new research that matches given keywords. You can turn on this feature by clicking on Create alert button in Google Scholar in the left side of the screen (see the image below).

For a PhD students and researchers who need to keep themselves up to date about a research area over a long period of time, this is an essential feature to use.

5. Databases. EBSCOHost, Proquest, ABInform, IEEExplore, Scopus, ACM Digital Library and other databases are great for systematic literature reviews when you know exactly what keywords to use and what journals and conferences to include in your search. But because each database only covers certain journals/conferences, and usually disregards books completely, they are not optimal for exploratory search for knowledge. For that, Google Scholar is much better and nicer to use too.

2. What types of texts are out there?

Given that I recommended Google Scholar as the primary tool for searching texts, I will focus on its use from now on more than on the others.

Academic peer-reviewed papers

In the above, the main characteristic that I mentioned as the difference between the standard Google search and Google Scholar search was that the latter one finds only “academic papers” instead of just any texts or search hits. It is therefore important to define what is unique in academic papers.

The main characteristic of an academic paper is that it is “peer reviewed”. This means that there is a particular editorial process (“review process”) that the paper has undergone before it has been published in a journal or a conference. In this process, It has been examined by a jury of researchers and the authors have had to improve the paper until it has met the necessary quality requirements. Without an exception, the process includes at least one cycle of improvements: the authors have first sent (“submitted”) their paper for review, the members of the jury have evaluated it by writing statements about it, and the authors have been asked improve the paper. Alternatively, the authors have received a “reject” meaning that this process is terminated, and they have to find a different journal/conference that may be willing to publish the paper. They have to start the process again with that other outlet. If the paper, however, was considered promising enough, the second cycle starts when the improved version is received from the authors. The reviewers will evaluate whether the changes are sufficient, and provide further comments. In conferences, one cycle is common; in journals at least two cycles is the norm. In every stage, the possibility of a reject is always possible.

Usually the review process is “double-blind”: the authors do not know who will read their paper, and the members of the jury (i.e., “reviewers”) do not know whose paper they are reading. The communication between the authors and the reviewers is handled by an editor who is a senior researcher in the field, and is responsible for keeping up high standard of this process. The blindness increases the neutrality of the process: even famous academics’ papers can be rejected, and the reviewers do not need to face the consequences of furious authors who are angry at the rejection of their paper. Most papers are rejected; good conferences typically reject 70-75% of the submissions, for example. Top journals reject a larger percentage than that.

The heaviness of the whole process makes paper publishing a slow business. To publish a paper in a good journal often takes at least 2 years, with 3–4 cycles of improvement. But it ensures much better quality for the content, compared to papers that have not had a review process. Thus puts the academic papers apart from other materials that standard Google search can offer.

Books and book chapters

Books are another common type of academic texts. They exist in two kinds: full books that have been written by the same group of people from the beginning to the end, and edited collections where different chapters have been written by different authors. Edited collections have editors who have gathered the texts together and have usually had at least some form of peer review process in the book chapters’ preparation.

Other sources of academic-like texts

There are also semi-academic papers: ones that have been written by researchers, but which have not undergone the review process. These include research institutions’s “white papers” and technical reports, as well as texts that accompany presentations given in research seminars and workshops. These texts are usually published only in a website, instead of in a journal or a conference proceedings.

There are also papers that have been submitted for a review, but which have also been saved in a public repository such as Arxiv, Biorxiv, Citeseer or SSRN. Although doing so breaks the blind review policy, in some fields of science this is accepted and widely used practice. One of the reasons for this practice is the competition within the scientific field: researchers compete for being the first ones to make a certain finding. They do not want to wait the 2 years in the review process before they can tell about the finding. They may also fear that an anonymous reviewer steals their idea, replicates the study, and publishes it as their own. Public archiving protects authors from that.

Finally, papers are also available from ResearchGate and Academia.edu. These are self-archiving repositories where researchers sometimes upload copies of their published works, or where they just publish their research, thereby bypassing the review process. The quality of the content in ResearchGate and Academia.edu varies wildly, and needs to be verified: has the paper been published somewhere, or has it been only uploaded here?

3. What indicators reveal what texts are more credible than others?

It so far seem that just using Google Scholar ensures that every text has the required quality and can be used as a good piece of literature. The truth is not that simple: there are conferences and journals with different levels of quality. Some papers, even if they are per-reviewed, have low quality. Using just any source that one finds may lead to 1) misleading directions; 2) unnecessary amount of work.

There are three simple indicators for finding out which paper is more worthwhile to read than others:

The exact topic of the paper

This is the simple one: it is better to read papers whose titles and abstracts have a good fit with the information that one is looking for. Google Scholar presents the titles of the papers very clearly. In addition, the abstract of the paper can be inspected by clicking at the title. It either shows a popup window or takes the user to the publisher’s website.

The number of citations

Good papers end up usually cited more often than others by other researchers. The number of citations is the total count of all the other papers that cite a given paper. In the following screenshot, for example, Google Scholar tells that the 4th search result has been cited 2325 times by other researchers while other papers have been cited much less. This tells that Dorst and Cross’s paper published in Design Studies is probably more appreciated by researchers than other papers, when it comes to “framing in design process” as the topic.

Screenshot of a Google Scholar search result

Example of a search result in Google Scholar.

Citation count is a good indicator for choosing which papers are “must reads” and give the most relevant information.

The quality of the journal or conference

Although the citation count is a good indicator, it works poorly especially in the evaluation of the importance of very recent research. Recent publications have not had a chance to accumulate citations yet, and seem therefore less relevant. In addition, sometimes there are no publications that would be highly cited, because the research area that the user is interested about is very particular and not much researched.

Then the user should look at the quality of the journal or conference that has published the research. For most journals, it is possible to find what its impact factor is. It is a value that is computed by the number of citations that the papers in the journal gather on average over time. Clarivate Analytics’ JCR (Journal Citation Records) is the most often used impact factor service. It was earlier known as Thomson Reuters. JCR is not publicly accessible: one needs to access it through an university library. Aalto University users can click here to access JCR.

The impact factors for journals range from 0 to several dozens. For example, in the top, impact factors for New England Journal of Medicine, Lancet, Nature and Science are currently 75, 60, 43 and 42, respectively. The problem with the impact factors is that in other fields the best journal may have much lower impact factors. In HCI, Human-Computer Interaction has the highest impact factor, which is currently 4.2. In design research, Design Studies is the leading journal, and its impact factor is 2.8. Design Issues – another good one – is not listed at all, surprisingly. These differences do not mean that design or HCI journals would be of poorer quality than natural science journals – fields cannot be compared based on their journals’ impact factores. Many factors affect the impact factor, including the publication volume in the field, peer competition, the status of conferences or books as reputable publishing outlets, and the centeredness of the field around only a handful of journals, for example.

All this just means that impact factors are meaningful only if one knows already what the range the values is in a given field. In addition, Clarivate Analytics’ JCR does not provide impact factors for conferences, which makes its relevance to HCI much less meaningful.

A better approach, at least in Finland, is to use Finland’s own academic ranking system called “JUFO” – short for “Julkaisufoorumi”. It ranks every journal and conference using 4 levels:

3 = the top journal/conference in its own field
2 = a really good journal or conference
1 = other journals and conferences that have a peer review process
0 = journals and conferences that are known to exist but which cannot prove that they follow the sufficient academic review standards

Generally speaking, any paper published in journal or conference of level 2 or 3 has content that can be considered seriously. Many level 1 outlets are also really good, but there the quality varies a lot. Level 0 conferences and journals should not be used as references. JUFO can be accessed here.

Summary

When you search for literature, you can use the following process:

Try different kinds of search terms in Google Scholar. Often you do not manage to use the best search terms at the first attempt.
When you seem to be getting promising results, look at the titles and the citation counts: they tell which papers are 1) best matches with your interests and 2) most valued by other researchers.
If all citation counts are low, look at the outlets: which conferences and journals have published these works? Prioritise ones whose JUFO ratings are 2 or 3. Consider also ones that have a JUFO 1 rating.
Download every promising paper on your computer.

4. How these texts can be downloaded?

The last step above involves a challenge that will be addressed in the blog post: a vast majority of academic papers is not freely available. Instead they are available from publishers who sell them to universities with a subscription fee. To see, download and read them, one needs to use a university’s authenticated Internet connection.

I have written about this separately too, but my quick advices are to 1) look for links that Google Scholar marks with [PDF] – those are freely accessible papers; 2) tunnel your internet traffic through a university’s VPN service. That makes the publishers open most of the doors for you. Then you can find links with “sfx” in them – they are contents that your university has subscribed; 3) use your university’s article search service: copy the paper title and paste it to the university’s search engine. If the paper can be accessed, you get a link where you can download the paper. Here is the link to Aalto university’s search interface.

See how the same search result page as above has changed when I have used VPN:

Screenshot of Google Scholar's results when using VPN connection

Read also my other blog post to find out how to access and download articles when they are not openly accessible.

Writing about Design

Principles and tips for design-oriented research

Monthly Archives: December 2020

How to get access to articles that are not Open Access