paper graph

Visualised network of sources

Inspiration

We are three PhD students. Thesis and paper writing is a part of academia, but it kinda sucks. Literature review especially. We built 'paper graph' to make it literature review easier and better.

What it does

paper graph takes in key papers from your literature review. It visually relates the forward and backward citations of these papers. A network is build, showing which papers you have are most important, as well as suggesting papers your colleagues think are important, but you haven't looked at yet.

Scholar search isn't very good. Jargon is common, and key words can mean very different things in different fields. We build paper graph to automate how people already use citation to explore the literature.

How we built it

paper graph is mainly built in python.

We use the Menderly API to get DOI identifiers from the pdf files of papers you have. We search PubMed with these ID numbers, which gives forward and backward citations. We store these citations in a postgresql database.

Beaker is used to visualise the network. Sources from within your existing collection are identified against papers you don't have yet. Clusters show more important works. Nodes of the network can be clicked, which links to the abstract of the article on PubMed.

Challenges we ran into

Data access was hard - Google Scholar locked us out for too many GET calls! Source identifiers are different on different sources - Google Scholar, Menderly, and PubMed. A fair bit of our work was translating between the various ID numbers.

We spent quite a bit of time trying to scrape pdf files themselves for metadata before finding a much easier solution through Menderly API!

Accomplishments that we're proud of

We got it done!

What we learned

Learned a lot about HTTP requests, using Beaker and various APIs (or not APIs).

Spend more time looking for a good API!

What's next for paper graph

We'll use it in our theses, and hopefully develop it further!

Built With

beaker
d3.js
javascript
menderly
pubmed
python

Submitted to

Hack Cambridge

Created by

I worked on the back end code that allows paper graph to identify a target paper in a chosen database (Scopus, PubMed, Elsevier) using metadata as input. This was my first time time using python (love it!) and seeing Flask in action. I had previously worked on building webapps but only now do I realise how little I knew about the HTTP request cycle. This was a great and fun learning experience.

Kyata Chibalabala
Liam Atkinson
Graeham Douglas

Updates

Graeham Douglas started this project — Jan 31, 2016 06:58 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.