Over on this link, you’ll find a data-driven document (D3 FTW!) showing collaboration between the most authorship-intensive institutions in astronomy. The document is a chord diagram showing the strength of collaboration between research centres, based on co-authorship of papers. I’ve included some screenshots here to give you the idea – the one above is for worldwide institutions […]Read More Visualising Collaboration in Astronomy
A couple of weeks ago I began to geocode the database of astronomical research I scraped from NASA ADS during .Astronomy 4. This database consists of all the published astronomical research in five major journals (almost 250,000 papers going back decades, from MNRAS, ApJ, AJ, A&A and PASP) up to July 2012. You can read […]Read More Mapping Collaboration in Astronomy
I (or rather my computer) spent most of this morning geocoding the database of astronomical papers that I scraped from NASA ADS a while back. I’ve got about a quarter of a million papers, covering several of the major astronomical journals (MNRAS, ApJ, A&A, PASP and AJ) back to their first publications. There are 7 […]Read More Geocoding the Literature
If you’ve been following the recent series of posts about my data mining, then a) I apoligise and b) it just got better! The short story is that research in astrophysics is generally made available online and is entirely available, in digital form, all the way back to the begining of the refereed jounrals on […]Read More Authorship in Astronomy
Just for fun, I took a list of molecules you can find in space and made a word matrix from it. The result shows the relationship between molecular species and their occurrence in the astronomical literature. There is a nice cluster of Hydrogen-bearing molecules that seem to correlate well, same for Carbon. I don’t even know […]Read More Astrochemistry Word Matrix
I have been exploring the terms used in the astronomical literature (see previous post), and have turned my attention to terms that seem to correlate with each other in astronomy publications. I thought it would at least be interesting to see how often one word is mention alongside another. To do this I take terms […]Read More More Astronomy Data Mining: It’s Word-Matrix Time!
At .Astronomy 4 in Heidelberg, I began hacking on some natural language processing of the astronomical literature as part of my Hack Day project with Sarah Kendrew and Karen Masters. It began as a version of BrainScanr for astronomy – which it can still become – however it also provides an interesting database to explore, […]Read More Mining the Astronomical Literature