Last week, the arXiv received a three-year, $883,000 grant from the National Science Foundation, thanks to federal stimulus money from the American Recovery and Reinvestment Act (ARRA).
According to the grant description, the project “proposes to investigate and implement a variety of tools for enhancing the very widely used and popular Arxiv.org infrastructure, based on information filters for assisted service discovery and selection, text-mining, information genealogy, automated classification and identification of composite resources, data-mining, usage analyses, matching and ranking heuristics, support for next-generation document formats, and semantic markup.”
In 2001, Paul Ginsparg, the creator of the e-print repository and principal investigator of the grant, brought the arXiv with him to Cornell University. Since then, the arXiv has been managed and maintained by the Cornell University Library.
The grant will fund positions for two graduate students and one half-time programmer. In an interview with the Cornell Chronicle, Ginsparg explained why the arXiv needs such improvements:
“Academic publishing has lagged behind the commercial Internet in providing interactive enhancements that today’s students take for granted. Configuring research communications infrastructure for the next generation of researchers requires getting into the heads of near-term future researchers — undergrads and grad students — coming of age in the Google/Facebook/Twitter era.”