Skip Navigation Bar
National Library of Medicine Technical BulletinNational Library of Medicine Technical Bulletin

Table of Contents: 2015 NOVEMBER–DECEMBER No. 407

Previous Next


NIH Manuscript Collection Optimized for Text-Mining and More

NIH Manuscript Collection Optimized for Text-Mining and More. NLM Tech Bull. 2015 Nov-Dec;(407):b8.

2015 December 04 [posted]

[Editor's Note: This is a reprint of an announcement from the NIH Extramural Nexus. To automatically receive news, updates, and blog posts on extramural grant policies, processes, events, and resources please see the subscribe page.

NIH-supported scientists have made over 300,000 author manuscripts available on PubMed Central (PMC) since 2008. Now, NIH is making these papers accessible to the public in a format that will allow robust text analyses.

You can download the entire PMC collection of NIH-supported author manuscripts as a package in either XML or plain text formats. The collection will encompass all NIH manuscripts posted to PMC since July 2008. While the public can access the articles’ full text and accompanying figures, tables, and multimedia on the PMC Web site, the newly available article packages include full text only, in a form that facilitates text-mining.

We developed this resource to increase the impact of NIH funding. Through this collection, scientists will be able to analyze these manuscripts, further apply the findings of NIH research, and generate new discoveries.

For more information visit the PMC author manuscript collection Web site.

NLM Technical Bulletin National Library of Medicine National Institutes of Health