Distribution of “characteristic” terms in medline literatures

3Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Given the occurrence frequency of any term within any set of articles within MEDLINE, we define “characteristic” terms as words and phrases that occur in that literature more frequently than expected by chance (at p < 0.001 or better). In this report, we studied how the cut-off criterion varied as a function of literature size and term frequency in MEDLINE as a whole, and have compared the distribution of characteristic terms within a number of journal-defined, affiliation-defined and random literatures. We also investigated how the characteristic terms were distributed among MEDLINE titles, abstracts, and last sentence of abstracts, including “regularized” terms that appear both in the title and abstract of the same paper for at least one paper in the literature. For a set of 10 disciplinary journals, the characteristic terms comprised 18% of the total terms on average. Characteristic terms are utilized in several of our web-based services (Anne O’Tate and Arrowsmith), and should be useful for a variety of other information-processing tasks designed to improve text mining in MEDLINE.

References Powered by Scopus

LexRank: Graph-based lexical centrality as salience in text summarization

2445Citations
N/AReaders
Get full text

Author name disambiguation in MEDLINE

218Citations
N/AReaders
Get full text

Automatic extraction of keywords from scientific text: Application to the knowledge domain of protein families

169Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Anne O’Tate: Value-added PubMed search engine for analysis and text mining

4Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Smalheiser, N. R., Zhou, W., & Torvik, V. I. (2011). Distribution of “characteristic” terms in medline literatures. Information (Switzerland), 2(2), 266–276. https://doi.org/10.3390/info2020266

Readers over time

‘13‘14‘15‘16‘2000.751.52.253

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

50%

Researcher 2

33%

Professor / Associate Prof. 1

17%

Readers' Discipline

Tooltip

Computer Science 4

67%

Medicine and Dentistry 1

17%

Social Sciences 1

17%

Article Metrics

Tooltip
Mentions
References: 1

Save time finding and organizing research with Mendeley

Sign up for free
0