
Predicting biomedical document access as a function of past use

by J. C. Goodwin, T. R. Johnson, T. Cohen, J. R. Herskovic, E. V. Bernstam
Journal of the American Medical Informatics Association


Objective: To determine whether past access to biomedical documents can predict future document access.

Materials and methods: The authors used 394 days of query logs (August 1, 2009 to August 29, 2010) from PubMed users in the Texas Medical Center, the largest medical center in the world. They evaluated two document access models based on the work of Anderson and Schooler. The first is based on how frequently a document was accessed. The second is based on both frequency and recency.

Results: The model based only on frequency of past access was highly correlated with the empirical data (R² = 0.932), whereas the model based on frequency and recency had a much lower correlation (R² = 0.668).

Discussion: The frequency-only model accurately predicted whether a document will be accessed based on past use. Modeling accesses as a function of frequency requires storing only the number of accesses and the creation date for each document. The model therefore has low storage overhead and is computationally efficient, making it scalable to large corpora such as MEDLINE.

Conclusion: It is feasible to accurately model the probability of a document being accessed in the future based on past accesses.
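The abstract notes that a frequency-only model needs just an access count and a creation date per document, and that model fit is reported as R². The following is a minimal sketch of that bookkeeping, not the authors' implementation. The names DocStats, access_rate, and r_squared are hypothetical, and treating frequency as accesses per day since creation is an assumption; the abstract does not give the exact functional form of either model.

```python
from __future__ import annotations

from dataclasses import dataclass
from datetime import date


@dataclass
class DocStats:
    """Hypothetical per-document record: the only state a frequency-only model stores."""
    access_count: int
    created: date


def access_rate(doc: DocStats, today: date) -> float:
    """Past accesses per day since creation (an assumed frequency feature)."""
    age_days = max((today - doc.created).days, 1)  # clamp to 1 to avoid dividing by zero
    return doc.access_count / age_days


def r_squared(observed: list[float], predicted: list[float]) -> float:
    """Coefficient of determination between observed and predicted values."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot
```

Under these assumptions, a document created on August 1, 2009 with 10 logged accesses by August 11, 2009 has an access rate of 1.0 per day, and r_squared compares binned empirical access probabilities against whatever curve is fitted to such rates.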


Readership Statistics

11 Readers on Mendeley
by Academic Status
27% Researcher (at an Academic Institution)
18% Ph.D. Student
9% Other Professional
by Country
45% United States
9% Spain
9% Mexico
