Sign up & Download
Sign in


Papers in this group

1 - 20 of 869
  1. OBJECTIVES: To provide an overview and tutorial of natural language processing (NLP) and modern NLP-system design. TARGET AUDIENCE: This tutorial targets the medical informatics generalist who has limited acquaintance with the principles behind NLP…
  2. This paper proposes two new graph-based query strategies for active learning in a framework that is convenient to combine with semi-supervised learning based on label propagation. The first strategy selects instances independently to maximize the…
  3. Nowadays, the increasing popularity of smartphone devices equipped with multiple sensors (e.g. accelerometer, gy- roscope, etc) have opened several possibilities to the deployment of novel and exciting context-aware mobile applications. In this…
  4. We depend on talks to communicate our work, and we spend much of our time as audience members in talks. However, few scientists are taught the well-established principles of giving good talks. Here, I describe how to prepare, present, and answer…
  5. The process of training new PhDs is complex and has significant dropout rates associated with loss of financial and time investments by the student, mentor, and program. One approach to improve graduate education is to make explicit the skills…
  6. Because of the complexity of gene-phenotype relationships machine learning approaches have considerable appeal as a strategy for modelling interactions. A number of such methods have been developed and applied in recent years with some modest…
  7. In the Life Sciences 'omics' data is increasingly generated by different high-throughput technologies. Often only the integration of these data allows uncovering biological insights that can be experimentally validated or mechanistically modelled,…
  8. This paper provides an elementary introduction to the probabilistic automaton (PA) model, which has been developed by Segala. We describe how distributed systems with discrete probabilities can be modeled and analyzed by means of PAs. We explain how…
  9. We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the…