Programming Scala

  • Abbott M
  • Fisher M
  • Ahmed E
 et al. 
  • 11

    Readers

    Mendeley users who have this article in their library.
  • N/A

    Citations

    Citations of this article.

Abstract

Disambiguating named entities in natural- language text maps mentions of ambiguous names onto canonical entities like people or places, registered in a knowledge base such as DBpedia or YAGO. This paper presents a ro- bust method for collective disambiguation, by harnessing context from knowledge bases and using a new form of coherence graph. It unifies prior approaches into a comprehensive frame- work that combines three measures: the prior probability of an entity being mentioned, the similarity between the contexts of a mention and a candidate entity, as well as the coherence among candidate entities for all mentions to- gether. The method builds a weighted graph of mentions and candidate entities, and computes a dense subgraph that approximates the best joint mention-entity mapping. Experiments show that the new method significantly outper- forms prior methods in terms of accuracy, with robust behavior across a variety of inputs.

Author-supplied keywords

  • 1st
  • 2011
  • 2012
  • 2013
  • 2nd
  • 3 information storage and
  • Abstracting and Indexing as Topic
  • Abstracting and Indexing as Topic: methods
  • Answers
  • Automatic Data Processing
  • Automatic Data Processing: methods
  • Collaborative filtering
  • Common
  • Computer Algorithms
  • Content-based filtering
  • Cookbook
  • Data Structures
  • Evidence-Based Medicine
  • Folksonomies
  • Hyperlink
  • Information Storage and Retrieval
  • Information Storage and Retrieval: methods
  • Information scent
  • Knowledge Extraction
  • Language
  • Machine Learning Applications Track
  • NLTK
  • Natural
  • Natural Language Processing
  • Navigat
  • Network Security
  • Ontologies
  • Practical
  • Problems
  • Processing
  • Python
  • Quick
  • Recipes
  • Recommender systems
  • Semantics
  • Social bookmarking
  • Techniques
  • Text
  • Web Programming
  • Web-navigation
  • Wikipedia
  • WordNet
  • a given text will
  • a limited vocabulary problem
  • abstract types
  • accepted
  • agdistis gerber group_aksw hellmann n3 nlp2rdf_pub
  • analysis
  • automatic aid
  • bases
  • bayesian learning
  • be mapped onto their
  • believe is the topic
  • block chain definition
  • blockchain definition
  • blockchain explained
  • boilerplate removal
  • by optimizing the corresponding
  • category
  • classes
  • classification
  • cloud infrastructures
  • cluster labeling
  • co-
  • cognitive models
  • combining labeled and unlabeled
  • complex systems
  • components
  • concepts
  • concepts nodes which we
  • corresponding concepts in the
  • crawling
  • data
  • data mining
  • december 28
  • december 5
  • disributed ledgers
  • distributed
  • distributed ledger definition
  • distributed systems
  • document categorization
  • dynamic
  • dynamic analyses
  • e-mail classification into folders
  • e-mail spam
  • email mining
  • event counters
  • event tracing
  • expectation-maximization
  • externally visible
  • filtering
  • fingerprint
  • foursquare
  • full-text extrac-
  • geocoding
  • h
  • hamming distance
  • how does blockchain work
  • however
  • human mobility
  • hyperlink
  • icle
  • information extraction
  • information retrieval
  • information scent
  • information storage and retrieval
  • integrating supervised and unsupervised
  • irlbot
  • knowledge
  • knowledge bases
  • labeled
  • large-scale
  • learn about blockchain
  • learn to rank
  • learning
  • learning-based
  • location data
  • log
  • machine learning
  • march 15
  • march 21
  • methods
  • mixins
  • mobile devices
  • model
  • multiple classifiers
  • natural language processing
  • navigation support
  • near-duplicate
  • october 15
  • october 21
  • of the target text
  • ontology
  • performance analysis
  • performance monitoring
  • play
  • problem diagnosis
  • random forest
  • relational learning
  • retrieval
  • revised
  • sampling
  • scala
  • search
  • semi-supervised learning
  • september 23
  • similarity
  • single node among the
  • sketch
  • spam
  • spam e-mail filtering
  • spatial search
  • spatiotemporal models
  • static tracing
  • submitted
  • template detection
  • text classi cation
  • text classification
  • text cleaning
  • text mining
  • tion
  • topic identification
  • tracing
  • traditional methods
  • training
  • unsolicited bulk messages
  • we will pick a
  • web crawl
  • web document
  • web document modeling
  • web spider
  • web-navigation
  • what is blockchain
  • which are extracted from
  • workflow monitoring
  • world wide web
  • www.it-ebooks.info

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

Authors

  • ML Abbott

  • MT Fisher

  • Ejaz Ahmed

  • Abdullah Gani

  • Mehdi Sookhak

  • Siti Hafizah Ab Hamid

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free