Using the DIFF Command for Natural Language Processing

  • Masaki Murata
  • Hitoshi Isahara

Diff is a software program that detects differences between two data sets, and it is useful in natural language processing. This paper presents several example applications of diff: detecting differences between two data sets, extracting rewriting rules, merging two data sets, and optimally matching two data sets. Because diff comes with any standard UNIX system, it is readily available and very easy to use. Our studies showed that diff is a practical tool for natural language processing research.
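
To make the idea of extracting rewriting rules more concrete, here is a minimal sketch. It is not the authors' procedure: it uses Python's standard `difflib.SequenceMatcher` as a stand-in for the UNIX diff command, it compares word sequences rather than lines, and the function name `extract_rewrite_pairs` and the example sentences are invented for illustration.

```python
import difflib


def extract_rewrite_pairs(source_words, target_words):
    """Return (before, after) pairs for every span where the two word lists differ.

    Hypothetical helper: an empty 'before' means an insertion,
    an empty 'after' means a deletion.
    """
    matcher = difflib.SequenceMatcher(a=source_words, b=target_words, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            before = " ".join(source_words[i1:i2])
            after = " ".join(target_words[j1:j2])
            pairs.append((before, after))
    return pairs


# Made-up example data (an assumption, not taken from the paper).
spoken = "well I think that it is like very useful".split()
written = "I think that it is very useful".split()

for before, after in extract_rewrite_pairs(spoken, written):
    print(repr(before), "->", repr(after))
```

Each non-matching span becomes a candidate rewriting rule. Running the same comparison over many aligned sentence pairs and counting the recurring pairs would be one plausible way to collect rules at scale.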
