Comparable English-Russian book review corpora for sentiment analysis
roceedings of the 1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (2010)
Available from
Taras Zagibalov's profile on Mendeley.
or
Abstract
We present newly-produced comparable corpora of book reviews in English and Russian. The corpora are comparable in terms of domain, style and size. We are using them for cross-lingual experiments in document-level sentiment classication. Quantitative analyses of the corpora and the language dierences they exhibit highlight a number of issues that must be considered when developing systems for automatic sentiment classication. We also present experiments with a sentiment classication system applied to the corpora. The results suggest that dierences in the way that sentiment is expressed in the two languages lead to large dierence in sentiment classication accuracy.
Author-supplied keywords
Available from
Taras Zagibalov's profile on Mendeley.
Page 1
Comparable English-Russian book review corpora for sentiment analysis
Comparable English-Russian
Book Review Corpora for Sentiment Analysis
Taras Zagibalov 1 and Katerina Belyatskaya2 and John Carroll 3
Abstract. We present newly-produced comparable corpora
of book reviews in English and Russian. The corpora are
comparable in terms of domain, style and size. We are using
them for cross-lingual experiments in document-level senti-
ment classication. Quantitative analyses of the corpora and
the language dierences they exhibit highlight a number of is-
sues that must be considered when developing systems for au-
tomatic sentiment classication. We also present experiments
with a sentiment classication system applied to the corpora.
The results suggest that dierences in the way that sentiment
is expressed in the two languages lead to large dierence in
sentiment classication accuracy.
1 INTRODUCTION
Automatic classication of document sentiment (and more
generally extraction of opinion from text) has recently at-
tracted a lot of interest. One of the main reasons for this
is the importance of such information to companies, other
organizations, and individuals. Applications include market-
ing research tools that help a company analyse market or
media reaction towards their brands, products or services,
or search engines that help potential purchasers make an in-
formed choice about a product they are considering buying.
Most extant sentiment classication systems use ap-
proaches based on supervised machine learning, which require
substantial manually-produced or -curated resources includ-
ing texts annotated at the document level and below, senti-
ment term dictionaries and thesauri, and some level of lan-
guage analysis.
There are a number of publicly available sentiment-
annotated corpora, such as MPQA [15], and Pang and Lee's
Movie Review corpus [8]. However, most of these corpora con-
sist just of English text. As for other languages, we are aware
of only one publicly available corpus, of Chinese product re-
views [20]. There are other corpora designed for cross-lingual
evaluations, but these seem not to be publicly available (for
example the NTCIR MOAT corpora of English, Japanese and
Chinese [12]).
As part of an on-going research eort in sentiment analy-
sis, we have designed and built comparable corpora of book
reviews in English and Russian, which we are making pub-
licly available, in the expectation that they will contribute to
1 University of Sussex, email: T.Zagibalovl@sussex.ac.uk
2 Siberian Federal University, email: e.o.belyatskaya@gmail.com
3 University of Sussex, email: J.A.Carroll@sussex.ac.uk
research in cross-lingual sentiment processing4. The Russian
corpus is probably the rst sentiment-annotated resource in
that language.
In this paper, as well as describing the corpora and quan-
tifying various relevant aspects of them, we analyse some
important language-specic and domain-specic issues that
would be likely to impact on automatic sentiment processing.
We also describe experiments with standard machine learning
sentiment classication technique applied to the corpora.
The paper is structured as follows. Section 2 surveys re-
lated work in sentiment classication. Section 3 describes the
corpora. Section 4 presents experiments with the corpora and
Section 5 concludes.
2 RELATED WORK
Most work on sentiment classication has used approaches
based on supervised machine learning. For example, Pang et
al. [9] collected movie reviews that had been annotated with
respect to sentiment by their authors, and used this data to
train supervised classiers. A number of studies have investi-
gated the impact on classication accuracy of dierent factors,
including choice of feature set, machine learning algorithm,
and pre-selection of the segments of text to be classied. For
example, Dave et al. [2] experiment with the use of linguistic,
statistical and n-gram features and measures for feature selec-
tion and weighting. Pang and Lee [8] use a graph-based tech-
nique to identify and analyse only subjective parts of texts. Yu
and Hatzivassiloglou [18] use semantically oriented words for
identication of polarity at the sentence level. Most of this
work assumes binary classication (positive and negative),
sometimes with the addition of a neutral class (in terms of
polarity, representing lack of sentiment).
Turney [13] carried out an early and in
uential study into
unsupervised sentiment classication. The approach starts
from two `seed' words and builds lists of positive and sen-
timent vocabulary from large amounts of text using a tech-
nique based on pointwise mutual information. For sentiment
classication of movie reviews the approach achieves a rel-
atively modest 65% accuracy (although reviews of automo-
biles are classied with 84% accuracy). Turney attributes this
discrepancy in accuracy between domains to the much more
complex structure of movie reviews. Popescu and Etzioni [10]
extend the approach, applying hand-made rules, linguistic in-
formation and WordNet resources. Kobayashi et al. [4] employ
4 The corpora are available for download from
http://www.informatics.sussex.ac.uk/users/tz21/.
Book Review Corpora for Sentiment Analysis
Taras Zagibalov 1 and Katerina Belyatskaya2 and John Carroll 3
Abstract. We present newly-produced comparable corpora
of book reviews in English and Russian. The corpora are
comparable in terms of domain, style and size. We are using
them for cross-lingual experiments in document-level senti-
ment classication. Quantitative analyses of the corpora and
the language dierences they exhibit highlight a number of is-
sues that must be considered when developing systems for au-
tomatic sentiment classication. We also present experiments
with a sentiment classication system applied to the corpora.
The results suggest that dierences in the way that sentiment
is expressed in the two languages lead to large dierence in
sentiment classication accuracy.
1 INTRODUCTION
Automatic classication of document sentiment (and more
generally extraction of opinion from text) has recently at-
tracted a lot of interest. One of the main reasons for this
is the importance of such information to companies, other
organizations, and individuals. Applications include market-
ing research tools that help a company analyse market or
media reaction towards their brands, products or services,
or search engines that help potential purchasers make an in-
formed choice about a product they are considering buying.
Most extant sentiment classication systems use ap-
proaches based on supervised machine learning, which require
substantial manually-produced or -curated resources includ-
ing texts annotated at the document level and below, senti-
ment term dictionaries and thesauri, and some level of lan-
guage analysis.
There are a number of publicly available sentiment-
annotated corpora, such as MPQA [15], and Pang and Lee's
Movie Review corpus [8]. However, most of these corpora con-
sist just of English text. As for other languages, we are aware
of only one publicly available corpus, of Chinese product re-
views [20]. There are other corpora designed for cross-lingual
evaluations, but these seem not to be publicly available (for
example the NTCIR MOAT corpora of English, Japanese and
Chinese [12]).
As part of an on-going research eort in sentiment analy-
sis, we have designed and built comparable corpora of book
reviews in English and Russian, which we are making pub-
licly available, in the expectation that they will contribute to
1 University of Sussex, email: T.Zagibalovl@sussex.ac.uk
2 Siberian Federal University, email: e.o.belyatskaya@gmail.com
3 University of Sussex, email: J.A.Carroll@sussex.ac.uk
research in cross-lingual sentiment processing4. The Russian
corpus is probably the rst sentiment-annotated resource in
that language.
In this paper, as well as describing the corpora and quan-
tifying various relevant aspects of them, we analyse some
important language-specic and domain-specic issues that
would be likely to impact on automatic sentiment processing.
We also describe experiments with standard machine learning
sentiment classication technique applied to the corpora.
The paper is structured as follows. Section 2 surveys re-
lated work in sentiment classication. Section 3 describes the
corpora. Section 4 presents experiments with the corpora and
Section 5 concludes.
2 RELATED WORK
Most work on sentiment classication has used approaches
based on supervised machine learning. For example, Pang et
al. [9] collected movie reviews that had been annotated with
respect to sentiment by their authors, and used this data to
train supervised classiers. A number of studies have investi-
gated the impact on classication accuracy of dierent factors,
including choice of feature set, machine learning algorithm,
and pre-selection of the segments of text to be classied. For
example, Dave et al. [2] experiment with the use of linguistic,
statistical and n-gram features and measures for feature selec-
tion and weighting. Pang and Lee [8] use a graph-based tech-
nique to identify and analyse only subjective parts of texts. Yu
and Hatzivassiloglou [18] use semantically oriented words for
identication of polarity at the sentence level. Most of this
work assumes binary classication (positive and negative),
sometimes with the addition of a neutral class (in terms of
polarity, representing lack of sentiment).
Turney [13] carried out an early and in
uential study into
unsupervised sentiment classication. The approach starts
from two `seed' words and builds lists of positive and sen-
timent vocabulary from large amounts of text using a tech-
nique based on pointwise mutual information. For sentiment
classication of movie reviews the approach achieves a rel-
atively modest 65% accuracy (although reviews of automo-
biles are classied with 84% accuracy). Turney attributes this
discrepancy in accuracy between domains to the much more
complex structure of movie reviews. Popescu and Etzioni [10]
extend the approach, applying hand-made rules, linguistic in-
formation and WordNet resources. Kobayashi et al. [4] employ
4 The corpora are available for download from
http://www.informatics.sussex.ac.uk/users/tz21/.
Page 2
an iterative semi-automatic approach to extracting opinion-
bearing expressions, although this requires human input at
each iteration. Unsupervised and semi-supervised techniques
may oer the promise of overcoming domain dependence since
they do not require training data in order to be applied to a
new domain. Wiebe and Rilo [14] present an unsupervised
sentence-level subjectivity classier that uses an extensive set
(about 8000) of rules (subjectivity clues). Li, Zhang and Sind-
hwani [5] used labelled documents to adjust a hand-built sen-
timent lexicon to a domain. The extensive use of knowledge
(rule or lexicons) make these approaches language-dependent.
An alternative approach to overcoming domain dependence
is presented by Aue and Gamon [3], who attempt to solve the
problem of the absence of large amounts of labelled data by
customizing sentiment classiers to new domains using train-
ing data from other domains. Blitzer et al. [1] investigate
domain adaptation for sentiment classiers using structural
correspondence learning.
There has been little previous work on applying sentiment
analysis to languages with scarce relevant language resources.
A notable exception is the work towards producing cross-
lingual subjectivity analysis resources from English data by
Mihalcea et al. [7]. They use a parallel corpus to adjust a sub-
jectivity lexicon translated from English to Romanian. Other
multilingual opinion mining work (in English, Japanese and
Chinese) was carried out by Zagibalov and Carroll ([19] and
[21]), using techniques requiring limited manual input to clas-
sify newswire documents with respect to subjectivity and to
extract opinion holders and targets.
A number of studies include development of linguistic re-
sources for sentiment analysis. The text corpora are quite
often annotated by a several annotators to produce dier-
ent kinds of annotation. For example, Read [11] developed an
annotation scheme with about 30 dierent tags that closely
follows the Appraisal Theory [6]. Wilson and Wiebe [16] de-
veloped a detailed annotating scheme for expressions of opin-
ions, beliefs, emotions, sentiment and speculation. To ensure
annotation robustness, the authors calculate inter-annotator
agreement. Another approach uses tags produced by authors
(`self tagged') of the documents included to the corpus [2].
3 THE CORPORA
The English and Russian book review corpora consist of
reader reviews of science ction and fantasy books by pop-
ular authors. The reviews were written in 2007, ensuring that
the language used is current.
The Russian corpus consists of reviews of Russian transla-
tions of books by popular science-ction and fantasy authors,
such as S. King, S. Lem, J.K. Rowling, T. Pratchett, R. Sal-
vatore, J.R.R. Tolkien as well as by Russian authors of the
genre such as S. Lukianenko, M. Semenova and others. The
reviews were published on the website www.fenzin.org.
The English corpus comprises reviews of books by the same
authors if available. If some of the authors were not reviewed
on the site or did not have enough reviews, they were sub-
stituted with other writers of the same genre. As a result
the English corpus contains reviews of books such as: S. Er-
ickson (Guardians of the Moon, Memories of Ice), S. King
(Christine, Duma Key, Gerald's Game, Dierent Season and
others), S. Lem (Solaris, Star Diaris of Iyon Tichy, The Cy-
briad), A. Rise (Interview with the Vampire, The Tale of the
Body Thief and others), J.K. Rowling (Harry Potter), J.R.R.
Tolkien (The Hobbit, The Lord of the Rings, The Silmaril-
lion), S. Lukyanenko (The Night Watch, The Day Watch, The
Twilight Watch, The Last Watch), and some others. The re-
views were published on the website www.amazon.co.uk.
We annotated each review as `POS' if positive sentiment
prevails or `NEG' if the review is mostly negative based on
the tags assigned by reviewers, but moderated where the tag
was obviously incorrect. Each corpus consists of 1500 reviews,
half of which are positive and half negative. The annotation
is simple and encodes only the overall sentiment of a review,
for example:
[TEXT = POS]
Hope you love this book as much as I did. I thought
it was wonderful!
[/TEXT]
English reviews contain a mean of 58 words (the mean
length for positive and negative reviews being almost the
same). Positive Russian reviews have a mean length of only 30
words; negative reviews are slightly longer, at 38 words (see
Table 1). It is not possible to compare these gures directly
across the languages as they have dierent grammar struc-
tures which makes English more `wordy' as it has function
words (articles, auxiliary verbs) which are almost completely
absent in Russian.
Russian, being a synthetic language, has a lot of forms of
the same lemma. This results in a large number of distinct
word forms: the corpus contains a total of 13472 word forms,
with 6589 (42%) in positive reviews and 8993 (58%) in nega-
tive. The total number of words in the corpus is 50745, which
means that every word form was used a little more than 3
times on average. The English corpus has only 7489 word
forms in the whole corpus, 4561 (47%) in positive reviews,
and 5098 (53%) in negative. The re-use of word forms in En-
glish is much higher: every word form was used 9 times on
average (the total number of word in the corpus is 87539).
These gures suggest that the Russian reviewers used a richer
vocabulary for expressing negative opinion (compared to the
number of unique words used in Russian positive reviews)
than English reviewers.
Further evidence of the dierent ways in which people dis-
tinguish sentiment polarity in Russian compared with English
is the distribution of lengths of positive and negative reviews.
The Russian corpus has a large number of short reviews (less
than 50 words) with a median of 15 words for positive reviews
and 10 words for negative reviews. Apart from the language-
specic dierences mentioned above that partly account for
the smaller number of words in Russian documents, there is a
clear dierence from English reviews in terms of length. The
English reviews feature a more or less equal number of doc-
uments of dierent lengths (mostly in the range 15 to 75).
The prevalence of short reviews in the Russian corpus, to-
gether with the rich morphological variation, may lead to data
sparseness which would be a problem for many current senti-
ment classication techniques.
Although both of the sites from which the reviews were col-
lected feature review-ranking systems (e.g. one to ten stars),
many reviewers did not use the system or did not use it prop-
erly. For this reason all of the reviews were read through
bearing expressions, although this requires human input at
each iteration. Unsupervised and semi-supervised techniques
may oer the promise of overcoming domain dependence since
they do not require training data in order to be applied to a
new domain. Wiebe and Rilo [14] present an unsupervised
sentence-level subjectivity classier that uses an extensive set
(about 8000) of rules (subjectivity clues). Li, Zhang and Sind-
hwani [5] used labelled documents to adjust a hand-built sen-
timent lexicon to a domain. The extensive use of knowledge
(rule or lexicons) make these approaches language-dependent.
An alternative approach to overcoming domain dependence
is presented by Aue and Gamon [3], who attempt to solve the
problem of the absence of large amounts of labelled data by
customizing sentiment classiers to new domains using train-
ing data from other domains. Blitzer et al. [1] investigate
domain adaptation for sentiment classiers using structural
correspondence learning.
There has been little previous work on applying sentiment
analysis to languages with scarce relevant language resources.
A notable exception is the work towards producing cross-
lingual subjectivity analysis resources from English data by
Mihalcea et al. [7]. They use a parallel corpus to adjust a sub-
jectivity lexicon translated from English to Romanian. Other
multilingual opinion mining work (in English, Japanese and
Chinese) was carried out by Zagibalov and Carroll ([19] and
[21]), using techniques requiring limited manual input to clas-
sify newswire documents with respect to subjectivity and to
extract opinion holders and targets.
A number of studies include development of linguistic re-
sources for sentiment analysis. The text corpora are quite
often annotated by a several annotators to produce dier-
ent kinds of annotation. For example, Read [11] developed an
annotation scheme with about 30 dierent tags that closely
follows the Appraisal Theory [6]. Wilson and Wiebe [16] de-
veloped a detailed annotating scheme for expressions of opin-
ions, beliefs, emotions, sentiment and speculation. To ensure
annotation robustness, the authors calculate inter-annotator
agreement. Another approach uses tags produced by authors
(`self tagged') of the documents included to the corpus [2].
3 THE CORPORA
The English and Russian book review corpora consist of
reader reviews of science ction and fantasy books by pop-
ular authors. The reviews were written in 2007, ensuring that
the language used is current.
The Russian corpus consists of reviews of Russian transla-
tions of books by popular science-ction and fantasy authors,
such as S. King, S. Lem, J.K. Rowling, T. Pratchett, R. Sal-
vatore, J.R.R. Tolkien as well as by Russian authors of the
genre such as S. Lukianenko, M. Semenova and others. The
reviews were published on the website www.fenzin.org.
The English corpus comprises reviews of books by the same
authors if available. If some of the authors were not reviewed
on the site or did not have enough reviews, they were sub-
stituted with other writers of the same genre. As a result
the English corpus contains reviews of books such as: S. Er-
ickson (Guardians of the Moon, Memories of Ice), S. King
(Christine, Duma Key, Gerald's Game, Dierent Season and
others), S. Lem (Solaris, Star Diaris of Iyon Tichy, The Cy-
briad), A. Rise (Interview with the Vampire, The Tale of the
Body Thief and others), J.K. Rowling (Harry Potter), J.R.R.
Tolkien (The Hobbit, The Lord of the Rings, The Silmaril-
lion), S. Lukyanenko (The Night Watch, The Day Watch, The
Twilight Watch, The Last Watch), and some others. The re-
views were published on the website www.amazon.co.uk.
We annotated each review as `POS' if positive sentiment
prevails or `NEG' if the review is mostly negative based on
the tags assigned by reviewers, but moderated where the tag
was obviously incorrect. Each corpus consists of 1500 reviews,
half of which are positive and half negative. The annotation
is simple and encodes only the overall sentiment of a review,
for example:
[TEXT = POS]
Hope you love this book as much as I did. I thought
it was wonderful!
[/TEXT]
English reviews contain a mean of 58 words (the mean
length for positive and negative reviews being almost the
same). Positive Russian reviews have a mean length of only 30
words; negative reviews are slightly longer, at 38 words (see
Table 1). It is not possible to compare these gures directly
across the languages as they have dierent grammar struc-
tures which makes English more `wordy' as it has function
words (articles, auxiliary verbs) which are almost completely
absent in Russian.
Russian, being a synthetic language, has a lot of forms of
the same lemma. This results in a large number of distinct
word forms: the corpus contains a total of 13472 word forms,
with 6589 (42%) in positive reviews and 8993 (58%) in nega-
tive. The total number of words in the corpus is 50745, which
means that every word form was used a little more than 3
times on average. The English corpus has only 7489 word
forms in the whole corpus, 4561 (47%) in positive reviews,
and 5098 (53%) in negative. The re-use of word forms in En-
glish is much higher: every word form was used 9 times on
average (the total number of word in the corpus is 87539).
These gures suggest that the Russian reviewers used a richer
vocabulary for expressing negative opinion (compared to the
number of unique words used in Russian positive reviews)
than English reviewers.
Further evidence of the dierent ways in which people dis-
tinguish sentiment polarity in Russian compared with English
is the distribution of lengths of positive and negative reviews.
The Russian corpus has a large number of short reviews (less
than 50 words) with a median of 15 words for positive reviews
and 10 words for negative reviews. Apart from the language-
specic dierences mentioned above that partly account for
the smaller number of words in Russian documents, there is a
clear dierence from English reviews in terms of length. The
English reviews feature a more or less equal number of doc-
uments of dierent lengths (mostly in the range 15 to 75).
The prevalence of short reviews in the Russian corpus, to-
gether with the rich morphological variation, may lead to data
sparseness which would be a problem for many current senti-
ment classication techniques.
Although both of the sites from which the reviews were col-
lected feature review-ranking systems (e.g. one to ten stars),
many reviewers did not use the system or did not use it prop-
erly. For this reason all of the reviews were read through
Page 3
Mean Mean Total Total
tokens tokens types types
POS NEG POS NEG
English 58 58 7349 8014
Russian 30 38 9290 12309
Table 1. Overall quantitative measures of the English and
Russian corpora.
and hand-annotated. There were a lot of re-occurring short
reviews like: Õîðîøî (Good); Èíòåðåñíàß êíèãà (Interesting
book); Ñóïåð! (Superb! ); Íóäßòèíà!! (Boring!! ); Íèæå ñðåä-
íåãî (Below average); Awesome!; Amazing!; The best book
I've ever read!; Boring, and so on. These reviews were added
to the corpus only once. Also both sites had a number of
documents which did not have any direct relation to book re-
viewing, such as advertisements, announcements and o-topic
postings. Such texts were excluded as irrelevant.
The documents that were included in the corpora were not
edited or altered in any other way.
3.1 Ways of Expressing Sentiments
To better understand the dierence between the English and
the Russian corpora, we have investigated the means used
to express opinion and how this may impact on automatic
sentiment classication5.
Sentiment can be expressed at dierent levels in a language,
from lexical and phonetic levels up to the discourse level.
This range is re
ected in the corpora (see Tables 2 and 3).
As the Tables show, the two languages express sentiment in
slightly dierent ways. English makes heavy use of adjectives
to express sentiment (this class of words is used to express
sentiment in a third of all documents). In contrast, Russian
uses verbs as often as adjectives to express sentiment (both
of these classes are used in about quarter of all reviews) and
makes more use of nouns (expressing sentiment in 15% of all
documents compared to 11% in English). The Russian cor-
pus also demonstrates a tendency to combine dierent ways
of expressing sentiments in a document: the total number of
uses of dierent ways in the English corpus is 4083 compared
to 4716 in Russian, which means that given equal number of
reviews for each language, Russian reviews tend to have more
dierent ways of expressing sentiment per document.
Syntactic
Lexical
Phonetic
Verb Adj Noun Other
Positive 432 312 708 225 325 12
Negative 367 389 652 238 407 16
Total 799 701 1360 463 732 28
Table 2. Ways of expressing sentiment in the English Book
Review Corpus (numbers of documents).
Lexical Level
5 All the numerical data presented below comes from manual count-
ing and is not represented in the corpus annotation.
Syntactic
Lexical
Phonetic
Verb Adj Noun Other
Positive 417 492 648 374 367 27
Negative 475 578 567 334 394 43
Total 892 1070 1215 708 761 70
Table 3. Ways of expressing sentiment in the Russian Book
Review Corpus (numbers of documents).
Adjectives Adjectives are the most frequent means of
expressing opinions in both languages, closely followed by
verbs in the Russian corpus. 1215 Russian reviews use ad-
jectives to express sentiment and 1070 reviews use verbs. In
the English corpus there are 1360 reviews that use adjectives,
but only 701 use verbs to express opinion.
Apart from adjectives, which are recognised as the main
tool for expressing evaluation, other parts of speech are also
often used in this function, most notably verbs and nouns. The
English reviews also feature adverbials and both languages
also use interjections.
Verbs As observed by some researchers, opinions deliv-
ered by verbs are more expressive compared to opinions ex-
pressed in other ways. This is explained by the fact that a
verb's denotation is a situation and the semantic structure of
the verb re
ects linguistically relevant elements of the situ-
ation described by the verb. Appraisal verbs not only name
an action, but also express a subject's attitude to an event or
fact. Consider the following examples:
(1) I truly loved this book, and I KNOW you will, too!
(2) ïîíðàâèëîñü, íàó÷íàß ôàíòàñòèêà â õîðîøåì
èñïîëíåíèè
I liked it, it's science fiction in a very good
implementation
The English verbs loved and liked describe a whole situ-
ation which is completed by the time of reporting it. This
means that a subsequent shift in sentiment polarity is all but
impossible:
(3) *I truly loved this book, but it turned out to be boring.
Nouns Nouns can both identify an object and provide
some evaluation of it. But nouns are less frequently used for
expressing opinion compared to verbs. Nonetheless in the Rus-
sian corpus, nouns were used more than in the English cor-
pus. There are 708 Russian reviews that have opinions ex-
pressed by nouns, however, only 463 English reviews made
use of a noun to describe opinion. The most frequent such
nouns used in Russian reviews are ÷óäî (miracle), êëàññèêà
(classics), øåäåâð (masterpiece), ãåíèé (genius), ïðåëåñòü
(delight), áðåä (nonsense), ìóðà (raspberry), æâà÷êà (mind-
numbing stuff ), åðóíäà (bugger).
tokens tokens types types
POS NEG POS NEG
English 58 58 7349 8014
Russian 30 38 9290 12309
Table 1. Overall quantitative measures of the English and
Russian corpora.
and hand-annotated. There were a lot of re-occurring short
reviews like: Õîðîøî (Good); Èíòåðåñíàß êíèãà (Interesting
book); Ñóïåð! (Superb! ); Íóäßòèíà!! (Boring!! ); Íèæå ñðåä-
íåãî (Below average); Awesome!; Amazing!; The best book
I've ever read!; Boring, and so on. These reviews were added
to the corpus only once. Also both sites had a number of
documents which did not have any direct relation to book re-
viewing, such as advertisements, announcements and o-topic
postings. Such texts were excluded as irrelevant.
The documents that were included in the corpora were not
edited or altered in any other way.
3.1 Ways of Expressing Sentiments
To better understand the dierence between the English and
the Russian corpora, we have investigated the means used
to express opinion and how this may impact on automatic
sentiment classication5.
Sentiment can be expressed at dierent levels in a language,
from lexical and phonetic levels up to the discourse level.
This range is re
ected in the corpora (see Tables 2 and 3).
As the Tables show, the two languages express sentiment in
slightly dierent ways. English makes heavy use of adjectives
to express sentiment (this class of words is used to express
sentiment in a third of all documents). In contrast, Russian
uses verbs as often as adjectives to express sentiment (both
of these classes are used in about quarter of all reviews) and
makes more use of nouns (expressing sentiment in 15% of all
documents compared to 11% in English). The Russian cor-
pus also demonstrates a tendency to combine dierent ways
of expressing sentiments in a document: the total number of
uses of dierent ways in the English corpus is 4083 compared
to 4716 in Russian, which means that given equal number of
reviews for each language, Russian reviews tend to have more
dierent ways of expressing sentiment per document.
Syntactic
Lexical
Phonetic
Verb Adj Noun Other
Positive 432 312 708 225 325 12
Negative 367 389 652 238 407 16
Total 799 701 1360 463 732 28
Table 2. Ways of expressing sentiment in the English Book
Review Corpus (numbers of documents).
Lexical Level
5 All the numerical data presented below comes from manual count-
ing and is not represented in the corpus annotation.
Syntactic
Lexical
Phonetic
Verb Adj Noun Other
Positive 417 492 648 374 367 27
Negative 475 578 567 334 394 43
Total 892 1070 1215 708 761 70
Table 3. Ways of expressing sentiment in the Russian Book
Review Corpus (numbers of documents).
Adjectives Adjectives are the most frequent means of
expressing opinions in both languages, closely followed by
verbs in the Russian corpus. 1215 Russian reviews use ad-
jectives to express sentiment and 1070 reviews use verbs. In
the English corpus there are 1360 reviews that use adjectives,
but only 701 use verbs to express opinion.
Apart from adjectives, which are recognised as the main
tool for expressing evaluation, other parts of speech are also
often used in this function, most notably verbs and nouns. The
English reviews also feature adverbials and both languages
also use interjections.
Verbs As observed by some researchers, opinions deliv-
ered by verbs are more expressive compared to opinions ex-
pressed in other ways. This is explained by the fact that a
verb's denotation is a situation and the semantic structure of
the verb re
ects linguistically relevant elements of the situ-
ation described by the verb. Appraisal verbs not only name
an action, but also express a subject's attitude to an event or
fact. Consider the following examples:
(1) I truly loved this book, and I KNOW you will, too!
(2) ïîíðàâèëîñü, íàó÷íàß ôàíòàñòèêà â õîðîøåì
èñïîëíåíèè
I liked it, it's science fiction in a very good
implementation
The English verbs loved and liked describe a whole situ-
ation which is completed by the time of reporting it. This
means that a subsequent shift in sentiment polarity is all but
impossible:
(3) *I truly loved this book, but it turned out to be boring.
Nouns Nouns can both identify an object and provide
some evaluation of it. But nouns are less frequently used for
expressing opinion compared to verbs. Nonetheless in the Rus-
sian corpus, nouns were used more than in the English cor-
pus. There are 708 Russian reviews that have opinions ex-
pressed by nouns, however, only 463 English reviews made
use of a noun to describe opinion. The most frequent such
nouns used in Russian reviews are ÷óäî (miracle), êëàññèêà
(classics), øåäåâð (masterpiece), ãåíèé (genius), ïðåëåñòü
(delight), áðåä (nonsense), ìóðà (raspberry), æâà÷êà (mind-
numbing stuff ), åðóíäà (bugger).
Page 4
Phonetic Level Although the corpora consist of written
text and do not have any speech-related mark-up, some of
the review authors used speech-related methods to express
sentiment, for example:
(4) A BIG FAT ZEEROOOOOOOOOOOOO for M.A
(5) Íó ÷òî ñêàçàòü. . . ÷åïóõà. . . ×Å-ÏÓ-ÕÀ.
What shoud I say... boloney... BO-LO-NEY
Another way to express opinion in Russian is based on the
use of a sub-culture language, Padonky. This sociolect has
distinctive phonetic and lexical features that are distant from
`standard' Russian (both ocial and colloquial). For example,
a phrase usually used to express negative attitude to an author
about his book:
(6) Àôôòîð, âûïåé ÉÀÄÓ
(lit) Autor, drink some POIZON
Padonky is close to some variants of slang (corresponding in
English to expressions such as u woz, c u soon etc.), however
it is more consistent and is used quite often on the Web.
Sentence Level Sentence-level means of expressing senti-
ment (mostly exclamatory clauses, imperatives or rhetorical
questions) is slightly more frequent in the Russian corpus than
in the English: 892 and 799 respectively. The distribution of
positive and negative sentiments realised at the sentence level
is opposite in the two corpora: syntactic means are used more
frequently in negative reviews in Russian but they are more
frequent in positive reviews in English.
One particularly common sentiment-relevant sentence-level
phenomenon is the rhetorical question. This is a question only
in form, since it usually expresses a statement. For example:
(7) È îòêóäà ñòîëüêî âîñòîðæåííûõ îòçûâîâ? Êîðîáèò
îò êðóòîñòè ãëàâíûõ ãåðîåâ
Why are there so many appreciative reviews? The
`coolness' of the main characters makes me sick
(8) ×òî æå òàêîãî ïèë/ïðèíèìàë/íþõàë àâòîð, ÷òîáû
íàïèñàòü òàêîå?
What did the author drink / eat / sniff to write stuff
like that?
Some `borderline' cases like the following are also used to
express sentiment:
(9) Èíòåðåñíî, êòî-íèáóäü äîòßíóë õîòß áû äî ñåðåäè-
íû? Ëè÷íî ß - íåò.
I wonder if anyone managed to get to the middle? I
failed.
Considering imperatives, the review author is telling their
audience `what to do', which is often to read a book or to
avoid doing so.
(10) Run away! Run away!
(11) Pick up any Pratchett novel with Rincewind and
re-read it rather than buying this one
(12) ×èòàòü îäíîçíà÷íî.
Definitely should read.
(13) ×èòàòü !!!!!!!!!!! ÂÑÅÌ
Read!!!!!!!! EVERYONE
Another way of expressing sentiment by means of syntactic
structure is exclamatory clauses, which are by their very na-
ture aective. This type of sentence is widely represented in
both corpora.
(14) It certainly leaves you hungering for more!
(15) Buy at your peril. Mines in the bin!
Discourse Level Some of the means of sentiment expres-
sion are quite complex and dicult to analyse automatically:
(16) È
so
ýòî
this
àâòîð
author
âû÷èñëèòåëß
calculator
è
and
ëåììèíãîâ?
lemmings?
...
...
ÍÅ
(DO)NOT
ÂÅÐÞ!
BELIVE!
Ñàäèñü,
sit
Ãðîìîâ,
gromov
äâà.
two
So is this the author of The Calculator and of The
Lemmings? . . . Can't believe it! Sit down, Gromov,
mark `D' !
This short review of a new book by Gromov, the author of the
popular novels The Calculator and The Lemmings, consists of
a rhetorical question, an exclamatory phrase and an impera-
tive. All of these means of expression are dicult to process.
Even the explicit appraisal expressed by utilising a secondary
school grade system is problematic as it requires specialised
real-word knowledge. Otherwise the numeral `two'6 has noth-
ing to do with appraisal per se.
The example below also features an imperative sentence
used to express negative sentiment. This review also lacks
any explicit sentiment markers. The negative appraisal is ex-
pressed by the verbs `stab' and `burn' that only in this context
show negative attitude.
(17) Stab the book and burn it!
Discussion The reviews in English and in Russian often
use dierent means of expressing sentiment, many of which
are dicult (if at all possible) to process automatically. Often
opinions are described through adjectives (86% of reviews con-
tain adjectives). The second most frequent way of expressing
sentiment is through verbs (59% of reviews have sentiment-
bearing verbs). Less frequent is the noun, in 39% of reviews.
Sentence-level and discourse-level sentiment phenomena are
found in 56% of reviews. 3% of reviews contain phonetic phe-
nomena.
6 Russian schools use a 5-grade marking system, with 5 as the
highest mark. Thus 2 can be thought of as equivalent to `D'.
text and do not have any speech-related mark-up, some of
the review authors used speech-related methods to express
sentiment, for example:
(4) A BIG FAT ZEEROOOOOOOOOOOOO for M.A
(5) Íó ÷òî ñêàçàòü. . . ÷åïóõà. . . ×Å-ÏÓ-ÕÀ.
What shoud I say... boloney... BO-LO-NEY
Another way to express opinion in Russian is based on the
use of a sub-culture language, Padonky. This sociolect has
distinctive phonetic and lexical features that are distant from
`standard' Russian (both ocial and colloquial). For example,
a phrase usually used to express negative attitude to an author
about his book:
(6) Àôôòîð, âûïåé ÉÀÄÓ
(lit) Autor, drink some POIZON
Padonky is close to some variants of slang (corresponding in
English to expressions such as u woz, c u soon etc.), however
it is more consistent and is used quite often on the Web.
Sentence Level Sentence-level means of expressing senti-
ment (mostly exclamatory clauses, imperatives or rhetorical
questions) is slightly more frequent in the Russian corpus than
in the English: 892 and 799 respectively. The distribution of
positive and negative sentiments realised at the sentence level
is opposite in the two corpora: syntactic means are used more
frequently in negative reviews in Russian but they are more
frequent in positive reviews in English.
One particularly common sentiment-relevant sentence-level
phenomenon is the rhetorical question. This is a question only
in form, since it usually expresses a statement. For example:
(7) È îòêóäà ñòîëüêî âîñòîðæåííûõ îòçûâîâ? Êîðîáèò
îò êðóòîñòè ãëàâíûõ ãåðîåâ
Why are there so many appreciative reviews? The
`coolness' of the main characters makes me sick
(8) ×òî æå òàêîãî ïèë/ïðèíèìàë/íþõàë àâòîð, ÷òîáû
íàïèñàòü òàêîå?
What did the author drink / eat / sniff to write stuff
like that?
Some `borderline' cases like the following are also used to
express sentiment:
(9) Èíòåðåñíî, êòî-íèáóäü äîòßíóë õîòß áû äî ñåðåäè-
íû? Ëè÷íî ß - íåò.
I wonder if anyone managed to get to the middle? I
failed.
Considering imperatives, the review author is telling their
audience `what to do', which is often to read a book or to
avoid doing so.
(10) Run away! Run away!
(11) Pick up any Pratchett novel with Rincewind and
re-read it rather than buying this one
(12) ×èòàòü îäíîçíà÷íî.
Definitely should read.
(13) ×èòàòü !!!!!!!!!!! ÂÑÅÌ
Read!!!!!!!! EVERYONE
Another way of expressing sentiment by means of syntactic
structure is exclamatory clauses, which are by their very na-
ture aective. This type of sentence is widely represented in
both corpora.
(14) It certainly leaves you hungering for more!
(15) Buy at your peril. Mines in the bin!
Discourse Level Some of the means of sentiment expres-
sion are quite complex and dicult to analyse automatically:
(16) È
so
ýòî
this
àâòîð
author
âû÷èñëèòåëß
calculator
è
and
ëåììèíãîâ?
lemmings?
...
...
ÍÅ
(DO)NOT
ÂÅÐÞ!
BELIVE!
Ñàäèñü,
sit
Ãðîìîâ,
gromov
äâà.
two
So is this the author of The Calculator and of The
Lemmings? . . . Can't believe it! Sit down, Gromov,
mark `D' !
This short review of a new book by Gromov, the author of the
popular novels The Calculator and The Lemmings, consists of
a rhetorical question, an exclamatory phrase and an impera-
tive. All of these means of expression are dicult to process.
Even the explicit appraisal expressed by utilising a secondary
school grade system is problematic as it requires specialised
real-word knowledge. Otherwise the numeral `two'6 has noth-
ing to do with appraisal per se.
The example below also features an imperative sentence
used to express negative sentiment. This review also lacks
any explicit sentiment markers. The negative appraisal is ex-
pressed by the verbs `stab' and `burn' that only in this context
show negative attitude.
(17) Stab the book and burn it!
Discussion The reviews in English and in Russian often
use dierent means of expressing sentiment, many of which
are dicult (if at all possible) to process automatically. Often
opinions are described through adjectives (86% of reviews con-
tain adjectives). The second most frequent way of expressing
sentiment is through verbs (59% of reviews have sentiment-
bearing verbs). Less frequent is the noun, in 39% of reviews.
Sentence-level and discourse-level sentiment phenomena are
found in 56% of reviews. 3% of reviews contain phonetic phe-
nomena.
6 Russian schools use a 5-grade marking system, with 5 as the
highest mark. Thus 2 can be thought of as equivalent to `D'.
Page 5
3.2 Issues that may Aect Automatic
Processing
One of the features of web content not mentioned above is
a high level of mistakes and typos. Sometimes authors do
not observe the standard rules on purpose (for example us-
ing sociolects, as outlined above). For example, in the cor-
pora 52% of all documents contain spelling mistakes in words
that have sentiment-related meaning. The English corpus is
less aected as authors do not often change spelling on pur-
pose and use contractions that have already become conven-
tional (e.g. wanna, gonna, and u). However the number of
spelling mistakes is still high: 48% of reviews contain mis-
takes in sentiment-bearing words. The number of misspelled
words in the Russian corpus is higher, at 58%.
Of course, a spelling error is not always fatal for automatic
sentiment classication of a document, since reviews usually
have more sentiment indicators than just one word. However,
as many as 8% of the reviews in both corpora have all of their
sentiment bearing words misspelled. This would pose severe
diculties for automatic sentiment classication.
Another obstacle that makes sentiment analysis dicult is
topic shift, in which the majority of a review describes a
dierent object and compares it to the item under review.
The negative review below is an example of this:
(18) Äî÷èòàëà ñ òðóäîì. Íè÷åãî èíòåðåñíîãî ñ òî÷êè
çðåíèß èíôîðìàöèè. Îáðàçåö èíòåëëåêòóàëüíîãî
äåòåêòèâà ðîìàíû Ó.Ýêî. È ÷èòàòü ïðèßòíî, è
ãëóáèíà ôèëîñîôèè, è â èñòîðè÷åñêîì ïëàíå ïî-
çíàâàòåëüíî. À â ýñòåòè÷åñêîì îòíîøåíèè âîîáùå
âûøå âñßêèõ ïîõâàë.
Hardly managed to read to the end. Nothing
interesting from the point of view of information. An
example of intellectual detective stories are novels
by U.Eko. It's a pleasure to read them, and (they
have) deep philosophy, and quite informative from
the point of view of history. And as for aesthetics it's
just beyond praise.
The novel being reviewed is not the one being described, and
all the praise goes to novels by another author. None of the
positive vocabulary has anything to do with the overall senti-
ment of the review's author towards the book under review.
Other reviews that are dicult to classify are those that
describe some positive or negative aspects of a reviewed item,
but in the end give an overall sentiment of the opposite
direction. Consider the following positive review:
(19) Ñþæåò äîâîëüíî îáû÷åí, ßçûê èçëîæåíèß ïðîñò
äî áåçîáðàçèß. Ìíîãî ãðßçè, ìíîãî êðîâè è ñìåð-
òè. Ñëèøêîì ðåàëüíî äëß ñêàçêè êîåé ßâëßåòñß
ôýíòåçè. Íî èíîãäà òàêèå êíèãè ÷èòàòü ïîëåçíî,
èáî îíè îïèñûâàþò íåïðèãëßäíóþ ðåàëüíîñòü.
The plot is quite usual, the language is wickedly
simple. A lot of filth, a lot of blood and death. Too
true-to-life for a fairy-tale, which a fantasy genre
actually is. But it is useful to read such books from
time to time, as they depict ugly reality.
The large number of negative lexical units may mislead an
automatic classier to a conclusion that the review is negative.
The three issues described above are present in approxi-
mately one third of all reviews in the corpora. This suggests
that a sentiment classier using words as features could only
correctly classify around 55{60% of all reviews.
This performance may be even worse for the Russian corpus
as many its reviews feature very unexpected ways of express-
ing opinion. Unlike most of the English reviews, in which a
reviewer simply gives a positive or negative appraisal of a
book backing it with some reasoning and probably providing
some description and analysis of the plot, Russian reviews of-
ten contain irony, jokes, and use non-standard words
and phrases, making use of a variety of language tools, as
illustrated in the following examples:
(20) Ñêóøíàà. äîø¼ë äî áåãñòâà Ãà â ìèð ßíóñà, è âíå-
çàïíî ïîíßë (ß), ÷òî ãîðè îí (ÃÃ) õîòü ñèíèì ïëà-
ìåíåì
Booorin'. got to the (episode of) GG fleeing to the
world of Janus, and suddenly (I) realised that (lit.)
let it (GG) burn with blue flames ( I do not at all
care about GG)
(21) ß ýòó ìóòü íå ïîêóïàë. Shift+del.
I didn't buy this garbage. Shift+del.
Since there are more reviews of this kind in the Russian corpus
than in the English, it is very likely that a Russian sentiment
classier would have lower accuracy.
4 EXPERIMENTS
We used Nave Bayes multinomial (NBm) and a Support Vec-
tor Machine classiers7 to investigate performance of stan-
dard supervised classiers on the two corpora . The feature
sets were the lexical units extracted from the relevant cor-
pora. We extracted all words from the corpora but did not
process them in any way (no stemming or lemmatisation).
15582 words were extracted from the Russian corpus and 9659
words were found in the English book reviews. The evaluation
technique is 10-fold cross-validation.
NBm SVM
Corpus P R F P R F
English book reviews 0.88 0.88 0.88 0.84 0.84 0.84
Russian book reviews 0.81 0.81 0.81 0.78 0.78 0.78
Table 4. Supervised classication results (Precision, Recall and
F1, 10-fold cross-validation)
Table 4 show the results of supervised classication, Rus-
sian review classication being 6-7 percentage points worse
the results obtained from the English corpus.
5 CONCLUSION
In this paper we presented comparable corpora of English and
Russian book reviews, providing the research community with
a resource that can be used for cross-lingual sentiment classi-
cation experiments. We examined language-specic features
7 We used WEKA 3.4.11 [17]
(http://www.cs.waikato.ac.nz/~ml/weka )
Processing
One of the features of web content not mentioned above is
a high level of mistakes and typos. Sometimes authors do
not observe the standard rules on purpose (for example us-
ing sociolects, as outlined above). For example, in the cor-
pora 52% of all documents contain spelling mistakes in words
that have sentiment-related meaning. The English corpus is
less aected as authors do not often change spelling on pur-
pose and use contractions that have already become conven-
tional (e.g. wanna, gonna, and u). However the number of
spelling mistakes is still high: 48% of reviews contain mis-
takes in sentiment-bearing words. The number of misspelled
words in the Russian corpus is higher, at 58%.
Of course, a spelling error is not always fatal for automatic
sentiment classication of a document, since reviews usually
have more sentiment indicators than just one word. However,
as many as 8% of the reviews in both corpora have all of their
sentiment bearing words misspelled. This would pose severe
diculties for automatic sentiment classication.
Another obstacle that makes sentiment analysis dicult is
topic shift, in which the majority of a review describes a
dierent object and compares it to the item under review.
The negative review below is an example of this:
(18) Äî÷èòàëà ñ òðóäîì. Íè÷åãî èíòåðåñíîãî ñ òî÷êè
çðåíèß èíôîðìàöèè. Îáðàçåö èíòåëëåêòóàëüíîãî
äåòåêòèâà ðîìàíû Ó.Ýêî. È ÷èòàòü ïðèßòíî, è
ãëóáèíà ôèëîñîôèè, è â èñòîðè÷åñêîì ïëàíå ïî-
çíàâàòåëüíî. À â ýñòåòè÷åñêîì îòíîøåíèè âîîáùå
âûøå âñßêèõ ïîõâàë.
Hardly managed to read to the end. Nothing
interesting from the point of view of information. An
example of intellectual detective stories are novels
by U.Eko. It's a pleasure to read them, and (they
have) deep philosophy, and quite informative from
the point of view of history. And as for aesthetics it's
just beyond praise.
The novel being reviewed is not the one being described, and
all the praise goes to novels by another author. None of the
positive vocabulary has anything to do with the overall senti-
ment of the review's author towards the book under review.
Other reviews that are dicult to classify are those that
describe some positive or negative aspects of a reviewed item,
but in the end give an overall sentiment of the opposite
direction. Consider the following positive review:
(19) Ñþæåò äîâîëüíî îáû÷åí, ßçûê èçëîæåíèß ïðîñò
äî áåçîáðàçèß. Ìíîãî ãðßçè, ìíîãî êðîâè è ñìåð-
òè. Ñëèøêîì ðåàëüíî äëß ñêàçêè êîåé ßâëßåòñß
ôýíòåçè. Íî èíîãäà òàêèå êíèãè ÷èòàòü ïîëåçíî,
èáî îíè îïèñûâàþò íåïðèãëßäíóþ ðåàëüíîñòü.
The plot is quite usual, the language is wickedly
simple. A lot of filth, a lot of blood and death. Too
true-to-life for a fairy-tale, which a fantasy genre
actually is. But it is useful to read such books from
time to time, as they depict ugly reality.
The large number of negative lexical units may mislead an
automatic classier to a conclusion that the review is negative.
The three issues described above are present in approxi-
mately one third of all reviews in the corpora. This suggests
that a sentiment classier using words as features could only
correctly classify around 55{60% of all reviews.
This performance may be even worse for the Russian corpus
as many its reviews feature very unexpected ways of express-
ing opinion. Unlike most of the English reviews, in which a
reviewer simply gives a positive or negative appraisal of a
book backing it with some reasoning and probably providing
some description and analysis of the plot, Russian reviews of-
ten contain irony, jokes, and use non-standard words
and phrases, making use of a variety of language tools, as
illustrated in the following examples:
(20) Ñêóøíàà. äîø¼ë äî áåãñòâà Ãà â ìèð ßíóñà, è âíå-
çàïíî ïîíßë (ß), ÷òî ãîðè îí (ÃÃ) õîòü ñèíèì ïëà-
ìåíåì
Booorin'. got to the (episode of) GG fleeing to the
world of Janus, and suddenly (I) realised that (lit.)
let it (GG) burn with blue flames ( I do not at all
care about GG)
(21) ß ýòó ìóòü íå ïîêóïàë. Shift+del.
I didn't buy this garbage. Shift+del.
Since there are more reviews of this kind in the Russian corpus
than in the English, it is very likely that a Russian sentiment
classier would have lower accuracy.
4 EXPERIMENTS
We used Nave Bayes multinomial (NBm) and a Support Vec-
tor Machine classiers7 to investigate performance of stan-
dard supervised classiers on the two corpora . The feature
sets were the lexical units extracted from the relevant cor-
pora. We extracted all words from the corpora but did not
process them in any way (no stemming or lemmatisation).
15582 words were extracted from the Russian corpus and 9659
words were found in the English book reviews. The evaluation
technique is 10-fold cross-validation.
NBm SVM
Corpus P R F P R F
English book reviews 0.88 0.88 0.88 0.84 0.84 0.84
Russian book reviews 0.81 0.81 0.81 0.78 0.78 0.78
Table 4. Supervised classication results (Precision, Recall and
F1, 10-fold cross-validation)
Table 4 show the results of supervised classication, Rus-
sian review classication being 6-7 percentage points worse
the results obtained from the English corpus.
5 CONCLUSION
In this paper we presented comparable corpora of English and
Russian book reviews, providing the research community with
a resource that can be used for cross-lingual sentiment classi-
cation experiments. We examined language-specic features
7 We used WEKA 3.4.11 [17]
(http://www.cs.waikato.ac.nz/~ml/weka )
Page 6
of the reviews that are relevant to sentiment classication and
showed that sentiment in dierent languages is expressed in
slightly dierent ways, covering all levels of the language: from
phonetic to discourse. The experiments suggest that these dif-
ferences have an impact on the accuracy of a standard, super-
vised sentiment classication technique.
In future work, we intend to investigate in more depth
which specic characteristics of dierent languages lead to dif-
ferences in sentiment classication accuracy, using sentiment-
annotated corpora of English, Russian, Chinese and Japanese.
References
[1] John Blitzer, Mark Dredze, and Fernando Pereira, `Biogra-
phies, bollywood, boom-boxes and blenders: Domain adapta-
tion for sentiment classication', in Proceedings of the 45th
Annual Meet- ing of the Association of Computational Lin-
guistics., pp. 440{447, Prague, Czech Republic, (June 2007).
Association for Computational Linguistics.
[2] Kushal Dave, Steve Lawrence, and David M. Pennock, `Min-
ing the peanut gallery: Opinion extraction and semantic clas-
sication of product reviews', in Proceedings of the 12th in-
ternational conference on Information and Knowledge Man-
agement, pp. 519 { 528, Budapest, Hungary, (2003). ACM
Press.
[3] Michael Gamon and Anthony Aue, `Automatic identica-
tion of sentiment vocabulary: exploiting low association with
known sentiment terms', in Proceedings of the ACL Work-
shop on Feature Engineering for Machine Learning in Natu-
ral Language Processing, pp. 57{64. Association for Compu-
tational Linguistics, (2005).
[4] Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji
Tateishi, and Toshika Fukushima, `Collecting evaluative ex-
pressions for opinion extraction', Natural Language Process-
ing{IJCNLP 2004, 13(12), 596{605, (December 2004).
[5] Tao Li, Yi Zhang, and Vikas Sindhwani, `A Non-negative Ma-
trix Tri-factorization Approach to Sentiment Classication
with Lexical Prior Knowledge', in Proceeding of Association
for Computational Linguistics, number August, pp. 244|-
252, Morristown, NJ, USA, (2009). Association for Compu-
tational Linguistics.
[6] J.R. Martin and Peter Robert Rupert White, The language of
evaluation: Appraisal in English, Palgrave Macmillan, 2005.
[7] Rada Mihalcea, Carmen Banea, and Janyce M Wiebe, `Learn-
ing multilingual subjective language via cross-lingual projec-
tions', in 976 Proceedings of the 45th Annual Meeting of
the Association of Computational Linguistics, volume 45, pp.
976|-983, Prague, Czech Republic, (2007).
[8] Bo Pang and Lillian Lee, `A sentimental education: Sentiment
analysis using subjectivity summarization based on Minimum
Cuts', in the 42nd Annual Meeting on Association of Com-
putational Linguistics, Barcelona, Spain, (2004).
[9] Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan,
`Thumbs up?: sentiment classication using machine learning
techniques', in Conference on Empirical Methods in Natural
Language Processing, pp. 79|-86, (2002).
[10] Ana-Maria Popescu and Oren Etzioni, `Extracting product
features and opinions from reviews', in Natural Language Pro-
cessing and Text Mining, pp. 9{28, Vancouver, Canada, (Oc-
tober 2005). Springer.
[11] Jonathon Read, David Hope, and John Carroll, `Annotating
expressions of appraisal in English', ACL 2007, 93, (2007).
[12] Yohei Seki, David K. Evans, Lun-Wei Ku, Le Sun, Hsin-Hsi
Chen, and Noriko Kando, `Overview of multilingual opin-
ion analysis task at NTCIR-7', Proceedings NTCIR-7, NII,
Tokyo, 185{203, (2008).
[13] Peter D. Turney, `Thumbs up or thumbs down? Semantic ori-
entation applied to unsupervised classication of reviews', in
Annual Meeting of Assosiation of Computational Linguistics,
pp. 417{424, Philadelphia, Pennsylvania, (2002).
[14] Janyce M Wiebe and Ellen Rilo, `Creating subjective and
objective sentence classiers from unannotated texts', Com-
putational Linguistics and Intelligent Text Processing, 486{
497, (2005).
[15] Janyce M Wiebe, Theresa Ann Wilson, and Claire Cardie,
`Annotating expressions of opinions and emotions in lan-
guage', Language Resources and Evaluation, 39(2), 165{210,
(2005).
[16] Theresa Ann Wilson and Janyce M Wiebe, `Annotating opin-
ions in the world press', in 4th SIGdial Workshop on Dis-
course and Dialogue (SIGdial-03), pp. 13{22, (2003).
[17] I.H. Witten and Eibe Frank, Data Mining: Practical machine
learning tools and techniques, Morgan Kaufmann Pub, San
Francisco, 2nd edn., 2005.
[18] Hong Yu and Vasileios Hatzivassiloglou, `Towards answering
opinion questions: Separating facts from opinions and iden-
tifying the polarity of opinion sentences', in Proceedings of
EMNLP, volume 3, pp. 129{136. Association for Computa-
tional Linguistics, (2003).
[19] Taras Zagibalov and John Carroll, `Almost Unsupervised
Cross Language Opinion Analysis at NTCIR 7', in NTCIR-7,
Tokyo, (2008).
[20] Taras Zagibalov and John Carroll, `Automatic Seed Word Se-
lection for Unsupervised Sentiment Classication of Chinese
Text', in Proceedings of the 22nd International Conference
on Computational Linguistics, pp. 1073|-1080, Manchester,
United Kingdom, (2008).
[21] Taras Zagibalov and John Carroll, `Multilingual Opinion
Holder and Target Extraction using Knowledge-Poor Tech-
niques', in Language and Technology Conference, Pozna~n,
Poland, (2009).
showed that sentiment in dierent languages is expressed in
slightly dierent ways, covering all levels of the language: from
phonetic to discourse. The experiments suggest that these dif-
ferences have an impact on the accuracy of a standard, super-
vised sentiment classication technique.
In future work, we intend to investigate in more depth
which specic characteristics of dierent languages lead to dif-
ferences in sentiment classication accuracy, using sentiment-
annotated corpora of English, Russian, Chinese and Japanese.
References
[1] John Blitzer, Mark Dredze, and Fernando Pereira, `Biogra-
phies, bollywood, boom-boxes and blenders: Domain adapta-
tion for sentiment classication', in Proceedings of the 45th
Annual Meet- ing of the Association of Computational Lin-
guistics., pp. 440{447, Prague, Czech Republic, (June 2007).
Association for Computational Linguistics.
[2] Kushal Dave, Steve Lawrence, and David M. Pennock, `Min-
ing the peanut gallery: Opinion extraction and semantic clas-
sication of product reviews', in Proceedings of the 12th in-
ternational conference on Information and Knowledge Man-
agement, pp. 519 { 528, Budapest, Hungary, (2003). ACM
Press.
[3] Michael Gamon and Anthony Aue, `Automatic identica-
tion of sentiment vocabulary: exploiting low association with
known sentiment terms', in Proceedings of the ACL Work-
shop on Feature Engineering for Machine Learning in Natu-
ral Language Processing, pp. 57{64. Association for Compu-
tational Linguistics, (2005).
[4] Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji
Tateishi, and Toshika Fukushima, `Collecting evaluative ex-
pressions for opinion extraction', Natural Language Process-
ing{IJCNLP 2004, 13(12), 596{605, (December 2004).
[5] Tao Li, Yi Zhang, and Vikas Sindhwani, `A Non-negative Ma-
trix Tri-factorization Approach to Sentiment Classication
with Lexical Prior Knowledge', in Proceeding of Association
for Computational Linguistics, number August, pp. 244|-
252, Morristown, NJ, USA, (2009). Association for Compu-
tational Linguistics.
[6] J.R. Martin and Peter Robert Rupert White, The language of
evaluation: Appraisal in English, Palgrave Macmillan, 2005.
[7] Rada Mihalcea, Carmen Banea, and Janyce M Wiebe, `Learn-
ing multilingual subjective language via cross-lingual projec-
tions', in 976 Proceedings of the 45th Annual Meeting of
the Association of Computational Linguistics, volume 45, pp.
976|-983, Prague, Czech Republic, (2007).
[8] Bo Pang and Lillian Lee, `A sentimental education: Sentiment
analysis using subjectivity summarization based on Minimum
Cuts', in the 42nd Annual Meeting on Association of Com-
putational Linguistics, Barcelona, Spain, (2004).
[9] Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan,
`Thumbs up?: sentiment classication using machine learning
techniques', in Conference on Empirical Methods in Natural
Language Processing, pp. 79|-86, (2002).
[10] Ana-Maria Popescu and Oren Etzioni, `Extracting product
features and opinions from reviews', in Natural Language Pro-
cessing and Text Mining, pp. 9{28, Vancouver, Canada, (Oc-
tober 2005). Springer.
[11] Jonathon Read, David Hope, and John Carroll, `Annotating
expressions of appraisal in English', ACL 2007, 93, (2007).
[12] Yohei Seki, David K. Evans, Lun-Wei Ku, Le Sun, Hsin-Hsi
Chen, and Noriko Kando, `Overview of multilingual opin-
ion analysis task at NTCIR-7', Proceedings NTCIR-7, NII,
Tokyo, 185{203, (2008).
[13] Peter D. Turney, `Thumbs up or thumbs down? Semantic ori-
entation applied to unsupervised classication of reviews', in
Annual Meeting of Assosiation of Computational Linguistics,
pp. 417{424, Philadelphia, Pennsylvania, (2002).
[14] Janyce M Wiebe and Ellen Rilo, `Creating subjective and
objective sentence classiers from unannotated texts', Com-
putational Linguistics and Intelligent Text Processing, 486{
497, (2005).
[15] Janyce M Wiebe, Theresa Ann Wilson, and Claire Cardie,
`Annotating expressions of opinions and emotions in lan-
guage', Language Resources and Evaluation, 39(2), 165{210,
(2005).
[16] Theresa Ann Wilson and Janyce M Wiebe, `Annotating opin-
ions in the world press', in 4th SIGdial Workshop on Dis-
course and Dialogue (SIGdial-03), pp. 13{22, (2003).
[17] I.H. Witten and Eibe Frank, Data Mining: Practical machine
learning tools and techniques, Morgan Kaufmann Pub, San
Francisco, 2nd edn., 2005.
[18] Hong Yu and Vasileios Hatzivassiloglou, `Towards answering
opinion questions: Separating facts from opinions and iden-
tifying the polarity of opinion sentences', in Proceedings of
EMNLP, volume 3, pp. 129{136. Association for Computa-
tional Linguistics, (2003).
[19] Taras Zagibalov and John Carroll, `Almost Unsupervised
Cross Language Opinion Analysis at NTCIR 7', in NTCIR-7,
Tokyo, (2008).
[20] Taras Zagibalov and John Carroll, `Automatic Seed Word Se-
lection for Unsupervised Sentiment Classication of Chinese
Text', in Proceedings of the 22nd International Conference
on Computational Linguistics, pp. 1073|-1080, Manchester,
United Kingdom, (2008).
[21] Taras Zagibalov and John Carroll, `Multilingual Opinion
Holder and Target Extraction using Knowledge-Poor Tech-
niques', in Language and Technology Conference, Pozna~n,
Poland, (2009).
Sign up today - FREE
Mendeley saves you time finding and organizing research. Learn more
- All your research in one place
- Add and import papers easily
- Access it anywhere, anytime
Start using Mendeley in seconds!
Readership Statistics
3 Readers on Mendeley
by Discipline
33% Linguistics
by Academic Status
100% Ph.D. Student
by Country
33% United Kingdom
33% United States
33% Portugal



