This work presents BolLy, a corpus of Bollywood song lyrics and their metadata, annotated with sentiment polarity. It contains the lyrics of 1055 songs, ranging from those composed in 1970 to the most recent ones. All annotation was done manually by three annotators, which makes the dataset a rich resource for training purposes. We describe the creation and annotation process, the content, and the possible uses of the dataset. As an experiment, we built a basic classification system that identifies the emotion polarity of a song based solely on its lyrics; it can serve as a baseline for this task. BolLy can also be used for studying code-mixing in lyrics.
Apoorva, G. D., & Mamidi, R. (2018). BolLy: Annotation of Sentiment Polarity in Bollywood Lyrics Dataset. In Communications in Computer and Information Science (Vol. 781, pp. 41–50). Springer Verlag. https://doi.org/10.1007/978-981-10-8438-6_4