Description
Introduction
Douban Movie is a Chinese website that allows Internet users to share their comments and viewpoints about movies. Users are able to post short or long comments on movies and give them marks. This dataset contains more than 2 million short comments of 28 movies in Douban Movie website. It can be used on text classification, text clustering, sentiment analysis, semantic web construction and some other fields that relate to web mining or NLP (of Chinese lol).
metadata
ID the ID of the comment (start from 0)
MovieNameEN the English name of the movie
MovieNameCN the Chinese name of the movie
Crawl_Date the date that the data are crawled
Number the number of the comment
Username the username of the account
Date the date that the comment posted
Star the star that users give to the movie (from 1 to 5, 5 grades)
Comment the content of the comment
Like the count of "like" on the comment