In this competition, you are provided with a supervised dataset X consisting of the raw content (html format) of news articles and the binary popularity (where 1 means "popular" and −1 means "unpopular", calculated based on the number of shares in online social networking services) of these articles as labels. Your goal is to learn a function f from X that is able to predict the popularity of an unseen news article.