Abstract:
The popularity of social media platforms has generated a new social interaction environment thus a new collaboration network among individuals. These platforms own tremendous amount of data about users’ behaviors and sentiments. One of these platforms is Twitter, which provides researchers data potential of benefit for their studies. Based on Twitter data, in this study a multilingual sentiment detection framework is proposed to compute European Gross National Happiness (GNH). This framework consists of a novel data collection, filtering and sampling method, and multilingual sentiment detection algorithm for social media big data, and tested with nine European countries (United Kingdom, Germany, Sweden, Turkey, Portugal, Netherlands, Italy, France and Spain) and their national languages over six-year period. The reliability of the data is checked with peak/troughs comparison for special days from Wikipedia. The validity is checked with a group of correlation analyses with OECD Life Satisfaction survey reports’, currency exchanges, and national stock market time series data. Then, the European GNH map is drawn for six years. Lastly, an exploratory study for determining the relationships between users’ Twitter account features (number of tweets, number of followers etc.) and happiness polarities are analyzed. Main aim of this study is to propose a novel multilingual social media sentiment analysis framework for calculating GNH for countries and change the way of OECD type organizations’ survey and interview methodology. Also, it is believed that this framework can serve more detailed results (e.g. daily or hourly sentiments of society in different languages).