It may be difficult to predict how well a student will perform academically, but a new innovation can do so just by looking at their tweets - and with more than 93 percent accuracy.
A computer model trained on thousands of test scores and one million social media posts to distinguishing between high academic achievers and lower ones based on textual features shared in posts.
The technology, powered by artificial intelligence, determined that students who discuss scientific and cultural topics, along with writing lengthy posts and words are likely to perform well.
However, those who use an abundance of emojis, words or entire phrases written in in capital letters and vocabulary related to horoscopes, driving and military service tend to receive lower grades in school.
The team notes that by 'predict' they do not mean the system creates a future forecast, but rather a correlation between posts and real test scores students earned.
The use of capitalized words, emojis and exclamations were found to be negatively correlated with academic performance. On the other hand, using Latin characters, creating average post and word length, extensive vocabulary size, and entropy of users' texts were found to positively correlate with academic performance
The study was conducted by a team from the National Research University Higher School of Economics, which employed a prediction model that uses mathematical textual analysis capable of rating words, phrases, topics and other content in social media posts.
Ivan Smirnov, the lead researcher, is the mastermind behind the system and experiment gathered test scores from 2,468 students who took the Program for International Students Assessment (PISA), which is a testing system used to measure pupils' performance in math, science and reading.
Along with the exam, the dataset included more than 130,00 social media posts from the European social media site VKontakte - a Facebook alternative.
The results were compared with the average Unified State Exam, which is the equivalent to the SAT test in the US.
Highest scores include (orange): English words; Words related to literature ;