Hunting Bots & Trolls in a Game Environment

Speaker(s):
Dr. Stuart Shulman
Date of Event:
February 12, 2020
Associated SMA Project
No items found.
Hunting Bots & Trolls in a Game Environment

Speaker: Shulman, S. (Texifter)

Date: 12 February 2020

Speaker Session Preview

SMA hosted a speaker session presented by Dr. Stuart Shulman (Texifter) as a part of its SMA General Speaker Series. Dr. Shulman’s brief focused on the ways in which one identify Internet bots and trolls by using text classification programs. He stated that though machines enable one to classify large bodies of text, it remains difficult for machines to be consistently accurate; therefore, a human must remain in the loop. Dr. Shulman then spoke about CoderRank, a program that ranks humans based on trust and knowledge vectors. The idea behind the program is, when training machines for text analysis, greater reliance should be placed on the specific inputs of those humans most likely to create a valid observation. CoderRank, therefore, is designed to identify those who are most qualified to code and classify text. Dr. Shulman proceeded to discuss lessons learned from tracking down fake accounts in the Trudeau and Biden campaigns. In the Trudeau campaign, Dr. Shulman and his team looked for biographical indicators that were not Canadian, conducted searches based on keywords, and analyzed abnormal text. In the Biden campaign, Dr. Shulman and his team looked for accounts with significant followings that had, in reality, purchased social media followers. He stated that as we move into this next technological landscape, it is only going to become more difficult to discern who is real and who is not online. Moreover, if one immerses oneself in social media profile data and looks for specific indicators, one can see a mixture of real, authentic people; human-assisted bots; bot-assisted humans; and pure bots. Dr. Shulman then spoke about bot detection based on the frequency of tweets. He and his team of coders create filters, labels, and other features in order to determine where the tweets are coming from. They also examine Twitter accounts’ networks to decipher whether or not the accounts of interest are operated by humans. Dr. Shulman expressed, however, that gaining access to data is becoming increasingly difficult due to increased regulations and data privacy restrictions, which makes it harder for him and his team to identify bots. To conclude, Dr. Shulman spoke about how his text classification and bot detecting techniques can be applied to a gaming environment, specifically with regards to veteran suicide prevention and school shooters prevention.

: :

Dr. Stuart Shulman

Brief Bio

Dr. Stuart W. Shulman is an inventor, entrepreneur, recovering political science professor, US Soccer National C licensed coach, proud dad and son, CEO of Texifter, the inventor of DiscoverText.

Background

DiscoverText is multilingual-capable platform designed for collecting and cleaning up messy and redundant text data streams. It provides basic research and measurement tools to improve human and machine performance classifying text data over time. The software supports researcher efforts to test inferences using a theoretical and applied model informed by a decade of interdisciplinary, National Science Foundation-funded research into the text classification problem in the context of digital government.


The key scientific breakthrough led to a patent (US No. 9,275,291) issued on March 1, 2016. Texifter built free open source and commercial web-based tools for adjudicating the work of human annotators, or coders. Some coders will have deeper knowledge of the subject and some data will be inscrutable or ambiguous. Adjudication allows an expert to review the way the group labeled the items for the purpose of deciding, if possible, who was right and wrong. This systematic method of stored validation creates “gold standard” training sets and it creates a score over time of the likelihood that an individual coder will create a valid observation, in a specific language and domain.

No items found.
NSI Contributors
No items found.
Region
No items found.
National Security Topic
Outcome

Site-wide Search

Search all site content, including all NSI and SMA publications, SMA Speaker series, NSI Team member bios, services, portfolio projects, company info, and more.