Toxicity, polarizations and cultural diversity in social networks

Using machine learning and natural language processing to analyze these phenomena in social networks

  • Abi Oppenheim
  • Federico Albanese
  • Esteban Feuerstein


Social media have increased the amount of information that people consume as well as the number of interactions between them. Nevertheless, most people tend to promote their favored narratives and hence form polarized groups [1]. This encourages polarization and extremism resulting in extreme violence [2]. Against this backdrop, it is in our interest to find environments, strategies and mechanisms that allow us to reduce toxicity on social media (defining “toxicity” as a rude, disrespectful or unreasonable comment that is likely to make people leave a discussion). We address the hypothesis that a higher cultural diversity among community users reduces the toxicity of the user messages. We use Reddit as a case study, since this platform is characterized by a variety of discussion sub-forums where users debate political and cultural issues. Using community2vec [3], we generate an embedding for each community that allows us to portray users in a demographic and ideological aspect [4]. In order to analyze each user statement, we process the data with different models, thereby obtaining which are the topics of debate and what are the levels of aggressiveness and negativism in them. Finally, we will seek to corroborate the hypothesis by analyzing the relationship between the cultural diversity present in each discussion group and the toxicity found in their posts.

Cómo citar
Oppenheim, A., Albanese, F., & Feuerstein, E. (2022). Toxicity, polarizations and cultural diversity in social networks. Memorias De Las JAIIO, 8(1), 28-29. Recuperado a partir de
AGRANDA - Simposio Argentino de Ciencia de Datos y GRANdes DAtos