Data Visualisation in Linguistics

Visualisation of toxicity in Social Networks made with Django, Highcharts and D3.js.

A research collaboration with the Mathematics and Computer Science Department at the Universitat de Barcelona, focused on building tooling to visualise and analyse toxicity in social networks from a linguistics perspective.

The project resulted in a published paper: “Visualisation of hierarchical multivariate data: Categorisation and case study on hate speech”, published in Information Visualization 22.1 (2023), pp. 31–51.

What It Does

The application allows researchers to upload linguistic datasets (in Excel/XLS format) and explore them through interactive visualisations. It provides a web-based interface for analysing toxicity patterns across social media data, with the goal of making complex hierarchical multivariate data accessible without requiring programming skills.

Tech Stack

Layer Technology
Backend Python 3.8, Django 3.1.6
API Django REST Framework 3.12.2
Visualisation Highcharts, D3.js
Data ingestion openpyxl, pyexcel-xls, xlrd
Database SQLite

Contributors

Developed in collaboration with Anna Puig, Inmaculada Rodríguez, Reyes Chacón, David De-La-Paz, Montserrat Nofre, and Mariona Taulé.