Data Visualisation in Linguistics
Visualisation of toxicity in Social Networks made with Django, Highcharts and D3.js.
A research collaboration with the Mathematics and Computer Science Department at the Universitat de Barcelona, focused on building tooling to visualise and analyse toxicity in social networks from a linguistics perspective.
The project resulted in a published paper: “Visualisation of hierarchical multivariate data: Categorisation and case study on hate speech”, published in Information Visualization 22.1 (2023), pp. 31–51.
What It Does
The application allows researchers to upload linguistic datasets (in Excel/XLS format) and explore them through interactive visualisations. It provides a web-based interface for analysing toxicity patterns across social media data, with the goal of making complex hierarchical multivariate data accessible without requiring programming skills.
Tech Stack
| Layer | Technology |
|---|---|
| Backend | Python 3.8, Django 3.1.6 |
| API | Django REST Framework 3.12.2 |
| Visualisation | Highcharts, D3.js |
| Data ingestion | openpyxl, pyexcel-xls, xlrd |
| Database | SQLite |
Contributors
Developed in collaboration with Anna Puig, Inmaculada Rodríguez, Reyes Chacón, David De-La-Paz, Montserrat Nofre, and Mariona Taulé.