UBC Research Data

Exploring Social Interactions in Vancouver's Neighbourhood Parks Through Sentiment Analysis of Google Reviews Zhang, Daniella

Description

Understanding social cohesion in urban green spaces is essential for fostering inclusive and resilient communities. This study analyzes social sentiment in Vancouver parks using a transfer learning natural language processing (NLP) approach to classify and interpret visitor reviews. A self-attention deep learning model was trained to detect social cohesion–related sentiment and categorize it into five classes: strong negative, moderately negative, neutral, moderately positive, and strong positive. Google reviews for 180 Vancouver parks were collected and pre-processed, including tokenization and translation where needed. Binary classification was first applied to filter for reviews relevant to social cohesion, followed by fine-tuning RoBERTa, a robustly optimized BERT (Bidirectional Encoder Representations from Transformers) model that excels at understanding contextual word meaning within sentences, on a labeled dataset. This fine-tuning process adapted the model’s attention toward content associated with social cohesion themes such as “welcoming,” “family,” “crowded,” and “unsafe.” The average park sentiment score was then used as the independent variable in a generalized linear regression (GLR) model, followed by geographically weighted regression (GWR) to account for spatial variation. Twenty candidate socioeconomic variables from the 2021 Canadian census were tested as predictors of average park sentiment. Results showed that household size and median income were statistically significant positive predictors of higher park sentiment at the dissemination area (DA) scale. The findings highlight disparities in perceived social cohesion across Vancouver’s neighborhoods and reveal the social atmosphere of parks beyond Google star ratings alone, offering evidence-based insights for park planning and management. Key Words: Social cohesion, sentiment analysis, transfer learning, deep learning, geographically weighted regression, natural language processing

Item Media