Link Search Menu Expand Document

Geospatial Monitoring of Sustainable Development in Bolivia

geospatial

map

Sustainable Development

drawing

Introduction

Understanding regional development is crucial for shaping effective policies and ensuring balanced growth across a country. This article explores how sustainable development varies across Bolivia’s 339 municipalities, using geospatial analysis to uncover patterns, disparities, and trends.

By leveraging geospatial data, we aim to map, analyze, and classify municipalities based on their sustainable development performance. Our approach focuses on key spatial analysis methods —spatial dependence, inequality, and heterogeneity— to reveal regional patterns and their implications for policy prioritization.

Why This Matters

  • Data-Driven Policy Making: By identifying high-performing and underdeveloped regions, policymakers can target interventions where they are needed most.

  • Addressing Regional Disparities: Understanding spatial inequality helps in designing balanced development strategies that reduce the gap between urban and rural areas.

  • Customizing Development Strategies: Spatial heterogeneity analysis ensures that policies are tailored to the unique needs of each region, rather than applying a blanket approach.

By applying geospatial methods, we unlock deeper insights into Bolivia’s sustainable development landscape—guiding better decision-making for a more equitable future.

Key Areas of Analysis

  • Mapping the Spatial Distribution of Development The first step in our study is to visualize how sustainable development is distributed across Bolivia’s municipalities. Using geospatial tools, we create maps that show high-performing and low-performing areas, offering an intuitive view of regional disparities.

    This mapping approach helps us answer questions such as:

    • Where are the municipalities with the highest sustainable development index?
    • Are there regions that consistently show lower development levels?
    • How does population distribution correlate with development levels?
  • Spatial Dependence: Identifying Clusters and Outliers Sustainable development does not occur in isolation—neighboring municipalities often share similar characteristics due to factors like infrastructure, economic opportunities, and natural resources.

    By applying spatial dependence analysis, we identify:

    • Global clustering: Do high-development or low-development areas tend to cluster together at a national level?
    • Local clusters: Specific hot spots (high-value clusters) and cold spots (low-value clusters) where development is significantly different from surrounding areas.
    • Spatial outliers: Municipalities that stand out due to unexpectedly high or low development compared to their neighbors.
  • Spatial Inequality: Understanding Regional Disparities Not all regions develop at the same pace, leading to disparities in opportunities and living standards. Spatial inequality analysis allows us to quantify and decompose these disparities, helping policymakers pinpoint:

    • Which regions experience the highest levels of inequality.
    • Whether inequality is driven by urban-rural differences, economic activity, or infrastructure gaps.
    • How sustainable development ranks differ across municipalities and departments.
  • Spatial Heterogeneity: How Relationships Change Across Space What drives development in one region may not be the same in another. Spatial heterogeneity analysis helps us understand how different factors influence development in various parts of Bolivia.

    For example:

    • Does population size always correlate with higher development, or does this vary by region?
    • Do some municipalities benefit more from nighttime light intensity trends (a proxy for economic activity)?
    • Are certain geographical features (e.g., proximity to borders or natural resources) influencing development in unique ways?

By identifying these regional variations, we move beyond one-size-fits-all conclusions and provide insights that tailor policy recommendations to specific areas.

Dataset

To conduct this study, we rely on a georeferenced dataset that includes:

  • Municipality and Department: administrative divisions.
  • Bolivia Index Ranking: a composite ranking of municipalities based on sustainable development indicators.
  • Population 2020 and Estimated Population 2020: official and projected population figures.
  • Log Nighttime Light Per Capita (2020): satellite-based measure of economic activity and infrastructure presence.
  • Trend in Log Nighttime Light Per Capita (2020): growth trends in nighttime illumination, indicating changes in economic activity.
  • Municipal Sustainable Development Index: a key metric for evaluating municipal-level progress.

Load dataset

The data is stored in a GeoJSON file, allowing us to utilize GeoPandas for loading and processing. Additionally, we can load definition data if further context or explanation is required.

dataURL = 'map_and_data.geojson'
gdf = gpd.read_file(dataURL)

dataDefinitions = pd.read_csv('dataDefinitions.csv')
data_dict = dict(zip(dataDefinitions['Variable'], dataDefinitions['Label']))

we can store all required data into parameters for easier access.

INDICATOR1 = 'imds' # Municipal Sustainable Development Index
INDICATOR2 = 'pop2017' # Estimated population in 2017
INDICATOR3 = 'ln_t400NTLpc2017' # Trend log NTL per capita in 2017
INDICATOR4 = 'rank_imds' # Bolivia Index Ranking
INDICATOR5 = 'co2017' # Estimated carbon dioxide in 2017
ADM1 = 'dep' # Department ~ province
ADM3 = 'mun' # Municipality ~ city

gdf = gdf[[ADM1, ADM3, INDICATOR1, INDICATOR2, INDICATOR3, INDICATOR4, INDICATOR5,'geometry','COORD_X','COORD_Y']]

Exploratory data analysis (EDA)

EDA is the first step for every advanced analytics. This process will help us to understand and answer the 5W + 1H question before we expand our analytics.

Description analytics

describe.describe_data(gdf)
Table size 339 x 10
Dataframe has 10 columns.
There are 0 columns that have missing values.
  index Data Type Count Missing % missing Low value Q1 Mean Median Q3 Hi value Mode Stddev Skewness Skewness note Uniques
0 dep object 339 0 0 0 0 0 0 0 0 0 0 0 non-numeric 9
1 mun object 339 0 0 0 0 0 0 0 0 0 0 0 non-numeric 333
2 imds float64 339 0 0 35.7 47 51.05 50.5 54.85 80.2 47.5 6.77 0.59 Moderately Positively Skewed 188
3 pop2017 float64 339 0 0 661.82 6425.71 32858.8 11627.5 22497.3 1.60446e+06 661.82 117649 9.71 Highly Positively Skewed 339
4 ln_t400NTLpc2017 float64 339 0 0 7.87 12.82 13.67 13.7 14.65 16.99 7.87 1.28 -0.54 Moderately Negatively Skewed 339
5 rank_imds int32 339 0 0 1 85.5 170 170 254.5 339 1 98.01 0 Fairly Symmetric (Positive) 339
6 co2017 float64 339 0 0 400.05 402.91 403.59 403.67 404.56 406.23 400.05 1.23 -0.55 Moderately Negatively Skewed 339
7 geometry geometry 339 0 0 0 0 0 0 0 0 0 0 0 non-numeric 339
8 COORD_X float64 339 0 0 -69.48 -67.87 -66.07 -66.18 -64.62 -57.69 -69.48 2.15 0.8 Moderately Positively Skewed 339
9 COORD_Y float64 339 0 0 -22.67 -18.88 -17.45 -17.59 -16.48 -10.11 -22.67 2.35 0.75 Moderately Positively Skewed 339

Using the information above we found that:

  • There is no missing value in our dataset.
  • The dataset includes 9 departments (provinces in Indonesian terms) and 333 municipalities (cities in Indonesian terms).
  • The Municipal Sustainable Development Index (IMDS) exhibits a moderately positive skew, indicating that most values are concentrated toward the lower end of the distribution. The standard deviation is approximately 6.77, with the mean and median being nearly equal, suggesting minimal variation across rows.
  • The estimated population in 2017 (pop2017) shows a highly positive skew, which aligns with geographic differences, particularly between urban areas and large cities.
  • The trend of logarithmic nighttime light per capita in 2017 (ln_t400NTLpc2017) presents a moderately negative skew, implying that nighttime light intensity is relatively high in certain locations. However, the variation in values across rows remains small.

Boxplot and Histogram plot

To get more conprehensive insight about our distribution data, we can make a boxplot and histogram to get more clearer view how our data is.

visual.boxplot(gdf,[INDICATOR1,INDICATOR2,INDICATOR3, INDICATOR4,INDICATOR5,'COORD_X','COORD_Y'],row_col=(3,3),title="Geospatial Monitoring of Sustainable Development in Bolivia",footnote='2017')

drawing drawing

Numeric Descriptive

drawing

Boxplot Comparison

drawing

Choropleth map

drawing

Histogram distribution

drawing

Scatter plot, box and distribution (pop2017_log vs imds)

drawing

Choropleth map for imds compare to night light area

drawing

drawing

Residuals vs Fitted for Linear regression (imds vs dep, mun, pop2017, ln_t400NTLpc2017, rank_imds, co2017, pop2017_log)

drawing

QQ plot

drawing

Scale-Location Plot

drawing

Leverage plot

drawing

Exploratory spatial data analysis (ESDA) - Box plot

drawing drawing

Exploratory spatial data analysis (ESDA) - Fisher Jenks plot

drawing drawing

spatial autocorrelation multivariable

drawing

drawing

drawing

Scatter plot, box and distribution (imds vs imds lag)

drawing

Spatial Autocorrelation Global

drawing drawing drawing

Spatial Autocorrelation Local

drawing drawing drawing

Geospatial Weight Regression (GWR) analysis

drawing drawing

drawing drawing

Exploratory spatial data analysis (ESDA) - Fisher Jenks plot for GWR R2

drawing

Exploratory spatial data analysis (ESDA) - Fisher Jenks plot for GWR Slope

drawing

Exploratory spatial data analysis (ESDA) - Fisher Jenks plot for MGWR Slope

drawing