An eight-project portfolio applying computational social science methods to armed conflict data. Each project addresses a different analytical question about the Ethiopia/Tigray conflict (2020–2022) using data from UCDP, the UN, GDELT, satellite imagery, and other sources.
The portfolio covers geospatial analysis, NLP and text-as-data, network analysis, machine learning forecasting, causal inference, satellite-based damage detection, agent-based modeling, and digital trace data collection. Together, these projects represent the core methodological toolkit of computational conflict research as practiced in 2025–2026.
| # | Project | Methods | Repository |
|---|---|---|---|
| 1 | Conflict Event Data Analysis & Geospatial Visualization | pandas, geopandas, folium, DBSCAN, matplotlib | conflict-event-analysis |
| 2 | LLM-Powered Conflict Text Analysis | BERTopic, zero-shot classification, sentence-transformers, UMAP, HDBSCAN | conflict-text-analysis |
| 3 | Conflict Actor Network Analysis | NetworkX, community detection, centrality metrics | conflict-network-analysis |
| 4 | Conflict Forecasting with Machine Learning | Random Forest, temporal cross-validation, SHAP, panel data | conflict-forecasting-ml |
| 5 | Causal Inference for Conflict with ML | Double Machine Learning, Causal Forests, EconML (PyWhy), sensitivity analysis | conflict-causal-inference |
| 6 | Satellite Imagery for Conflict Damage Assessment | Google Earth Engine, Sentinel-1 SAR, change detection, transfer learning | conflict-satellite-damage |
| 7 | Agent-Based Modeling for Conflict Dynamics | Agent-based simulation, computational modeling | conflict-abm-simulation |
| 8 | Digital Trace Data Collection | ReliefWeb API, GDELT via BigQuery, web scraping, data pipeline design | conflict-data-collection |
All eight projects are complete.
All projects use the same conflict case: the war in the Tigray region of Ethiopia, bounded by the start of hostilities in November 2020 and the Cessation of Hostilities Agreement in November 2022. The UCDP dataset contains 1,764 georeferenced events across state-based conflict, one-sided violence, and non-state conflict. Dominant regions include Tigray, Oromiya, and Amhara.
This case was selected for its analytical clarity: a defined start and end date, multiple conflict actors with distinct operational patterns, and high-quality event data.
- UCDP GED v25.1 — Uppsala Conflict Data Program Georeferenced Event Dataset (Projects 1–5, 7)
- UN Security Council meeting records — Official speeches on Ethiopia/Tigray (Project 2)
- Sentinel-1 SAR imagery via Google Earth Engine (Project 6)
- Google Open Buildings — Building footprint data for Tigray cities (Project 6)
- ReliefWeb API — UN humanitarian reports (Project 8)
- GDELT v2 via Google BigQuery — Global media event data (Project 8)
The projects build on each other in a deliberate sequence:
- Project 1 establishes data handling and spatial analysis skills
- Project 2 adds NLP, LLM-based annotation, and text-as-data methods
- Project 3 adds network thinking and relational analysis of conflict actors
- Project 4 synthesizes data from multiple sources and introduces ML pipelines for prediction
- Project 5 adds causal reasoning on top of ML, addressing the prediction-vs-explanation gap
- Project 6 adds computer vision and remote sensing for conflict damage assessment
- Project 7 introduces simulation-based approaches to model conflict dynamics
- Project 8 demonstrates data engineering: building collection pipelines from APIs, databases, and web sources
- Sundberg, R. and Melander, E. (2013). Introducing the UCDP Georeferenced Event Dataset. Journal of Peace Research, 50(4), 523–532.
- Raleigh, C. et al. (2010). Introducing ACLED: An Armed Conflict Location and Event Dataset. Journal of Peace Research, 47(5), 651–660.
- Tollefsen, A.F. et al. (2012). PRIO-GRID: A Unified Spatial Data Structure. Journal of Peace Research, 49(2), 363–374.
- Gleditsch, K.S. (2020). Advances in Data on Conflict and Dissent. In E. Deutschmann et al. (Eds.), Computational Conflict Research (pp. 23–38). Springer.
- Grimmer, J. and Stewart, B. (2013). Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts. Political Analysis, 21(3), 267–297.
- Maerz, S.F. and Puschmann, C. (2020). Text as Data for Conflict Research: A Literature Survey. In E. Deutschmann et al. (Eds.), Computational Conflict Research (pp. 43–65). Springer.
- Grootendorst, M. (2022). BERTopic: Neural Topic Modeling with a Class-Based TF-IDF Procedure. arXiv:2203.05794.
- Törnberg, P. (2025). How to Use LLMs for Text Analysis. Social Science Computer Review.
- Donnay, K. et al. (2019). Integrating Conflict Event Data. Journal of Conflict Resolution, 63(5), 1258–1282.
- Cederman, L.-E. and Gleditsch, K.S. (2009). Special Issue on Disaggregating Civil War. Journal of Conflict Resolution, 53(4), 487–495.
- Hegre, H. et al. (2019). ViEWS: A Political Violence Early-Warning System. Journal of Peace Research, 56(2), 155–174.
- Cederman, L.-E. and Weidmann, N.B. (2017). Predicting Armed Conflict: Time to Adjust Our Expectations? Science, 355(6324), 474–476.
- Hofman, J.M., Sharma, A., and Watts, D.J. (2017). Prediction and Explanation in Social Systems. Science, 355(6324), 486–488.
- Lundberg, S. and Lee, S.-I. (2017). A Unified Approach to Interpreting Model Predictions. NeurIPS.
- Chernozhukov, V. et al. (2018). Double/Debiased Machine Learning for Treatment and Structural Parameters. The Econometrics Journal, 21(1), C1–C68.
- Athey, S. and Wager, S. (2018). Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests. Journal of the American Statistical Association.
- Wager, S. (2025). Causal Inference: A Statistical Learning Approach. Stanford.
- Koch, T. et al. (2025). Deep Learning for Causal Inference: A Primer. Sociological Methods and Research.
- Law, T. and Roberto, E. (2025). Generative Multimodal Models for Social Science. Sociological Methods and Research.
- Warnke, A. and Runfola, D. (2024). Predicting Protest Locations from Satellite Imagery Using ResNet CNNs.
- Gupta, R. et al. (2019). xBD: A Dataset for Assessing Building Damage from Satellite Imagery. arXiv:1911.09296.
- Jean, N. et al. (2016). Combining Satellite Imagery and Machine Learning to Predict Poverty. Science, 353(6301), 790–794.
- Salvi, A. et al. (2020). On the Beaten Path: Violence Against Civilians and Simulated Conflict Along Road Networks. In E. Deutschmann et al. (Eds.), Computational Conflict Research (pp. 185–200). Springer.
- Cremaschi, S. et al. (2020). Generative Models and Simulation of Non-State Armed Group Attacks. In E. Deutschmann et al. (Eds.), Computational Conflict Research (pp. 163–183). Springer.
- Lazer, D. et al. (2009). Computational Social Science. Science, 323(5915), 721–723.
- Deutschmann, E. et al. (Eds.) (2020). Computational Conflict Research. Springer.
- Hox, J.J. (2017). Computational Social Science Methodology, Anyone? Methodology, 13(Supplement), 3–12.
- Cioffi-Revilla, C. (2014). Introduction to Computational Social Science. Springer.
- Python 3.11 (Anaconda)
- Jupyter Notebooks
- Libraries: pandas, geopandas, folium, scikit-learn, matplotlib, seaborn, BERTopic, sentence-transformers, NetworkX, EconML, earthengine-api
This portfolio is designed to demonstrate quantitative and computational methods for conflict research. Each repository contains Jupyter notebooks with step-by-step analysis, explanatory markdown guides, and reproducible code.