I am a Senior Software Engineer at RSE (Research Software Engineering) Team in University of Newcastle. I am leading a team of four RSEs and we work on building scalable geospatial data pipelines and tools to support research in public health, urban planning, and environmental science in Imago (Imagery Smart Data Service) project, part of the Smart Data Research UK programme funded by the ESRC. Our work involves developing reproducible workflows for processing large geospatial datasets in high-performance computer (Comet), integrating diverse data sources, and creating user-friendly interface (Data Catalogue) for data exploration and analysis. We collaborate closely with researchers to understand their needs and deliver tailored solutions that facilitate cutting-edge research.
Before joing Newcastle University, I worked as GIS Data Scientist on MAGENTA (Maternal And preGnancy hEalth aNd elevaTed heArt) project at University of Swansea, funded by the Wellcome Trust. I leaded the WP1 (Environmental exposure modelling for Wales and London) , and developed heat-exposure model at 1x1km resolution and managed environmental data pipelines for the UK. My work involved integrating satellite data, weather station data, and land-use information to create high-resolution exposure maps. I collaborated with epidemiologists to analyze the impact of heat exposure on maternal and pregnancy health outcomes, contributing to several publications in high-impact journals.
I completed my PhD in Environmental Epidemiology at Swiss TPH (Swiss Tropical and Public Health Institute), affiliated with the University of Basel, where I worked on the PoCHAS (Effects of Airborne Pollen on Cardiorespiratory Health and Allergic Symptoms) project. My work focused on integrating machine learning with environmental data to develop the first spatio-temporal model for predicting pollen concentrations at a 1 × 1 km scale across Switzerland.
I also have industrial experience working as a Geospatial Data Scientist at CollectiveCrunch oy., where I developed geospatial data processing pipelines and tools for remote sensing applications. My work involved processing large satellite datasets to create national level cloudless mosaics, developing algorithms for forest damage detection. I also worked in Regio OÜ as a GIS specialist, where I developed an automated process to generate internet connections to houses using GIS data, improving the efficiency of the planning process for network expansion projects.
I have a strong background in geospatial data science, with expertise in remote sensing, GIS, and spatial statistics. I am proficient in programming languages such as Python and R, and have experience working with big data technologies such as Hadoop and Spark. I am passionate about using data to solve real-world problems and am always looking for new challenges and opportunities to learn and grow. I design and operate scalable cloud infrastructure on Azure and AWS, including VMs/EC2, Blob storage / S3, managed databases, Kubernetes (AKS/EKS), and serverless services (Azure Functions / Lambda). I build reproducible geospatial data workflows (Prefect/Airflow), containers and CI/CD and integrate these with on‑prem HPC to enable reliable, scalable processing and delivery of research outputs.