Thursday, November 24, 2022
AI for Earth
Remote Sensing
Object detection
Deep Learning
Challenge results

Can AI track reforestation projects using drone and satellite data?

Satellite and drone data were used to monitor tree coverage of Justdiggit re-greening projects in Tanzania and Kenya and to measure how efficiently those projects capture carbon. To help fight climate change, three teams of AI engineers came together to build and implement machine learning models!

Introduction

Can AI help save trees and forests? The FruitPunch AI community teamed up with Justdiggit to track the progress of re-greening projects in Tanzania and Kenya. To help fight climate change by improving carbon capture efficiency, three teams of AI engineers came together to build and implement machine learning models on drone and satellite data. Their goal was to estimate the tree count and tree cover in project areas on the African continent. To watch the one-hour final presentation, have a look here!

Let’s preserve some trees!

What data was at our disposal? 

The team set out to create a machine learning model for automated tree detection and segmentation on drone and satellite imagery, to help Justdiggit and the Free University of Amsterdam count individual trees. For this, we made use of both satellite and drone data.

Drone data included both RGB imagery and digital surface model (DSM) data. From the DSM data we created height maps that could be used to identify individual trees. Drone images are perfect for this job but can be expensive and difficult to gather frequently. Satellite data allows scaling up the analysis, but its resolution is roughly 100 times lower than that of drone imagery.

The dataset that the team worked with consisted of 41 large TIFF images. After extracting patches of 256x256 pixels, we obtained a set of 7,595 training, 1,085 validation, and 2,170 test images with corresponding binary masks.
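For illustration, here is a minimal sketch of how such patches can be cut from a large GeoTIFF with rasterio; file names are hypothetical, and this is not necessarily the team's exact pipeline:

```python
# Minimal sketch: tile a large GeoTIFF into 256x256 patches with rasterio.
import rasterio
from rasterio.windows import Window

PATCH = 256

def extract_patches(tif_path):
    """Yield (col_off, row_off, patch_array) for every full 256x256 window."""
    with rasterio.open(tif_path) as src:
        for row in range(0, src.height - PATCH + 1, PATCH):
            for col in range(0, src.width - PATCH + 1, PATCH):
                window = Window(col, row, PATCH, PATCH)
                yield col, row, src.read(window=window)  # (bands, 256, 256)

# Usage (hypothetical file name):
# for col, row, patch in extract_patches("ortho_tile_01.tif"):
#     ...  # save the patch and the matching crop of its binary mask
```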

To fulfill the goals of this Challenge, three teams were formed, each with its own task:

  1. Data Enrichment - Work closely with our annotation partner Cfru.it to label more trees, perform quality assurance, and build the pre-processing pipeline.
  2. Drone Modelling - Build a tree detection model to count individual trees and estimate biomass.
  3. Satellite Conversion - Augment the drone data in such a way that models could be trained that also work for automatic tree detection on satellite images.
Figure 1: Tasks within the team and collaborations among the teams.

The drone subteam proposed two different methods:

  1. Transfer learning with the DeepForest model
  2. A U-net approach with RGB-only, height-only, and combined inputs, respectively.

The satellite subteam also proposed two methods:

  1. Unsupervised clustering (K-means) on satellite data
  2. Converting drone annotations to SHP and TIFF files using GDAL (see the sketch below)
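For the second method, a minimal sketch of the conversion with GDAL's Python bindings; the file names and the 0.5 m resolution are illustrative assumptions:

```python
# Hedged sketch: convert drone annotations (GeoJSON) to a shapefile,
# then rasterize them into a GeoTIFF mask.
from osgeo import gdal

# GeoJSON -> ESRI Shapefile
gdal.VectorTranslate("labels.shp", "labels.geojson", format="ESRI Shapefile")

# Shapefile -> binary raster mask at an assumed 0.5 m/pixel resolution
gdal.Rasterize(
    "labels.tif", "labels.shp",
    xRes=0.5, yRes=0.5,   # target pixel size (assumption)
    burnValues=[1],       # write 1 wherever a tree polygon covers the pixel
    outputType=gdal.GDT_Byte,
)
```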

Data Enrichment

At the start of the Challenge, the team soon came to the conclusion that there weren't enough labels to train accurate models. To create a dataset large enough to reach the goal, we partnered with the labelling company Cfru.it. With their help, the Data team filled in the missing annotations and ensured that all labels were accurate.

Drone Team 

Transfer Learning with DeepForest

Figure 2: Results concerning DeepForest model.

The DeepForest approach is based on transfer learning from a RetinaNet model. The only pre-processing step required here was to transform the .tif files and the annotation files (.geojson) to match the requirements of the DeepForest package.

DeepForest provides good documentation and tutorials on how to improve the predictions. Moreover, the package contains a function to evaluate the model on new data. The evaluation metrics used are box-precision and box-recall. By default, the two metrics count as true positives those predictions that have an intersection-over-union (IoU) score of at least 40% with a label. The F1 score (the harmonic mean of precision and recall) was adopted to combine the two metrics into a single number on which to base training decisions.
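A minimal sketch of this workflow with the deepforest Python package (version 1.x assumed; paths hypothetical), including the evaluation described above:

```python
# Hedged sketch of the DeepForest transfer-learning workflow (deepforest 1.x assumed).
from deepforest import main

model = main.deepforest()
model.use_release()  # RetinaNet weights pre-trained on the NEON tree-crown dataset

# Fine-tune on our own annotations (CSV: image_path, xmin, ymin, xmax, ymax, label)
model.config["train"]["csv_file"] = "train_annotations.csv"  # hypothetical path
model.config["train"]["root_dir"] = "train_images/"
model.config["train"]["epochs"] = 5
model.create_trainer()
model.trainer.fit(model)

# Predict on a large tile, split into patches of a chosen size
boxes = model.predict_tile("tile.tif", patch_size=600, patch_overlap=0.25)

# Box precision/recall at IoU >= 0.4, combined into F1 as described above
results = model.evaluate("val_annotations.csv", "val_images/", iou_threshold=0.4)
p, r = results["box_precision"], results["box_recall"]
f1 = 2 * p * r / (p + r)
```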

Figure 3: The blue boxes are the predictions from the model while the orange boxes are the ground-truth. When the two boxes overlap they almost look green.

DeepForest recommends tuning the patch size to get better predictions. The patch size is the dimension, in pixels, into which a larger image is split before performing predictions:

Figure 4: Evolution of metrics with the different patch sizes.

The team decided to further fine-tune both models on new data, one with a patch size of 600 and one with a patch size of 1200, and observe which produced better results. A patch size of 600 is the last one that improves both recall and precision at the same time; for bigger patch sizes the recall decreases. The 1200 patch size produces the best F1 score but reaches a lower recall, perhaps indicating an inability to detect smaller trees. That patch sizes bigger than the default work better is no surprise, since the DeepForest model was trained on images with a lower resolution. However, the F1 score does not reach satisfactory levels, hence further training on new data is necessary to obtain a better model.

The loss is influenced by the patch size, hence it is best to compare it only among models with a similar patch size. Overall, a learning rate of 0.001 leads to the best training runs. The batch size had no noticeable influence on the outcome of the training. More epochs do not cause overfitting and might slightly improve the results, but most of the gains happen in the first 5 epochs.

Two models were trained for comparison, one with a patch size of 600 and the other with a patch size of 1200. To compare the two models we use the F1 score. However, the F1 score varies with the score threshold: the minimum probability the model must assign to a predicted box for it to count as a valid prediction.

The default score threshold in the DeepForest model is 0.1, but it can be modified. We tried thresholds between 0.1 and 0.9, in steps of 0.1, on both models. Even though the differences with the best 1200-patch-size model remain minimal, the best F1 score, 54%, is obtained by the 600-patch-size model with a score threshold of 0.3. This is therefore the model of choice!
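The sweep itself is simple; a hedged sketch reusing the fine-tuned model from above (paths hypothetical):

```python
# Hedged sketch: sweep the score threshold from 0.1 to 0.9 and keep the best F1.
import numpy as np

best_thresh, best_f1 = None, -1.0
for thresh in np.arange(0.1, 1.0, 0.1):
    model.config["score_thresh"] = float(thresh)  # minimum confidence for a valid box
    res = model.evaluate("val_annotations.csv", "val_images/", iou_threshold=0.4)
    p, r = res["box_precision"], res["box_recall"]
    f1 = 2 * p * r / (p + r + 1e-9)  # small epsilon avoids division by zero
    if f1 > best_f1:
        best_thresh, best_f1 = float(thresh), f1

print(f"best threshold: {best_thresh:.1f}, F1: {best_f1:.2f}")
```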

The table below illustrates the performance of the optimal DeepForest model on the validation and test sets:

Table 1: Metrics for the DeepForest model

Object-segmentation with U-net

The U-net is a type of neural network architecture developed for image segmentation purposes at the University of Freiburg. The main idea is to perform the classic set of convolutions, coupled with activation functions and max-pooling, to achieve dimensionality reduction of the features (referred to as the contracting pathway), and then perform a symmetric sequence of inverse operations (the expanding pathway).
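To make this concrete, here is a minimal sketch of the U-net idea in PyTorch; depth and channel sizes are illustrative, not the team's exact model:

```python
# Minimal U-net sketch: contracting pathway, bottleneck, expanding pathway with skips.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.ReLU(inplace=True),
    )

class TinyUNet(nn.Module):
    def __init__(self, in_ch=3, out_ch=1):
        super().__init__()
        self.enc1, self.enc2 = conv_block(in_ch, 64), conv_block(64, 128)
        self.pool = nn.MaxPool2d(2)
        self.bottleneck = conv_block(128, 256)
        self.up2 = nn.ConvTranspose2d(256, 128, 2, stride=2)
        self.dec2 = conv_block(256, 128)
        self.up1 = nn.ConvTranspose2d(128, 64, 2, stride=2)
        self.dec1 = conv_block(128, 64)
        self.head = nn.Conv2d(64, out_ch, 1)

    def forward(self, x):
        e1 = self.enc1(x)                  # contracting pathway
        e2 = self.enc2(self.pool(e1))
        b = self.bottleneck(self.pool(e2))
        # Expanding pathway: upsample and concatenate the matching encoder features.
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))
        return self.head(d1)               # per-pixel logits for the binary mask
```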

There were essentially three paths to be explored in the U-net approach:

  • A model taking RGB images, initialized with a ResNet pre-trained on ImageNet data
  • A model taking digital surface model (DSM) data as input
  • A model taking both RGB and DSM inputs
Figure 6: U-net architecture
Figure 7: Predicted masks of the three models: DSM, RGB, and DSM+RGB
Figure 8: More results for the RGB method.

The RGB-only model's predictions are the most accurate, whereas the RGB+DSM model is more conservative in labeling pixels as trees.

Figure 10: Performance of the three U-net models on the test set 

Despite the expectation that the RGB+DSM model would perform much better than the other models, it achieved similar results to the RGB-only model. 

The RGB-only model's weights were initialized with the values from a pre-trained ResNet34 model, which constituted a much better starting point than the randomly initialized weights of the RGB+DSM model.
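As an illustration (not necessarily the team's exact setup), this initialization difference is easy to reproduce with the segmentation_models_pytorch library:

```python
# Hedged illustration with segmentation_models_pytorch (smp); not the team's exact code.
import segmentation_models_pytorch as smp

# RGB-only: the ResNet34 encoder starts from ImageNet weights.
rgb_model = smp.Unet(
    encoder_name="resnet34",
    encoder_weights="imagenet",
    in_channels=3,   # R, G, B
    classes=1,       # binary tree mask
)

# RGB+DSM: a 4-channel input means the first convolution cannot simply reuse
# the 3-channel ImageNet weights, weakening the benefit of pre-training.
rgb_dsm_model = smp.Unet(
    encoder_name="resnet34",
    encoder_weights="imagenet",
    in_channels=4,   # R, G, B + DSM height
    classes=1,
)
```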

Satellite Team

K-Means Unsupervised Classification

A K-means clustering method was applied to satellite images at the pixel level. The team used 8 clusters, which allowed different types of terrain and vegetation to be detected neatly. Figure 11 shows the original image (left, using only the red, green, and near-infrared bands) and the binary image (right) obtained by merging two sub-clusters. The results are visually comparable to human labelling.

Figure 11: Left - original satellite image (R-G-NIR bands); Right - K-means clustering at the pixel level for the satellite image
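A minimal sketch of this per-pixel clustering with scikit-learn; the indices of the "tree" sub-clusters are an assumption (in practice they are picked by inspection):

```python
# Minimal sketch: per-pixel K-means on a 3-band (R, G, NIR) satellite image.
import numpy as np
from sklearn.cluster import KMeans

def cluster_to_mask(img, n_clusters=8, tree_clusters=(2, 5)):
    """img: (H, W, C) array. Returns a binary mask of the chosen 'tree' clusters."""
    h, w, c = img.shape
    labels = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(
        img.reshape(-1, c).astype(np.float32)
    )
    # Merge the hand-picked sub-clusters (indices are hypothetical) into one mask.
    return np.isin(labels.reshape(h, w), tree_clusters)
```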

Degrading drone images into satellite images

As stated before, drone images can become very expensive rather quickly. To reduce costs and increase land coverage, the next logical step is to move to satellite imagery. Satellites image the entire Earth every day and are relatively cheap compared to drones.

The only downside is that image quality is drastically reduced. Resolution ranges from 10-20 meters per pixel for free services (Sentinel, Landsat) to 50 cm per pixel for paid services. For this Challenge, participants had the privilege of working with data from one of our partners, Planet. They deliver high-quality images and were able to provide daily coverage of our area of interest!

Registration using keypoint detection

The drone and satellite data did not align automatically, because drones and satellites use different coordinate reference systems.

Using automatic keypoint detection based on SIFT features on a resampled drone image and a satellite image (Figure 12), we can match the keypoints and then warp the satellite image to align the two. Figure 13 shows that labelling from drone images transfers well to satellite images through this registration technique.

Figure 12: Detected keypoints.
Figure 13: (Left) Labels on Drone data (resampled to match satellite resolution). (Middle) Labels on satellite data (shift between the resampled drone and satellite images causing some trees to end up on the roofs). (Right) Labels on calibrated satellite data (obtained by geo-referencing the image).
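A minimal sketch of this registration with OpenCV (SIFT is included in opencv-python >= 4.4); the 0.75 ratio-test value is a common default, not necessarily the team's choice:

```python
# Hedged sketch: SIFT keypoints + RANSAC homography to warp satellite onto drone.
import cv2
import numpy as np

def register_satellite_to_drone(sat_img, drone_img):
    sift = cv2.SIFT_create()
    kp_d, des_d = sift.detectAndCompute(drone_img, None)
    kp_s, des_s = sift.detectAndCompute(sat_img, None)

    # Lowe's ratio test keeps only distinctive matches.
    matcher = cv2.BFMatcher()
    good = [m for m, n in matcher.knnMatch(des_s, des_d, k=2)
            if m.distance < 0.75 * n.distance]

    src = np.float32([kp_s[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp_d[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)

    h, w = drone_img.shape[:2]
    return cv2.warpPerspective(sat_img, H, (w, h))  # satellite in the drone frame
```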

Registration using Gaussian blurring

According to a research article on sheep counting, drone images can be aligned with satellite images in two steps:

  • “Bicubic downsampling to reach the target ground sample distance” (0.05 m => 0.5 m);
  • “Blurring using a point spread function kernel calculated to match the simulated satellite aperture”. 

Obtaining the point spread function for our satellite images is rather difficult because the satellite's telescope is of the Cassegrain type. The team chose to start with simple Gaussian blurring (which approximates the PSF of a circular telescope).
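A minimal sketch of this degradation pipeline with OpenCV, following the order shown in Figure 14 (blur, then resample); the blur sigma is an assumption, not a calibrated PSF:

```python
# Hedged sketch: approximate a satellite view of a drone image (blur, then downsample).
import cv2

def drone_to_pseudo_satellite(drone_img, src_gsd=0.05, dst_gsd=0.5, sigma=2.0):
    """src_gsd/dst_gsd are metres per pixel (0.05 m -> 0.5 m, i.e. 10x coarser)."""
    blurred = cv2.GaussianBlur(drone_img, (0, 0), sigmaX=sigma)  # stand-in for the PSF
    scale = src_gsd / dst_gsd
    return cv2.resize(blurred, None, fx=scale, fy=scale,
                      interpolation=cv2.INTER_CUBIC)  # bicubic resampling
```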

Figure 14: Using Gaussian blurring to register and resample a drone image to satellite resolution. (Left) Initial drone image; (Middle) after Gaussian blur; (Right) after resampling (3000 px => 300 px).

Figure 15: Comparison of Original Satellite Image (Left) and Processed Drone Image (Right).

We can see that the processed drone image is still very different from the satellite image. However, it is unclear whether the satellite image is from the same season (zooming in shows that some trees have no leaves in the drone image), and additional work on colors and contrast might improve the matching. Besides, satellite images undergo a lot of postprocessing, such as pansharpening, that is hard to reproduce on a drone image.

There is no quantitative evaluation of this work yet; all evaluations were done qualitatively. Registration using keypoint alignment looks more promising to the human eye. Unsupervised clustering performs surprisingly well for satellite image segmentation.

Which models and data are the best to fight deforestation?

The DeepForest model is a great tool for simple transfer learning. The package provides a good framework to pre-process and post-process the individual tiles.

Transfer learning seems to work best with a learning rate of 0.001, and fewer than 10 epochs are enough for good fine-tuning. The effect of different patch sizes may differ before and after the extra training, but has only a limited impact after some training. Changing the score threshold can also help improve the model.

Overall, the results are encouraging, with an F1 score of 58% on the test set and a MAPE of 32%.

Future research could address some shortcomings (such as struggles with dry or burnt trees and with big trees), for example by using Google Cloud data for training. Future researchers could also create two separate models: one for bigger trees and one for smaller trees. At the same time, operators could keep the drone at the same height while collecting footage, or data scientists could automatically adjust the patch size based on the height at which the drone collected the images.

The U-net models do not seem to be the right fit for the available data. However, there are a few ideas on how to improve these results in the future, like hyperparameter tuning and incorporating the NEON data into the training dataset. 

There is room for improvement both for the DeepForest model and for the U-net ones, but the biggest incremental gains over this work would likely come from data enrichment rather than from improving the existing models. For instance, the normalised difference vegetation index (NDVI) could be used as an input to one of the models. Researchers could use the NEON dataset to pre-train any model besides DeepForest (which was already pre-trained on this dataset). Pre-training on the NEON dataset should then be followed by a second phase of training/fine-tuning on our own data.
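Adding NDVI as an input channel is cheap; a minimal sketch, assuming the red and near-infrared bands are already loaded as numpy arrays:

```python
# Minimal sketch: NDVI = (NIR - Red) / (NIR + Red), a standard vegetation index.
import numpy as np

def ndvi(nir, red, eps=1e-6):
    """Returns NDVI in [-1, 1]; eps guards against division by zero."""
    nir = nir.astype(np.float32)
    red = red.astype(np.float32)
    return (nir - red) / (nir + red + eps)

# An NDVI channel could then be stacked onto RGB(+DSM) as extra model input:
# x = np.dstack([rgb, ndvi(nir_band, red_band)])
```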

For satellite images, automatic keypoint detection seems to produce the most promising visual results. Based on the current progress, the next step is to apply segmentation models to the registered drone and satellite pairs, using the labels from the drone images.

As is often the case, most of the work lies in obtaining data of the proper quality. Here the goal was to find a way to use labels from a model trained on one image source (drone) on data from a different source (satellite). It was interesting to try out different methods to bridge the gap between the two sources. There is definitely a lot more work to do on this specific task, and hopefully FruitPunch has provided Justdiggit with an idea of what to do next!

The AI for Trees Challenge was a unique opportunity to learn more about object segmentation, clustering, and computer vision techniques, besides learning how to work with drone and satellite data and about the differences between the two. It was a pleasure to work with such a diverse and dedicated team!

Authors: Sara Nóbrega, Weiwei Zong

Participants AI for Trees Challenge: Alexandra Smith, Lee Dudek, Melanie Arp, Tim Broadhurst, Natalia Skaczkowska-Drabczyk, Minaraj Sai, John Nshimyumukiza, Michele Sergio Pozzi, John Lister, Luis Blanche, Sri Aravind, Weiwei Zong
