As a PhD student in the Pacific Northwest, I’ve seen, up close, the tension caused by balancing timber extraction and conservation of forest species. Back in 1994, when strong evidence showed the decline of spotted owl populations, the administration established a task force to develop The Northwest Forest Plan to balance the competing needs of timber harvest and the conservation of old growth forests. This was crucial because at that point, much of the Pacific Northwest's old growth forest was already logged, leaving very few patches for species that are entirely dependent on it for their lifecycle.
A critical component of that plan was monitoring these threatened and endangered species—a labor intensive process that involved field biologists driving out into the woods at night, playing audio recordings of spotted owl calls, and listening for an owl calling back. In that initial phase, biologists tracked demographics like population survival rates, growth rates, and how many young were being hatched every year. This program is now approaching 30 years of monitoring such populations.
In 2017, we began exploring the use of an emerging technology—acoustic recorders—that allowed us to start collecting acoustic data at scale in a non-invasive way. Every day the field recorders would collect four hours at dawn and four hours at dusk of sound. The problem was that you end up with this immense pile of audio data that is completely infeasible for someone to listen to from start to end. You also have the needle in the haystack problem where you have a really rare species that calls infrequently, and its recordings are somewhere in this mess of audio data.
Initially we focused on developing computational tools to detect spotted owls and other low-frequency vocalizing species. In 2020, Google developed a large-scale avian classification model called Perch. Google’s open-source model has been trained on a huge, global data set and has learned a lot about differentiating different types of bird sound.