An ambitious project that deploys big data and uses machine learning to understand the ecological impacts of hydropower dams in the Amazon Basin started in a mundane enough setting: on the sidelines at youth baseball games.
Conversations initially sparked when they were parents at local games ultimately led Alex Flecker, professor of ecology and evolutionary biology in the College of Agriculture and Life Sciences, and Carla Gomes, professor of computer science and director of the Institute for Computational Sustainability, to a collaborative effort to solve problems facing one of the most biodiverse areas of the world.
Using a seed grant from the Atkinson Center for a Sustainable Future and additional support from the National Science Foundation’s Expeditions in Computing initiative, the researchers are collaborating across disciplines – from hands-in-the-dirt ecology to the computing power of networks – crunching big data related to Amazonian rivers and waterways as they evaluate the cumulative economic and environmental impacts of dams.
Your areas of expertise seem, on the surface, to be far apart. How did this collaboration take shape?
Flecker: It all comes back to baseball. Ten years ago, we both had kids who played baseball together. We would sit on the sidelines and talk about how it would be great to do something together. I’m an ecologist, and a lot of the work that I do involves tropical rivers in South America. For the past six years, I’ve been working on a project exploring vulnerabilities of species to climate change, and spending a lot of time in the Andes and Amazon regions, especially Ecuador.
It’s been striking to see how many hydropower sites are in different stages of development. You look on a map, and there are literally hundreds of dots with proposed sites for dams in the Amazon region. But these are also places in which there is very, very little on-the-ground data available. So what we started thinking, with all of these dots on the map, [was] some of these have to be better locations than others. The question becomes, in the absence of a lot of information, how do you enter that conversation? And that’s where Carla has been pivotal.
Gomes: I’m a computer scientist, and I’m passionate about using the advancement of computer science to really impact the world. The opportunity to work with a real ecologist is just fascinating to me. Computer science is permeating all kinds of disciplines and is transforming the way our society functions, including in areas of business, on Wall Street. So why not use computational thinking to solve challenges in sustainability? With the Institute for Computational Sustainability, we are trying to have impact and think of nature in the same way major companies are using computer-science tools to optimize their own enterprises. We want to do the same thing for natural resources.
Dealing with something as big and complex as the Amazon, where do you begin?
Flecker: Dams are usually planned project by project in terms of local impact rather than in some integrated fashion. If you plan dam by dam, you’re always going to get suboptimal solutions. So that’s what we started thinking about. My work centers on what are called ecosystems services. These are the benefits that ecosystems provide that people really care about. In the Amazon, biodiversity and river fisheries are hugely important in people’s lives. So understanding connectivity – the length of free-flowing rivers – is important for things like fish migrations.
Another ecosystem service is navigable rivers. There are places higher up in the Andes with some of the best and most important whitewater rafting rivers. Locally, they become really important to recreation and the economy. Other aspects, like nutrient cycling and sediment flows, are incredibly important for the way that the rivers structure floodplains. How and where dams are constructed are huge questions that impact ecosystem services, but the Amazon is a region where not much data exists, if at all.
Gomes: Our role has been to think about how to articulate the problem we want to solve and then model it. For example, start with a very basic tradeoff: having dams producing electricity is a positive, but that is going to negatively impact connectivity along the river. But how do we understand that problem in the absence of a lot of specific information? Can we use big data, for example from remote sensing, to infer valuable information for the Amazon Basin?
Flecker: We want to see if we can at least estimate some of these things within these different projects. To do this right is computationally complex. Carla and her group are thinking about sustainability issues that require a lot of computing power. A lot of people, when you talk about collaborating, they say “oh, that’s too complicated; that’s going to be really hard.” What’s great about Carla is that, yes it’s complicated, and that’s why it’s exciting. It’s a really different mindset.
Gomes: Computer science has developed tools and models to design big computer networks, to design the grid. When you look at river connectivity, we leverage those advances to reflect the reality of the Amazon. We found an example of a set of dams that disrupts major connectivity, but their placement does not contribute much globally in terms of electricity generation. That’s what makes this so exciting: We are trying to reason at a global scale.
What have you learned from each other?
Gomes: As a computer scientist, I know our limitations. We know very little about ecology, hydrology, and so it’s key that we team up with the right experts. I only team up on a project if I have access to the highest level of expertise in the field. I want us to really connect with the real world in a realistic way, informed by experts. Alex has been incredible sharing his own expertise and bringing together all kinds of expertise. It’s really awesome.
Flecker: One thing you learn from Carla is, don’t be afraid of thinking of problems that, in our group, would have been impossible to work through due to the huge amount of raw data. With Carla, we can combine vast quantities of unprocessed data from many different sources, including global data collection systems, that are valuable for addressing complex sustainability problems. It’s been incredibly inspiring.
Gomes: We basically reinforce each other. I’m in awe of how Alex gets hydrologists, sediment specialists, all of these experts on board. We keep pushing each other. We are both very positive.
Flecker: We are having fun. You want to do things that are important but you also want to have fun while you’re doing them. This collaboration has allowed us to forge research relationships across Cornell, and even outside of it. The challenge is too great to be done by just two people. We have a group of four to five faculty, grad students and postdocs, and every Friday afternoon we hold conversations as a group that go for hours.
Gomes: We have ways to approach these problems on a large scale. When we first started, we began thinking about only one region, and then I said, ‘Why not go to the entire Amazon?’
Flecker: That’s the perfect example. To me, focusing on one region of the Amazon seemed computationally difficult enough. Yet, when they did their first analysis, they ran it in a tenth of a second.
Gomes: “A little bit of knowledge can be dangerous.” That’s a saying, right? I’m Portuguese, and for a non-native speaker, getting the idioms right is the toughest. Often people know these problems are computationally hard, so they say, “We don’t know what to do. These problems are too hard.” So often, researchers in these fields only use basic heuristics or indicators to prioritize options one by one, without a global perspective. In contrast, we can optimize the entire problem.
Our approach to global optimization and modeling of environmental challenges is similar to the way companies such as Amazon use computational models to obtain near-perfect efficiency of the full supply chain. It’s quite remarkable, with a little computational thinking and computational power, how we can now find the optimal solution in about 10 minutes. With Alex, I said, going to the entire Amazon Basin, I think we can do it. That’s what excites me: the scale. I say let’s go big. We are going big; I love that (laughs).
Alex, how has Carla’s approach changed what you do?
Flecker: It has changed, in dramatic fashion, the whole scale in which we work. It opens us up to think much differently in spatial scales, and temporal scales as well. Exposure to computational sustainability has revealed for me, dramatically, the power of machine learning and the optimization possibilities. The complexity of the problem, what we consider to be the bottlenecks, are not on the computational side. The challenge is finding the data we can interface with.
Gomes: In the U.S., we have a ton of detailed environmental data, sophisticated maps, et cetera, to use. For the Amazon, our data is far less detailed. We therefore need to be creative.
We are using computational methods – in particular, machine learning – to infer information from remote sensing. For example, Landsat is a satellite that has been orbiting Earth for over 40 years. The challenge is how to transform the vast amounts of remote sensing data from Landsat and other sources into meaningful information concerning ecosystem services such as river connectivity, nutrient and sediment flows, and fish habitat, to incorporate into our decision-making models. Our overall goal is to understand how different configurations of hydropower dams in the Amazon impact ecosystem services.
These problems are fascinating and truly challenging. Moreover, by studying these real-world problems, we are also advancing computer science. The sustainability challenges force us to develop new computational methods that can then also be applied to other domains. … Computational sustainability is a wonderful domain for computer scientists to have a significant positive impact in the world.
Flecker: The reason for doing this is to have an impact. For me, personally, there are incredibly special places that are really remote in the Amazon, and you realize that there are proposals to put dams there. Ultimately, those are very complex societal decisions. We want to know, aren’t there better places where you can generate the same amount of power? We think we can contribute and say: These are tradeoffs, and there are other, better places to do this, and here’s where they are. This is a long-term process, but we know we can have an impact.
Matt Hayes is managing editor and social media officer for the College of Agriculture and Life Sciences.