New Machine-Learning Model Offers Simple Solution to Predicting Crop Yield

Sam Fernandes, left, assistant professor of agricultural statistics and quantitative genetics, and Igor Fernandes, statistics and analytics master's student, have worked to improve crop yield prediction using environmental and genetic data.
Paden Johnson/U of A System Division of Agriculture

Sam Fernandes, left, assistant professor of agricultural statistics and quantitative genetics, and Igor Fernandes, statistics and analytics master's student, have worked to improve crop yield prediction using environmental and genetic data.

A new machine-learning model for predicting crop yield using environmental data and genetic information can be used to develop new, higher-performing crop varieties.

Igor Fernandes, a statistics and analytics master's student at the U of A, entered agriculture studies with a data science background and some exposure to agronomy as an undergraduate assistant for Embrapa, the Brazilian Agricultural Research Corporation. With an outsider's perspective and a history working with environmental data through one of his former advisers, he developed a novel approach to forecasting how crop varieties will perform in the field.

His interest in the subject led to a recently published study co-authored with his adviser, Sam Fernandes, an assistant professor of agricultural statistics and quantitative genetics with the Arkansas Agricultural Experiment Station, the research arm of the U of A System Division of Agriculture.

The study, published in the Theoretical and Applied Genetics journal, is titled "Using machine learning to combine genetic and environmental data for maize grain yield predictions across multi-environment trials."

"Igor came in from statistics with no genetics background," Sam Fernandes said. "So, he had this idea that was not at all what we would use in genetics, and it was just surprising that it worked well."

Igor Fernandes' model, which focused on environmental data, led him to a close second in this year's international Genome to Fields competition. Co-authors of the study that stemmed from the competition entry included Caio Vieira, an assistant professor of soybean breeding for the experiment station, and Kaio Dias, assistant professor in the Department of General Biology at the Federal University of Viçosa in Brazil.

Environment and genetics

While the competition entry showed environmental data alone worked better than expected at predicting crop yield, the researchers saw an opportunity to build a comprehensive study that compared the novel approach to established prediction models used in genomic breeding.

Genomic breeding, a process of screening thousands of candidates for field trials based on DNA alone, can save time and resources needed to develop a new plant variety, such as growing better in drought conditions. An important part of genomic breeding involves genomic prediction to estimate a plant's yield using its DNA.

"Let's say you have thousands of candidates, and you get the DNA from all of them," Sam Fernandes explains. "Based on the DNA along with information from previous field trials, you are able to tell which one will be the highest yielding without planting it in the field. So, you're saving resources that way. This is genomic prediction."

Adding information into a model on how that plant would interact with environmental conditions increases the accuracy of the genomic prediction and is becoming more common as more environmental data from testing centers becomes available. The practice is called "enviromics." Still, there is no consensus on the best machine-learning approach to combine environmental and genetic data.

"One advantage of including the environment information in the models is that you can address what we call genotype-by-environmental interaction," Sam Fernandes said. "Since the environment does not affect all of the individuals in the same way, we try to account for all of that, so we are able to select the best individual. And the best individual can be different depending on the place and season."

The study used the same data on corn plots from the Genomes to Fields Initiative that were used in the competition, but the researchers adjusted inputs as genetic, environmental or a combination of both in "additive" and "multiplicative" manners. When including environmental and genetic data in a more straightforward "additive" manner, the prediction accuracy was better than the more complicated "multiplicative" manner. 

The simpler model took less time for the computer to process, and the mean prediction accuracy improved 7 percent over the established model. The experiment was validated in three scenarios typically encountered in plant breeding.

"One of the unique things that Igor did is how he processed the environmental data," Sam Fernandes said. "There are fancier models that people can throw in all sorts of information. But what Igor did is a simple, yet efficient way of combining the genetic and environmental data using feature engineering to process the information and get a summary of variables that is more informative."

Collectively, the researchers say the results are promising, especially with the increasing interest in combining environmental features and genetic data for prediction purposes. Their immediate goal is to apply it to increase the capability of screening genotypes for field trials.

To learn more about the Division of Agriculture research, visit the Arkansas Agricultural Experiment Station website.Follow us on X at @ArkAgResearch, subscribe to the Food, Farms and Forests podcast and sign up for our monthly newsletter, the Arkansas Agricultural Research Report. To learn more about the Division of Agriculture, visit uada.edu. Follow us on X at @AgInArk. To learn about extension programs in Arkansas, contact your local Cooperative Extension Service agent or visit uaex.uada.edu.


About the Division of Agriculture: The University of Arkansas System Division of Agriculture's mission is to strengthen agriculture, communities, and families by connecting trusted research to the adoption of best practices. Through the Agricultural Experiment Station and the Cooperative Extension Service, the Division of Agriculture conducts research and extension work within the nation's historic land grant education system. The Division of Agriculture is one of 20 entities within the University of Arkansas System. It has offices in all 75 counties in Arkansas and faculty on five system campuses. The University of Arkansas System Division of Agriculture offers all its Extension and Research programs and services without regard to race, color, sex, gender identity, sexual orientation, national origin, religion, age, disability, marital or veteran status, genetic information, or any other legally protected status, and is an Affirmative Action/Equal Opportunity Employer.

Headlines

Honors College Lecture to Demystify Cancer and Chronic Disease Research

In his upcoming public lecture, professor Tim Muldoon will take a closer look at cancer and chronic disease research and their influence on healthcare. 

Ag Business Alumna Maloch Promoted to USDA Senior Staff Position

Victoria Maloch, an honors graduate from the U of A and Bumpers College, has been promoted by the U.S. Department of Agriculture to a senior staff position in Washington, D.C.

Risk Taker: U of A Alumna Shares Startup Journey

Intrigued by the startup scene percolating in Northwest Arkansas, Bhavya Patel uprooted her life and switched career paths in 2019 when she moved from Little Rock to Fayetteville.

Editor of Nation's Premier Public Health Journal to Speak on History of Public Health

Dr. Alfredo Morabia, editor for The American Journal of Public Health, will speak on "The Public Health Approach: Population Thinking from the Black Death to COVID-19" at 6 p.m. Oct. 17 in Giffels Auditorium.

Student Success Event: Virtual Reality at UARK Sept. 24

Presentations about the use of virtual reality at the U of A will be given from 11:30 a.m. to 1 p.m. Wednesday, Sept. 24, in the Cordia Harington Center for Excellence room 349.

News Daily