Is There a Ceiling for Gains in Machine-Learned Arbitrage?
An analysis of stock returns tests the limits of A.I.
Is There a Ceiling for Gains in Machine-Learned Arbitrage?Chanelle Nibbelink
In times of crisis—ahead of a big storm, in the wake of a natural disaster, during a pandemic—consumers often resort to panic buying. When the rise in COVID-19 cases led to lockdowns in early 2020, online retailers experiencing a surge in demand began to place much larger orders for a range of products, which contributed to a massive supply-chain shock around the globe.
Artificial-intelligence inventory managers might have helped prevent that outcome. Alibaba Group’s Jiaxi Liu, Shuyi Lin, and Yidong Zhang and Chicago Booth’s Linwei Xin find that A.I. can outperform human buyers in times of deep disruption—in part by avoiding the human tendency to overreact to risk.
Online retailers all over the world, including Amazon, the global leader, dealt with massive growth and inventory management challenges at the height of the pandemic. For their study, the researchers focused on Alibaba, China’s largest e-commerce platform by several measures and one of the largest, fastest-growing online retailers in the world. A.I. is used across the industry for inventory management, suggesting that algorithms can be applied to any online retailer.
Alibaba has enjoyed massive success with business-to-business, business-to-consumer, and consumer-to-consumer platforms selling products from Chinese and international manufacturers. The company reported total revenue of $134 billion for the fiscal year ending in March.
Alibaba’s Tmall Mart, a business-to-consumer digital mall with more than 500 million monthly users and a range of inventory from more than 70,000 brands, is an example of what management is up against. Intense competition from other online platforms means product demands that cannot be fulfilled immediately are usually lost to the business, the researchers say, so inventory management is key to survival. Tmall’s model includes the operation of regional distribution centers that serve as central warehouses in dozens of major Chinese cities. Suppliers deliver inventory to these centers for distribution to customers.
Human buyers who purchase and replenish goods for the business to sell may not be enough to handle inventory management alone anymore, which is where A.I. comes in. From August 2020 through November 2021, the researchers performed a competition at Tmall between human buyers and algorithmic buyers—an extension of a test they initially launched in April 2020 at the Shanghai location of Alibaba’s supermarket chain, Freshippo.
The researchers started by using a single-agent reinforcement learning (SARL) algorithm, which allows for the use of feature information such as page views, trends on social media, and other data to generate an inventory replenishment model. They then moved on to a multi-agent reinforcement learning (MARL) algorithm, which more fully captures the interplay between retailer and supplier.
“For example, how would the supplier respond if the retailer places a large order during a lockdown?” Xin says. “The [MARL] model provides an estimation of lead time and fill rate—as in, lead time would increase by 30 percent and fill rate would drop by 20 percent if the order volume is twice the normal.”
In all cases, the algorithms were not simply suggesting inventory replenishment strategies that humans could override. They were making final decisions.
Which came out ahead, the humans or the algorithms? The answer: both.
Algorithms, the researchers find, are optimal for managing slow-moving inventory—products that are important but may not have constant sales, as opposed to fast-moving items that sell every day. As Xin puts it, toilet paper is a fast-moving item, while a toilet brush is slow moving.
“It’s important to keep slow-moving items to offer customers more options,” Xin says. But such items contribute less to daily gross sales, and demand forecasting is time consuming. Managing this inventory with algorithms can free up human buyers to focus on the top-selling items.
Before the implementation of algorithmic product replenishment, Xin says, one human Tmall buyer managed up to thousands of products. “It was a tedious process and easy to make mistakes, not to mention many of these products rarely had sales but still took a nontrivial amount of a human’s time,” he says. “Now [the number of products managed] is reduced by 70 percent, and all slow-moving products are managed by algorithms.”
Indeed, more than half of all Tmall products were being replenished by algorithms by the time the study ended in November 2021. The financial impact: for goods with $10 million in daily gross merchandise value, the use of SARL algorithms could improve the company’s bottom line by $19 million annually, while MARL ones could boost it by $31 million, the researchers argue.
What’s the key to algorithms outperforming in a crisis?
Typically, when large orders are placed, inventory levels rise and the out-of-stock product rate decreases. However, in this case, the products managed by humans had both higher orders and higher sellouts. The main reason for this, Xin says, is that lead time also increases for large orders during major disruptions. When all retailers need inventory, those who want more have to wait longer. This results in more product sellouts. Algorithms are better able to catch the fluctuation in lead times and fill rates during major disruptions and thus eliminate the human tendency to panic buy.
None of this means algorithms will be taking away all human buyers’ jobs anytime soon. Human communication with suppliers is still critical. For example, suppliers may share inside information that can’t be captured by an algorithmic model, and suppliers may find it “weird” if there is no person on the other side of a transaction, Xin says. Additionally, products that require frequent promotion and price changes can’t be managed by algorithm alone because this requires coordination with other business areas, such as marketing.
Jiaxi Liu, Shuyi Lin, Linwei Xin, and Yidong Zhang, “A.I. vs. Human Buyers: A Study of Alibaba’s Inventory Replenishment System,” Working paper, September 2022.
An analysis of stock returns tests the limits of A.I.
Is There a Ceiling for Gains in Machine-Learned Arbitrage?An expert panel discusses how A.I. is reshaping entrepreneurship, and how founders and investors are responding.
Is A.I. Startup Funding a Rerun of the Dot-Com Bubble?The most influential rating system rests on some faulty calculations, affecting millions of people and billions of dollars.
Hospital Ratings Are Deeply Flawed. Can They Be Fixed?Your Privacy
We want to demonstrate our commitment to your privacy. Please review Chicago Booth's privacy notice, which provides information explaining how and why we collect particular information when you visit our website.