3rd Physical Retail AI Workshop (PRAW)
Foundation Models and GenAI
Technologies for Physical Retail







March 7 2026 WACV





Overview and Topics


In an evolving world in which consumers have a plurality of shopping methods available to them, physical “brick and mortar” stores continue to be the preferred means of shopping around the world. From groceries to clothing, customers continue to show strong demand for in person shopping.

Applications of vision-based Artificial Intelligence (AI) methods are increasingly present throughout society. Fueled by recent advances in Computer Vision, Deep Learning, web-scale training of vision and language models (“foundation models”), and edge compute, AI applications have expanded into a novel array of industries and products. In particular, the physical retail and grocery sectors have recently experienced an explosion of AI-enabled technologies, allowing for more efficient, effortless, and engaging experiences for shoppers, enabling the reduction of shrinkage for retailers, and providing insights on improving store efficiency, thereby reducing operational costs. Computer Vision applications are being deployed to numerous retail sectors, including small convenience stores, large grocery stores, fashion stores, and shopping carts, to name but a few.

The focus of this workshop includes active areas of research and development in the physical retail space, including:

  • Multi-modal modeling for shopping activity recognition, product detection/tracking/identification, and product quantity estimation
  • Applications of Generative AI and Large Language Models (LLM) to Physical Retail applications
  • Zero-Shot learning methods and approaches for activity recognition and understanding, appearance-based classification, and object detection
  • Zero-shot data labeling with foundation models
  • Appearance-based classification from a large gallery of product classes in the wild and open-set recognition (OSR)
  • Store analytics and shopping cart localization within a store environment
  • Generative models and systems for synthetic data generation of images and videos paired with ground truth labels

Important Dates


Workshop Paper Track Date
Paper Submission Site: OpenReview
Call For Paper Release 11/07/2025
Paper Submission Deadline 12/19/2025
12/26/2025
Notification of Paper Acceptance 12/26/2026
01/02/2026
Camera-Ready Paper Deadline (to be included in the proceeding) 01/09/2026
Workshop Date 03/07/2026 AM
Workshop Challenges Track Date
Challenge Participation: Kaggle
Call For Challenge Submission Release 11/07/2025
Challenge Training Data Release 12/12/2025
Challenge Submission Deadline 01/16/2026
Challenge Results Release and Winner Notification 01/30/2026

Tentative Workshop Schedule


Time Talk Name Title/Institution
08:50-09:00AM Opening Remarks Shun Miao Principal Applied Scientist @Amazon
09:00-09:30AM
Invited Talk:
Essences and Accidents in Video-Based Human Behavior Classification
Calvin Breseman Machine Learning Researcher & Staff Data Scientist @Standard AI
09:30-10:00AM
Invited Talk:
TBD
Chris Broaddus Sr. Applied Science Manager @Amazon
10:00-10:15AM COFFEE BREAK
10:15-10:25AM
Paper Presentation:
Zero-Shot Product Attribute Labeling with Vision-Language Models: A Three-Tier Evaluation Framework
Shubham Shukla
Kunal Sonalkar
Nordstrom
10:25-10:35AM
Paper Presentation:
CHURI: Contour-Gated, Resource-Intelligent Retail Tracking System
Riya Waghmare University of Southampton
10:35-10:45AM
Paper Presentation:
From Pixels to Purchase: Building and Evaluating a Taxonomy-Decoupled Visual Search Engine for Home Goods E-commerce
Cheng Lyu
Jingyue Zhang
Ryan Maunu
Mengwei Li
Vinny DeGenova
Yuanli Pei
Wayfair
10:45-10:55AM
Paper Presentation:
Carts Tell the Story: A Cart-Centric Temporal Modeling Framework for Privacy-Preserving Retail Behavior Understanding
Mahule Roy
Subhas Roy
National Institute of Technology Karnataka
11:00-11:30AM
Invited Talk:
TBD
Dr. Yosi Keller Principal Applied Scientist @Amazon; Professor @Bar-Ilan University
11:30-11:40AM
Challenge winner presentation 1:
Pose-aware Spatio-temporal Event Localization of Interactions in Multi-view Retail Videos
Winner team
11:40-11:50AM
Challenge winner Presentation 2:
Double Vision: Two Views, One Solution for Efficient Stereo Action Recognition
Second place team
11:50-12:00PM Closing Remarks Shun Miao Principal Applied Scientist @Amazon


Organizers


Dr. Quanfu Fan
(Amazon)

Dr. Shun Miao
(Amazon)

Dr. Weijian Li
(Amazon)

Dr. Sean Ma
(Amazon)

Dr. David Woollard
(Standard AI)

Dr. Bruno Abbate
(Standard AI)

Dr. Davide Mazzini
(Standard AI)

Dr. Rocco Pietrini
(Mercatorum University)