Deployment-Aligned Low-Precision Neural Architecture Search for Spaceborne Edge AI
New deployment-aligned low-precision NAS boosts space AI accuracy by 67% without hardware changes
A team led by Parampuneet Kaur Thind from the University of Toronto and the European Space Agency (ESA) has developed a novel neural architecture search (NAS) framework that explicitly addresses the deployment gap in low-precision edge AI systems. Traditionally, hardware-aware NAS optimizes models under full-precision assumptions and applies low-precision adaptation only after search, which often degrades on-device performance. The new approach, called deployment-aligned low-precision NAS, integrates FP16 numerical constraints directly into the search process, ensuring architectures are optimized from the ground up for low-precision deployment.
The researchers evaluated their method on vessel segmentation for spaceborne maritime monitoring using the Intel Movidius Myriad X VPU. While post-training quantization reduced on-device performance from 0.85 to 0.78 mIoU, their deployment-aligned approach achieved 0.826 mIoU on the same architecture (95,791 parameters). This recovery of approximately two-thirds of the deployment-induced accuracy gap was achieved without increasing model complexity or modifying the search space or evolutionary strategy.
- New deployment-aligned low-precision NAS integrates FP16 constraints during architecture search, fixing the mismatch between optimization and deployment environments
- Tested on Intel Movidius Myriad X VPU for vessel segmentation, the method improved mIoU from 0.78 (post-quantization) to 0.826 while using the same 95,791-parameter model
- Authors: Parampuneet Kaur Thind, Vaibhav Katturu, Giacomo Zema, and Roberto Del Prete (University of Toronto/ESA)
Why It Matters
Enables robust edge AI for space and other resource-constrained environments without expensive retraining or hardware changes