Alert button

E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

Jun 05, 2022
Figure 1 for E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Figure 2 for E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Figure 3 for E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles
Figure 4 for E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

Share this with someone who'll enjoy it:

Unmanned Aerial Vehicles (UAVs) based video text spotting has been extensively used in civil and military domains. UAV's limited battery capacity motivates us to develop an energy-efficient video text spotting solution. In this paper, we first revisit RCNN's crop & resize training strategy and empirically find that it outperforms aligned RoI sampling on a real-world video text dataset captured by UAV. To reduce energy consumption, we further propose a multi-stage image processor that takes videos' redundancy, continuity, and mixed degradation into account. Lastly, the model is pruned and quantized before deployed on Raspberry Pi. Our proposed energy-efficient video text spotting solution, dubbed as E^2VTS, outperforms all previous methods by achieving a competitive tradeoff between energy efficiency and performance. All our codes and pre-trained models are available at https://github.com/wuzhenyusjtu/LPCVC20-VideoTextSpotting.

Share this with someone who'll enjoy it: