From Vision to Motion: Translating Large-Scale Knowledge for Data-Scarce IMU Applications



Abstract


Pre-trained representations acquired via self-supervised learning can achieve high accuracy even on tasks with little training data. Unlike in the vision and natural language processing domains, pre-training for IMU-based applications is challenging, as few public datasets are large and diverse enough to learn generalizable representations. To overcome this problem, we propose IMG2IMU, which adapts representations pre-trained on large-scale images to diverse IMU sensing tasks. We convert sensor data into visually interpretable spectrograms so that the model can utilize knowledge gained from vision. We further present a sensor-aware pre-training method for images that enables models to acquire knowledge particularly impactful for IMU sensing applications, using contrastive learning with an augmentation set customized to the properties of sensor data. Our evaluation on four IMU sensing tasks shows that IMG2IMU outperforms baselines pre-trained on sensor data by an average of 9.6%p in F1-score, illustrating that vision knowledge can be usefully incorporated into IMU sensing applications where only limited training data is available.
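As a rough illustration of the sensor-to-image step, below is a minimal sketch of converting a 3-axis IMU window into a spectrogram "image" that a vision-pretrained model could consume. This is not the authors' released pipeline; the function name, the 50 Hz sampling rate, the STFT window sizes, and the axis-to-channel stacking are all illustrative assumptions.

```python
# Minimal sketch (not the paper's exact code): map a 3-axis IMU window
# to a 3-channel spectrogram image for a vision-pretrained backbone.
import numpy as np
from scipy.signal import spectrogram

def imu_window_to_spectrogram(window, fs=50, nperseg=32, noverlap=16):
    """window: (T, 3) array of accelerometer samples (x, y, z axes).
    Returns an (F, S, 3) spectrogram image, one channel per axis.
    fs/nperseg/noverlap are assumed values, not from the paper."""
    channels = []
    for axis in range(window.shape[1]):
        _, _, sxx = spectrogram(window[:, axis], fs=fs,
                                nperseg=nperseg, noverlap=noverlap)
        sxx = np.log1p(sxx)                            # compress dynamic range
        sxx = (sxx - sxx.min()) / (np.ptp(sxx) + 1e-8)  # scale to [0, 1]
        channels.append(sxx)
    return np.stack(channels, axis=-1)

# Usage: a 4-second window sampled at 50 Hz.
window = np.random.randn(200, 3).astype(np.float32)
img = imu_window_to_spectrogram(window)
print(img.shape)  # (17, 11, 3); resize before feeding a vision model
```

The resulting image would then be fed through standard contrastive pre-training; the paper's contribution is the sensor-aware choice of augmentations, which this sketch does not reproduce.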


Publications


From Vision to Motion: Translating Large-Scale Knowledge for Data-Scarce IMU Applications
Hyungjun Yoon, Hyeongheon Cha, Hoang C. Nguyen, Taesik Gong, Sung-Ju Lee
IEEE Transactions on Mobile Computing.



People


Hyungjun Yoon, KAIST
Hyeongheon Cha, KAIST
Hoang C. Nguyen, KAIST
Taesik Gong, UNIST
Sung-Ju Lee, KAIST