TY - GEN
T1 - Augment with Teacher and Distill with Student
T2 - 12th International Conference on Computing and Pattern Recognition, ICCPR 2023
AU - Zhang, Chao
AU - Hotta, Katsuya
AU - Ueshima, Takuma
AU - Yu, Jun
AU - Gu, Chunzhi
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/10/27
Y1 - 2023/10/27
N2 - Human segmentation using point clouds requires clustering of points belonging to the same human body part. In the supervised learning scenario, previous studies can segment the human body parts to some extent. However, segmentation easily fails for complex postures, especially for the parts with a wide range of motion (e.g., parts from the hand to the upper arm). To alleviate this problem, first, the Random Vertex Displacement (RVD) filter is applied to an existing human body point clouds dataset to augment the training data. Specifically, the RVD filter creates a sphere with a given radius centered on each point that constitutes the human point cloud. The point is randomly shifted within the sphere for augmentation. The model trained with the RVD augmented data is treated as the teacher network. Second, we train a student network from scratch to generate the same intermediate representation to mimic the teacher network. In the experiment, the teacher network improves the average IoU by around 2%, and to our surprise, the student network further outperforms the teacher by another 2%, which well validates the effectiveness of the proposed two-stage scheme for human segmentation.
AB - Human segmentation using point clouds requires clustering of points belonging to the same human body part. In the supervised learning scenario, previous studies can segment the human body parts to some extent. However, segmentation easily fails for complex postures, especially for the parts with a wide range of motion (e.g., parts from the hand to the upper arm). To alleviate this problem, first, the Random Vertex Displacement (RVD) filter is applied to an existing human body point clouds dataset to augment the training data. Specifically, the RVD filter creates a sphere with a given radius centered on each point that constitutes the human point cloud. The point is randomly shifted within the sphere for augmentation. The model trained with the RVD augmented data is treated as the teacher network. Second, we train a student network from scratch to generate the same intermediate representation to mimic the teacher network. In the experiment, the teacher network improves the average IoU by around 2%, and to our surprise, the student network further outperforms the teacher by another 2%, which well validates the effectiveness of the proposed two-stage scheme for human segmentation.
KW - 3D Human Segmentation
KW - Point Cloud
KW - Teacher-Student Network
UR - http://www.scopus.com/inward/record.url?scp=85187554094&partnerID=8YFLogxK
U2 - 10.1145/3633637.3633696
DO - 10.1145/3633637.3633696
M3 - 会議への寄与
AN - SCOPUS:85187554094
T3 - ACM International Conference Proceeding Series
SP - 375
EP - 379
BT - ICCPR 2023 - Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition
PB - Association for Computing Machinery
Y2 - 27 October 2023 through 29 October 2023
ER -