TY - GEN
T1 - A Clicking Strategy Inspired by Inter-Individual Variation for Interactive Image Segmentation
AU - Zhao, Shuofeng
AU - Gu, Chunzhi
AU - Yu, Jun
AU - Zhang, Chao
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/12/16
Y1 - 2023/12/16
N2 - Click-based interactive segmentation is a fundamental task in computer vision that allows user clicks to refine the results. Existing works typically focus on developing powerful segmentation models, yet sparsely treating the clicking method itself. In this study, we propose a novel clicking strategy that specifically aims to reflect the inter-individual variations of different humans during training to improve segmentation results. Our method consists of three steps. In particular, we first apply the erosion operation on the ground-Truth segmentation mask with different parameter settings to generate multiple eroded masks. These eroded masks are then regarded as possible hypotheses of users' interested regions. Then, we randomly select one mask from the hypotheses to simulate an arbitrary users' behavior. By next solving the visual center of the selected mask, the training clicks are eventually obtained via randomly sampling from the visual center region. In essence, our key idea is to cast multiple eroded regions as the potentially diverse users' interests, and include the resulting stochasticity into the model training for better generality. We directly adopt an existing segmentation backbone and incorporate our clicking strategy in the training to show the effectiveness of our method. Experimental results on five datasets generally demonstrate that our method contributes to state-of-The-Art segmentation performance.
AB - Click-based interactive segmentation is a fundamental task in computer vision that allows user clicks to refine the results. Existing works typically focus on developing powerful segmentation models, yet sparsely treating the clicking method itself. In this study, we propose a novel clicking strategy that specifically aims to reflect the inter-individual variations of different humans during training to improve segmentation results. Our method consists of three steps. In particular, we first apply the erosion operation on the ground-Truth segmentation mask with different parameter settings to generate multiple eroded masks. These eroded masks are then regarded as possible hypotheses of users' interested regions. Then, we randomly select one mask from the hypotheses to simulate an arbitrary users' behavior. By next solving the visual center of the selected mask, the training clicks are eventually obtained via randomly sampling from the visual center region. In essence, our key idea is to cast multiple eroded regions as the potentially diverse users' interests, and include the resulting stochasticity into the model training for better generality. We directly adopt an existing segmentation backbone and incorporate our clicking strategy in the training to show the effectiveness of our method. Experimental results on five datasets generally demonstrate that our method contributes to state-of-The-Art segmentation performance.
KW - Click strategies
KW - Human Visual Psychology
KW - Inter-Individual variation
KW - Interactive image segmentation
KW - Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85190947336&partnerID=8YFLogxK
U2 - 10.1145/3639592.3639604
DO - 10.1145/3639592.3639604
M3 - 会議への寄与
AN - SCOPUS:85190947336
T3 - ACM International Conference Proceeding Series
SP - 78
EP - 83
BT - AICCC 2023 - 2023 6th Artificial Intelligence and Cloud Computing Conference
PB - Association for Computing Machinery
T2 - 6th Artificial Intelligence and Cloud Computing Conference, AICCC 2023
Y2 - 16 December 2023 through 18 December 2023
ER -