NoiseTolerant-SBIR

所属分类:人工智能/神经网络/深度学习
开发工具:Python
文件大小:6894KB
下载次数:0
上传日期:2022-05-30 09:00:28
上 传 者sh-1993
说明:  NoiseTolerant SBIR,[CVPR 2022]“无忧素描:基于噪声容忍素描的图像检索”,IEEE计算机视觉会议...
([CVPR 2022] "Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.)

文件列表:
FGSBIR_Baseline (0, 2022-05-30)
FGSBIR_Baseline\Networks.py (3508, 2022-05-30)
FGSBIR_Baseline\__pycache__ (0, 2022-05-30)
FGSBIR_Baseline\__pycache__\Networks.cpython-36.pyc (3200, 2022-05-30)
FGSBIR_Baseline\__pycache__\cvs_dataset.cpython-36.pyc (3768, 2022-05-30)
FGSBIR_Baseline\__pycache__\cvs_train_dataset.cpython-36.pyc (3811, 2022-05-30)
FGSBIR_Baseline\__pycache__\dataset.cpython-36.pyc (3765, 2022-05-30)
FGSBIR_Baseline\__pycache__\evaluation.cpython-36.pyc (3907, 2022-05-30)
FGSBIR_Baseline\__pycache__\model.cpython-36.pyc (2819, 2022-05-30)
FGSBIR_Baseline\__pycache__\rasterize.cpython-36.pyc (4828, 2022-05-30)
FGSBIR_Baseline\__pycache__\utils.cpython-36.pyc (1546, 2022-05-30)
FGSBIR_Baseline\condor_ev.py (2125, 2022-05-30)
FGSBIR_Baseline\condor_run.py (2229, 2022-05-30)
FGSBIR_Baseline\cvs_dataset.py (5392, 2022-05-30)
FGSBIR_Baseline\cvs_main.py (2696, 2022-05-30)
FGSBIR_Baseline\cvs_top_n_wrong_strokes.py (6503, 2022-05-30)
FGSBIR_Baseline\cvs_train_dataset.py (5407, 2022-05-30)
FGSBIR_Baseline\cvs_train_main.py (2774, 2022-05-30)
FGSBIR_Baseline\cvs_wrong_strokes.py (6217, 2022-05-30)
FGSBIR_Baseline\dataset.py (5424, 2022-05-30)
FGSBIR_Baseline\docker_stderror (0, 2022-05-30)
FGSBIR_Baseline\ev_1.py (6058, 2022-05-30)
FGSBIR_Baseline\ev_10.py (6060, 2022-05-30)
FGSBIR_Baseline\ev_5.py (6059, 2022-05-30)
FGSBIR_Baseline\ev_best.py (6065, 2022-05-30)
FGSBIR_Baseline\ev_best.txt (13952, 2022-05-30)
FGSBIR_Baseline\ev_new.py (6053, 2022-05-30)
FGSBIR_Baseline\evaluation.py (5905, 2022-05-30)
FGSBIR_Baseline\main.py (2526, 2022-05-30)
FGSBIR_Baseline\main_ev.py (1817, 2022-05-30)
FGSBIR_Baseline\model.py (3202, 2022-05-30)
FGSBIR_Baseline\rasterize.py (4631, 2022-05-30)
FGSBIR_Baseline\utils.py (2130, 2022-05-30)
FGSBIR_Baseline\wrong_stroke_ayan.py (7480, 2022-05-30)
FGSBIR_Baseline\wrong_strokes.py (7660, 2022-05-30)
FGSBIR_Baseline\wrong_strokes_start_from_half.py (5968, 2022-05-30)
SketchStrokeSubset_CVPR22.pdf (6700087, 2022-05-30)
Subset_PPO_ActorCritic (0, 2022-05-30)
... ...

# Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval, CVPR 2022 **Ayan Kumar Bhunia**, Subhadeep Koley, Abdullah Faiz Ur Rahman Khilji , Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song, “Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval”, IEEE Conf. on Computer Vision and Pattern Recognition (**CVPR**), 2022. ## SketchX_ShoeV2/ChairV2 Dataset: [Download](https://drive.google.com/file/d/1frltfiEd9ymnODZFHYrbg741kfys1rq1/view?usp=sharing) ## Abstract Sketching enables many exciting applications, notably, image retrieval. The fear-to-sketch problem (i.e., "I can't sketch") has however proven to be fatal for its widespread adoption. This paper tackles this "fear" head on, and for the first time, proposes an auxiliary module for existing retrieval models that predominantly lets the users sketch without having to worry. We first conducted a pilot study that revealed the secret lies in the existence of noisy strokes, but not so much of the "I can't sketch". We consequently design a stroke subset selector that detects noisy strokes, leaving only those which make a positive contribution towards successful retrieval. Our Reinforcement Learning based formulation quantifies the importance of each stroke present in a given subset, based on the extend to which that stroke contributes to retrieval. When combined with pre-trained retrieval models as a pre-processing module, we achieve a significant gain of 8%-10% over standard baselines and in turn report new state-of-the-art performance. Last but not least, we demonstrate the selector once trained, can also be used in a plug-and-play manner to empower various sketch applications in ways that were not previously possible. ## Outline ![Outline](./images/fig1.JPG) **Figure:** (a) While the average ranking percentile increases as the sketching proceeds from starting towards completion, unwanted sudden drops have been noticed for many individual sketches due to noisy/irrelevant strokes drawn. (b) The same thing is visualised with number of samples in the third axis to get an overall statistics on QMUL-Shoe-V2 dataset. ## Architecture ![Framework](./images/fig2.JPG) **Figure:** Illustration of Noise Tolerant FG-SBIR framework. Stroke Subset Selector X(·) acts as a pre-processing module in the sketch vector space to eliminate the noisy strokes. Selected stroke subset is then rasterized and fed through an existing pre-trained FG-SBIR model for reward calculation, which is optimised by Proximal Policy Optimisation. For brevity, actor-only version is shown here. ## Examples ![Framework](./images/fig3.JPG) **Figure:** Examples showing selected subset performing better (rank in box) than complete sketch from ShoeV2. ![Framework](./images/fig4.JPG) **Figure:** Examples showing ability to perform (rank in box) under synthetic noisy sketch input on ShoeV2. ## Citation If you find this article useful in your research, please consider citing: ``` @InProceedings{strokesubset, author = {Ayan Kumar Bhunia and Subhadeep Koley and Abdullah Faiz Ur Rahman Khilji and Aneeshan Sain and Pinaki Nath Chowdhury and Tao Xiang and Yi-Zhe Song}, title = {Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval}, booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2022} } ``` ## Work done at [SketchX Lab](http://sketchx.ai/), CVSSP, University of Surrey. ## Please check our other papers appearing in CVPR-2022 ``` * Sketch-Based Few-shot Incremental Learning * Partial SBIR * Sketch3T: Test-Time Training for Zero-Shot SBIR ```

近期下载者

相关文件


收藏者