Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672

Library Home Bookshelves View by Type Using Search

Books Catalogs/Sales Lists Journals Reports Thesis/Dissertation

Search for Books Search for Journals Manage Subjects Statistics Books without DDC/LCC Top Unstructured Orphaned Articles

Bookshelves (DDC layout)Bookshelves (LCC layout)Latest Books

Advanced

Search inside 'Pattern Recognition' only

- Only viewable:

Reference Type	Journal (article/letter/editorial)
Title	IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models
Journal	Pattern Recognition
Authors	Guo, Mingning		Author
	Wu, Mengwei		Author
	Shen, Yuxiang		Author
	Li, Haifeng		Author
	Tao, Chao		Author
Year	2025 (October)	Volume	166
Publisher	Elsevier BV
DOI	doi:10.1016/j.patcog.2025.111672Search in ResearchGate
	Generate Citation Formats
Mindat Ref. ID	18310426	Long-form Identifier	mindat:1:5:18310426:1
GUID	0
Full Reference	Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672
Plain Text	Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672
In	(2025) Pattern Recognition Vol. 166. Elsevier BV

References Listed

These are the references the publisher has listed as being connected to the article. Please check the article itself for the full list of references which may differ. Not all references are currently linkable within the Digital Library.

	Not Yet Imported: IEEE Transactions on Circuits and Systems for Video Technology - journal-article : 10.1109/TCSVT.2024.3370731 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Chen (2022) IEEE Trans. Geosci. Remote Sens. Contrastive learning for fine-grained ship classification in remote sensing images 60, 1
	Gao, Gui, Zhou, Ping, Yao, Libo, Liu, Jia, Zhang, Chuan, Duan, Dingfeng (2023) A Bi-Prototype BDC Metric Network With Lightweight Adaptive Task Attention for Few-Shot Fine-Grained Ship Classification in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing, 61. 1-16 doi:10.1109/tgrs.2023.3321533
	Tajbakhsh, Nima, Suzuki, Kenji (2017) Comparing two classes of end-to-end machine-learning models in lung nodule detection and classification: MTANNs vs. CNNs. Pattern Recognition, 63. 476-486 doi:10.1016/j.patcog.2016.09.029
	Vaswani (2017) Adv. Neural Inf. Process. Syst. Attention is all you need 30
	*Chen, Cheng, Li, Bo (2023) An Interpretable Channelwise Attention Mechanism based on Asymmetric and Skewed Gaussian Distribution. Pattern Recognition, 139. 109467 doi:10.1016/j.patcog.2023.109467*
	Yang (2024) IEEE Trans. Geosci. Remote Sens. Adaptive mid-level feature attention learning for fine-grained ship classification in optical remote sensing images
	Xiong (2022) IEEE Trans. Geosci. Remote Sens. An explainable attention network for fine-grained ship classification using remote-sensing images 60, 1
	Wu, Haiyang, Du, Zhuofei, Zhong, Dandan, Wang, Yuze, Tao, Chao (2025) FSVLM: A Vision-Language Model for Remote Sensing Farmland Segmentation. IEEE Transactions on Geoscience and Remote Sensing, 63. doi:10.1109/tgrs.2025.3532960
	Liu (2024) Adv. Neural Inf. Process. Syst. Visual instruction tuning 36
	Li (2024) Adv. Neural Inf. Process. Syst. Llava-med: Training a large language-and-vision assistant for biomedicine in one day 36
	Hu (2023)
	Zhang (2024) IEEE Trans. Geosci. Remote Sens. Earthgpt: A universal multi-modal large language model for multi-sensor image comprehension in remote sensing domain
	Wei (2022) Adv. Neural Inf. Process. Syst. Chain-of-thought prompting elicits reasoning in large language models 35, 24824
	Li, Xiaoxu, Li, Zhen, Xie, Jiyang, Yang, Xiaochen, Xue, Jing-Hao, Ma, Zhanyu (2024) Self-reconstruction network for fine-grained few-shot classification. Pattern Recognition, 153. 110485 doi:10.1016/j.patcog.2024.110485
	Not Yet Imported: - journal-article : 10.1109/TCSVT.2023.3236636 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: Lecture Notes in Electrical Engineering - book-chapter : 10.1007/978-3-642-12990-2_85 If you would like this item imported into the Digital Library, please contact us quoting Book ID 9783642129896
	Obeso, Abraham Montoya, Benois-Pineau, Jenny, García Vázquez, Mireya Saraí, Acosta, Alejandro Álvaro Ramírez (2022) Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognition, 123. 108411pp. doi:10.1016/j.patcog.2021.108411
	Not Yet Imported: - journal-article : 10.3233/JIFS-179071 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: Remote Sensing - journal-article : 10.3390/rs14133087 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing - journal-article : 10.1109/JSTARS.2020.2981686 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Jiang (2024) Delving into multimodal prompting for fine-grained visual classification vol. 38, 2570
	Not Yet Imported: - proceedings-article : 10.1109/CVPR52688.2022.01750 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Xu (2024) IEEE Robot. Autom. Lett. Drivegpt4: Interpretable end-to-end autonomous driving via large language model
	K. Kuckreja, M.S. Danish, M. Naseer, A. Das, S. Khan, F.S. Khan, Geochat: Grounded large vision-language model for remote sensing, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 27831–27840.
	Hu, Zhongjian, Yang, Peng, Jiang, Yuanshuang, Bai, Zijian (2024) Prompting large language model with context and pre-answer for knowledge-based VQA. Pattern Recognition, 151. 110399 doi:10.1016/j.patcog.2024.110399
	Shuang (2024) Pattern Recognit. Visual primitives as words: Alignment and interaction for compositional zero-shot learning
	*Di, Yanghua, Jiang, Zhiguo, Zhang, Haopeng (2021) A Public Dataset for Fine-Grained Ship Classification in Optical Remote Sensing Images. Remote Sensing, 13 (4). doi:10.3390/rs13040747*
	Not Yet Imported: 2015 IEEE International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV.2015.170 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00530 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00515 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1007/s41095-023-0364-2 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Dosovitskiy (2020)
	Zhang, Zi-Chao, Chen, Zhen-Duo, Wang, Yongxin, Luo, Xin, Xu, Xin-Shun (2024) A vision transformer for fine-grained classification by reducing noise and enhancing discriminative information. Pattern Recognition, 145. 109979 doi:10.1016/j.patcog.2023.109979
	Not Yet Imported: IEEE Transactions on Multimedia - journal-article : 10.1109/TMM.2023.3244340 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Radford (2021) Learning transferable visual models from natural language supervision , 8748
	Zhu (2023)
	Lin (2024)
	He (2024)
	Dai (2024) Adv. Neural Inf. Process. Syst. Instructblip: Towards general-purpose vision-language models with instruction tuning 36

	Ke, Xiao; Cai, Yuhang; Chen, Baitao; Liu, Hao; Guo, Wenzhong (2025) Multi-granularity interaction and feature recombination network for fine-grained visual classification. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111632
	*Gao, Yansheng; Zhu, Zixi; Wang, Shengsheng (2026) Mixture of coarse and fine-grained prompt tuning for vision-language model. Pattern Recognition, 170. doi:10.1016/j.patcog.2025.112074*
	Li, Dong, Jin, Jiandong, Zhang, Yuhao, Zhong, Yanlin, Wu, Yaoyang, Chen, Lan, Wang, Xiao, Luo, Bin (2025) Semantic-aware frame-event fusion based pattern recognition via large vision–language models. Pattern Recognition, 158. doi:10.1016/j.patcog.2024.111080
	Yin, Junhui, Zhang, Xinyu, Wu, Lin, Wang, Xiaojie (2025) Context-aware prompt learning for test-time vision recognition with frozen vision-language model. Pattern Recognition, 162. doi:10.1016/j.patcog.2025.111359
	Zhang, Zi-Chao, Chen, Zhen-Duo, Wang, Yongxin, Luo, Xin, Xu, Xin-Shun (2024) A vision transformer for fine-grained classification by reducing noise and enhancing discriminative information. Pattern Recognition, 145. 109979 doi:10.1016/j.patcog.2023.109979
	Shi, Yanli, Hong, Qihua, Yan, Yong, Li, Jing (2025) LDH-ViT: Fine-grained visual classification through local concealment and feature selection. Pattern Recognition, 161. doi:10.1016/j.patcog.2024.111224
	*Li, Yuting, Chen, Dexiong, Tang, Tinglong, Shen, Xi (2025) HTR-VT: Handwritten text recognition with vision transformer. Pattern Recognition, 158. doi:10.1016/j.patcog.2024.110967*
	Xu, Wan, Huang, Tianyu, Qu, Tianyuan, Yang, Guanglei, Guo, Yiwen, Zuo, Wangmeng (2025) FILP-3D: Enhancing 3D few-shot class-incremental learning with pre-trained vision-language models. Pattern Recognition, 165. doi:10.1016/j.patcog.2025.111558
	Zhang, Guoqing; Kan, Shichao; Shi, Lu; Xu, Wanru; An, Gaoyun; Cen, Yigang (2025) Cross-scene visual context parsing with large vision-language model. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111641
	Deng, Cheng, Liu, Xianglong, Li, Chao, Tao, Dacheng (2018) Active multi-kernel domain adaptation for hyperspectral image classification. Pattern Recognition, 77. 306-315 doi:10.1016/j.patcog.2017.10.007
	*Yin, Yueming, Yang, Zhen, Hu, Haifeng, Wu, Xiaofu (2022) Universal multi-Source domain adaptation for image classification. Pattern Recognition, 121. 108238pp. doi:10.1016/j.patcog.2021.108238*

Mindat.org is an outreach project of the Hudson Institute of Mineralogy, a 501(c)(3) not-for-profit organization.
Copyright © mindat.org and the Hudson Institute of Mineralogy 1993-2025, except where stated. Most political location boundaries are © OpenStreetMap contributors. Mindat.org relies on the contributions of thousands of members and supporters. Founded in 2000 by Jolyon Ralph.
To cite: Ralph, J., Von Bargen, D., Martynov, P., Zhang, J., Que, X., Prabhu, A., Morrison, S. M., Li, W., Chen, W., & Ma, X. (2025). Mindat.org: The open access mineralogy database to accelerate data-intensive geoscience research. American Mineralogist, 110(6), 833–844. doi:10.2138/am-2024-9486.
Privacy Policy - Terms & Conditions - Contact Us / DMCA issues - Report a bug/vulnerability Current server date and time: September 20, 2025 14:47:29

Go to top of page

Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672

References Listed

See Also