Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672
Reference Type | Journal (article/letter/editorial) | ||
---|---|---|---|
Title | IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models | ||
Journal | Pattern Recognition | ||
Authors | Guo, Mingning | Author | |
Wu, Mengwei | Author | ||
Shen, Yuxiang | Author | ||
Li, Haifeng | Author | ||
Tao, Chao | Author | ||
Year | 2025 (October) | Volume | 166 |
Publisher | Elsevier BV | ||
DOI | doi:10.1016/j.patcog.2025.111672Search in ResearchGate | ||
Generate Citation Formats | |||
Mindat Ref. ID | 18310426 | Long-form Identifier | mindat:1:5:18310426:1 |
GUID | 0 | ||
Full Reference | Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672 | ||
Plain Text | Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672 | ||
In | (2025) Pattern Recognition Vol. 166. Elsevier BV |
References Listed
These are the references the publisher has listed as being connected to the article. Please check the article itself for the full list of references which may differ. Not all references are currently linkable within the Digital Library.
Not Yet Imported: IEEE Transactions on Circuits and Systems for Video Technology - journal-article : 10.1109/TCSVT.2024.3370731 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Chen (2022) IEEE Trans. Geosci. Remote Sens. Contrastive learning for fine-grained ship classification in remote sensing images 60, 1 | |
![]() | Gao, Gui, Zhou, Ping, Yao, Libo, Liu, Jia, Zhang, Chuan, Duan, Dingfeng (2023) A Bi-Prototype BDC Metric Network With Lightweight Adaptive Task Attention for Few-Shot Fine-Grained Ship Classification in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing, 61. 1-16 doi:10.1109/tgrs.2023.3321533 |
![]() | |
Vaswani (2017) Adv. Neural Inf. Process. Syst. Attention is all you need 30 | |
![]() | |
Yang (2024) IEEE Trans. Geosci. Remote Sens. Adaptive mid-level feature attention learning for fine-grained ship classification in optical remote sensing images | |
Xiong (2022) IEEE Trans. Geosci. Remote Sens. An explainable attention network for fine-grained ship classification using remote-sensing images 60, 1 | |
![]() | |
Liu (2024) Adv. Neural Inf. Process. Syst. Visual instruction tuning 36 | |
Li (2024) Adv. Neural Inf. Process. Syst. Llava-med: Training a large language-and-vision assistant for biomedicine in one day 36 | |
Hu (2023) | |
Zhang (2024) IEEE Trans. Geosci. Remote Sens. Earthgpt: A universal multi-modal large language model for multi-sensor image comprehension in remote sensing domain | |
Wei (2022) Adv. Neural Inf. Process. Syst. Chain-of-thought prompting elicits reasoning in large language models 35, 24824 | |
![]() | |
Not Yet Imported: - journal-article : 10.1109/TCSVT.2023.3236636 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: Lecture Notes in Electrical Engineering - book-chapter : 10.1007/978-3-642-12990-2_85 If you would like this item imported into the Digital Library, please contact us quoting Book ID 9783642129896 | |
![]() | Obeso, Abraham Montoya, Benois-Pineau, Jenny, GarcĂa VĂĄzquez, Mireya SaraĂ, Acosta, Alejandro Ălvaro RamĂrez (2022) Visual vs internal attention mechanisms in deep neural networks for image classification and object detection. Pattern Recognition, 123. 108411pp. doi:10.1016/j.patcog.2021.108411 |
Not Yet Imported: - journal-article : 10.3233/JIFS-179071 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: Remote Sensing - journal-article : 10.3390/rs14133087 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing - journal-article : 10.1109/JSTARS.2020.2981686 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Jiang (2024) Delving into multimodal prompting for fine-grained visual classification vol. 38, 2570 | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR52688.2022.01750 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Xu (2024) IEEE Robot. Autom. Lett. Drivegpt4: Interpretable end-to-end autonomous driving via large language model | |
K. Kuckreja, M.S. Danish, M. Naseer, A. Das, S. Khan, F.S. Khan, Geochat: Grounded large vision-language model for remote sensing, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 27831â27840. | |
![]() | |
Shuang (2024) Pattern Recognit. Visual primitives as words: Alignment and interaction for compositional zero-shot learning | |
![]() | |
Not Yet Imported: 2015 IEEE International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV.2015.170 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00530 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00515 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1007/s41095-023-0364-2 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Dosovitskiy (2020) | |
![]() | |
Not Yet Imported: IEEE Transactions on Multimedia - journal-article : 10.1109/TMM.2023.3244340 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Radford (2021) Learning transferable visual models from natural language supervision , 8748 | |
Zhu (2023) | |
Lin (2024) | |
He (2024) | |
Dai (2024) Adv. Neural Inf. Process. Syst. Instructblip: Towards general-purpose vision-language models with instruction tuning 36 |
See Also
These are possibly similar items as determined by title/reference text matching only.
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() |