Vote for your favorite mineral in #MinCup25! - Carpathite vs. Tugtupite
Battle of mineralogical oddities as organic mineral #carpathite battles against vs #tugtupite, a mineral that looks like reindeer blood!
Log InRegister
Quick Links : The Mindat ManualThe Rock H. Currier Digital LibraryMindat Newsletter [Free Download]
Home PageAbout MindatThe Mindat ManualHistory of MindatCopyright StatusWho We AreContact UsAdvertise on Mindat
Donate to MindatCorporate SponsorshipSponsor a PageSponsored PagesMindat AdvertisersAdvertise on Mindat
Learning CenterWhat is a mineral?The most common minerals on earthInformation for EducatorsMindat ArticlesThe ElementsThe Rock H. Currier Digital LibraryGeologic Time
Minerals by PropertiesMinerals by ChemistryAdvanced Locality SearchRandom MineralRandom LocalitySearch by minIDLocalities Near MeSearch ArticlesSearch GlossaryMore Search Options
Search For:
Mineral Name:
Locality Name:
Keyword(s):
 
The Mindat ManualAdd a New PhotoRate PhotosLocality Edit ReportCoordinate Completion ReportAdd Glossary Item
Mining CompaniesStatisticsUsersMineral MuseumsClubs & OrganizationsMineral Shows & EventsThe Mindat DirectoryDevice SettingsThe Mineral Quiz
Photo SearchPhoto GalleriesSearch by ColorNew Photos TodayNew Photos YesterdayMembers' Photo GalleriesPast Photo of the Day GalleryPhotography

Guo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672

Advanced
   -   Only viewable:
Reference TypeJournal (article/letter/editorial)
TitleIFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models
JournalPattern Recognition
AuthorsGuo, MingningAuthor
Wu, MengweiAuthor
Shen, YuxiangAuthor
Li, HaifengAuthor
Tao, ChaoAuthor
Year2025 (October)Volume166
PublisherElsevier BV
DOIdoi:10.1016/j.patcog.2025.111672Search in ResearchGate
Generate Citation Formats
Mindat Ref. ID18310426Long-form Identifiermindat:1:5:18310426:1
GUID0
Full ReferenceGuo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672
Plain TextGuo, Mingning; Wu, Mengwei; Shen, Yuxiang; Li, Haifeng; Tao, Chao (2025) IFShip: Interpretable fine-grained ship classification with domain knowledge-enhanced vision-language models. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111672
In(2025) Pattern Recognition Vol. 166. Elsevier BV

References Listed

These are the references the publisher has listed as being connected to the article. Please check the article itself for the full list of references which may differ. Not all references are currently linkable within the Digital Library.

Not Yet Imported: IEEE Transactions on Circuits and Systems for Video Technology - journal-article : 10.1109/TCSVT.2024.3370731

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Chen (2022) IEEE Trans. Geosci. Remote Sens. Contrastive learning for fine-grained ship classification in remote sensing images 60, 1
Vaswani (2017) Adv. Neural Inf. Process. Syst. Attention is all you need 30
Yang (2024) IEEE Trans. Geosci. Remote Sens. Adaptive mid-level feature attention learning for fine-grained ship classification in optical remote sensing images
Xiong (2022) IEEE Trans. Geosci. Remote Sens. An explainable attention network for fine-grained ship classification using remote-sensing images 60, 1
Liu (2024) Adv. Neural Inf. Process. Syst. Visual instruction tuning 36
Li (2024) Adv. Neural Inf. Process. Syst. Llava-med: Training a large language-and-vision assistant for biomedicine in one day 36
Hu (2023)
Zhang (2024) IEEE Trans. Geosci. Remote Sens. Earthgpt: A universal multi-modal large language model for multi-sensor image comprehension in remote sensing domain
Wei (2022) Adv. Neural Inf. Process. Syst. Chain-of-thought prompting elicits reasoning in large language models 35, 24824
Not Yet Imported: - journal-article : 10.1109/TCSVT.2023.3236636

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: Lecture Notes in Electrical Engineering - book-chapter : 10.1007/978-3-642-12990-2_85

If you would like this item imported into the Digital Library, please contact us quoting Book ID 9783642129896
Not Yet Imported: - journal-article : 10.3233/JIFS-179071

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: Remote Sensing - journal-article : 10.3390/rs14133087

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing - journal-article : 10.1109/JSTARS.2020.2981686

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Jiang (2024) Delving into multimodal prompting for fine-grained visual classification vol. 38, 2570
Not Yet Imported: - proceedings-article : 10.1109/CVPR52688.2022.01750

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Xu (2024) IEEE Robot. Autom. Lett. Drivegpt4: Interpretable end-to-end autonomous driving via large language model
K. Kuckreja, M.S. Danish, M. Naseer, A. Das, S. Khan, F.S. Khan, Geochat: Grounded large vision-language model for remote sensing, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 27831–27840.
Shuang (2024) Pattern Recognit. Visual primitives as words: Alignment and interaction for compositional zero-shot learning
Not Yet Imported: 2015 IEEE International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV.2015.170

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00530

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.00515

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Not Yet Imported: - journal-article : 10.1007/s41095-023-0364-2

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Dosovitskiy (2020)
Not Yet Imported: IEEE Transactions on Multimedia - journal-article : 10.1109/TMM.2023.3244340

If you would like this item imported into the Digital Library, please contact us quoting Journal ID
Radford (2021) Learning transferable visual models from natural language supervision , 8748
Zhu (2023)
Lin (2024)
He (2024)
Dai (2024) Adv. Neural Inf. Process. Syst. Instructblip: Towards general-purpose vision-language models with instruction tuning 36


See Also

These are possibly similar items as determined by title/reference text matching only.

 
and/or  
Mindat.org is an outreach project of the Hudson Institute of Mineralogy, a 501(c)(3) not-for-profit organization.
Copyright © mindat.org and the Hudson Institute of Mineralogy 1993-2025, except where stated. Most political location boundaries are Š OpenStreetMap contributors. Mindat.org relies on the contributions of thousands of members and supporters. Founded in 2000 by Jolyon Ralph.
To cite: Ralph, J., Von Bargen, D., Martynov, P., Zhang, J., Que, X., Prabhu, A., Morrison, S. M., Li, W., Chen, W., & Ma, X. (2025). Mindat.org: The open access mineralogy database to accelerate data-intensive geoscience research. American Mineralogist, 110(6), 833–844. doi:10.2138/am-2024-9486.
Privacy Policy - Terms & Conditions - Contact Us / DMCA issues - Report a bug/vulnerability Current server date and time: September 20, 2025 14:47:29
Go to top of page