Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725

Library Home Bookshelves View by Type Using Search

Books Catalogs/Sales Lists Journals Reports Thesis/Dissertation

Search for Books Search for Journals Manage Subjects Statistics Books without DDC/LCC Top Unstructured Orphaned Articles

Bookshelves (DDC layout)Bookshelves (LCC layout)Latest Books

Advanced

Search inside 'Pattern Recognition' only

- Only viewable:

Reference Type	Journal (article/letter/editorial)
Title	SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos
Journal	Pattern Recognition
Authors	Liu, Dan		Author
	Meng, Fanrong		Author
	Mi, Jinpeng		Author
	Ye, Mao		Author
	Li, Qingdu		Author
	Zhang, Jianwei		Author
Year	2025 (December)	Volume	168
Publisher	Elsevier BV
DOI	doi:10.1016/j.patcog.2025.111725Search in ResearchGate
	Generate Citation Formats
Mindat Ref. ID	18446599	Long-form Identifier	mindat:1:5:18446599:6
GUID	0
Full Reference	Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725
Plain Text	Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725
In	(2025) Pattern Recognition Vol. 168. Elsevier BV

References Listed

These are the references the publisher has listed as being connected to the article. Please check the article itself for the full list of references which may differ. Not all references are currently linkable within the Digital Library.

	Wang, Fan, Li, Xinke, Xiong, Han, Mo, Haofan, Li, Yongming (2024) MLENet: Multi-Level Extraction Network for video action recognition. Pattern Recognition, 154. 110614 doi:10.1016/j.patcog.2024.110614
	Liu (2024) Neurocomputing Temporal cues enhanced multimodal learning for action recognition in RGB-D videos 594
	Wu (2024) Pattern Recognit. Local and global self-attention enhanced graph convolutional network for skeleton-based action recognition
	Miao (2024) IEEE Trans. Multimed. Adaptive pitfall: Exploring the effectiveness of adaptation in skeleton-based action recognition PP, 1
	Radford (2021) Learning transferable visual models from natural language supervision , 8748
	Jia (2021) Scaling up visual and vision-language representation learning with noisy text supervision , 4904
	Bruce (2022) IEEE Trans. Pattern Anal. Mach. Intell. Mmnet: A model-based multimodal network for human action recognition in rgb-d videos 45, 3522
	Simonyan (2014) Adv. Neural Inf. Process. Syst. Two-stream convolutional networks for action recognition in videos 27
	Not Yet Imported: IEEE Transactions on Image Processing - journal-article : 10.1109/TIP.2020.2965299 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: 2015 IEEE International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV.2015.510 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00553 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00338 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1109/ACCESS.2020.2983355 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Ma, Yujun, Wang, Ruili (2024) Relative-position embedding based spatially and temporally decoupled Transformer for action recognition. Pattern Recognition, 145. 109905 doi:10.1016/j.patcog.2023.109905
	Not Yet Imported: IEEE Transactions on Circuits and Systems for Video Technology - journal-article : 10.1109/TCSVT.2020.3019293 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/CVPR52688.2022.00298 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR.2015.7298714 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1109/TPAMI.2017.2771306 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1109/TIP.2018.2818328 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Yan (2018) Spatial temporal graph convolutional networks for skeleton-based action recognition vol. 32
	Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.01230 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR42600.2020.00022 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Plizzari (2021) Comput. Vis. Image Underst. Skeleton-based action recognition via spatial and temporal transformer networks 208
	Not Yet Imported: - proceedings-article : 10.1145/3581783.3611900 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR52729.2023.00634 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Chen (2022) Adv. Neural Inf. Process. Syst. Adaptformer: Adapting vision transformers for scalable visual recognition 35, 16664
	Pan (2022) Adv. Neural Inf. Process. Syst. St-adapter: Parameter-efficient image-to-video transfer learning 35, 26462
	Not Yet Imported: - proceedings-article : 10.1109/CVPR52729.2023.02206 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Lv, Jindi, Sun, Yanan, Ye, Qing, Feng, Wentao, Lv, Jiancheng (2024) A multiscale neural architecture search framework for multimodal fusion. Information Sciences, 679. doi:10.1016/j.ins.2024.121005
	Islam (2020) Hamlet: A hierarchical multimodal attention-based human activity recognition algorithm , 10285
	Islam (2022) Mumu: Cooperative multitask learning-based guided multimodal fusion vol. 36, 1043
	Ni (2022) Cross-modal knowledge distillation for vision-to-sensor action recognition , 4448
	Not Yet Imported: - journal-article : 10.1109/LSENS.2018.2878572 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1007/s12652-019-01239-9 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1109/TIP.2020.2967577 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Bruce (2021) Multimodal fusion via teacher-student network for indoor action recognition vol. 35, 3199
	Not Yet Imported: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV48922.2021.00912 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Das (2020) Vpn: Learning video-pose embedding for activities of daily living , 72
	Not Yet Imported: - proceedings-article : 10.1109/CVPR.2016.90 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Ouyang (2022) Adv. Neural Inf. Process. Syst. Training language models to follow instructions with human feedback 35, 27730
	Not Yet Imported: IEEE Transactions on Image Processing - journal-article : 10.1109/TIP.2019.2937724 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1016/j.neucom.2023.03.070 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Wang (2024) Skeleton-based action recognition with spatial-structural graph convolution , 1
	Not Yet Imported: - proceedings-article : 10.1109/ICCV48922.2021.01318 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Chang (2024) Wavelet-decoupling contrastive enhancement network for fine-grained skeleton-based action recognition , 4060
	Not Yet Imported: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) - proceedings-article : 10.1109/ICCVW.2017.77 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - journal-article : 10.1109/TIP.2023.3308750 If you would like this item imported into the Digital Library, please contact us quoting Journal ID
	Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00333 If you would like this item imported into the Digital Library, please contact us quoting Journal ID

	*Chen, Zhongyuan; Lu, Chong; Wang, Yihan (2025) CIEG-Net: Context Information Enhanced Gated Network for multimodal sentiment analysis. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111785*
	Li, Bingyu; Zhang, Da; Zhao, Zhiyuan; Gao, Junyu; Li, Xuelong (2025) U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111801
	Li, Jianan, Xie, Xuemei, Pan, Qingzhe, Cao, Yuhan, Zhao, Zhifu, Shi, Guangming (2020) SGM-Net: Skeleton-guided multimodal network for action recognition. Pattern Recognition, 104. 107356pp. doi:10.1016/j.patcog.2020.107356
	Bai, Lizhi, Yang, Jun, Tian, Chunqi, Sun, Yaoru, Mao, Maoyu, Xu, Yanjun, Xu, Weirong (2025) DCANet: Differential convolution attention network for RGB-D semantic segmentation. Pattern Recognition, 162. doi:10.1016/j.patcog.2025.111379
	Ijjina, Earnest Paul, Chalavadi, Krishna Mohan (2017) Human action recognition in RGB-D videos using motion sequence information and deep learning. Pattern Recognition, 72. 504-516 doi:10.1016/j.patcog.2017.07.013
	*Zhao, Shenlu, Jin, Ziniu, Jiao, Qiang, Zhang, Qiang, Han, Jungong (2025) Resolving semantic conflicts in RGB-T semantic segmentation. Pattern Recognition, 162. doi:10.1016/j.patcog.2025.111398*
	Chen, Yu, Li, Xiang, Luan, Chao, Hou, Weimin, Liu, Haochen, Zhu, Zihui, Xue, Lian, Zhang, Jianqi, Liu, Delian, Wu, Xin, et al. (2025) Cross-level interaction fusion network-based RGB-T semantic segmentation for distant targets. Pattern Recognition, 161. doi:10.1016/j.patcog.2024.111218
	*Wu, Wei, Chu, Tao, Liu, Qiong (2022) Complementarity-aware cross-modal feature fusion network for RGB-T semantic segmentation. Pattern Recognition, 131. 108881 doi:10.1016/j.patcog.2022.108881*
	Fang, Fengyi; Liao, Zihan; Kan, Zhehan; Wang, Guijin; Yang, Wenming (2025) MDSI: Pluggable Multi-strategy Decoupling with Semantic Integration for RGB-D Gesture Recognition. Pattern Recognition, 166. doi:10.1016/j.patcog.2025.111653
	*Zhang, Jing, Li, Wanqing, Ogunbona, Philip O., Wang, Pichao, Tang, Chang (2016) RGB-D-based action recognition datasets: A survey. Pattern Recognition, 60. 86-105 doi:10.1016/j.patcog.2016.05.019*
	Luo, Yuxuan; Chen, Jinpeng; Cong, Runmin; Ip, Horace Ho Shing; Kwong, Sam (2025) Trace Back and Go Ahead: Completing partial annotation for continual semantic segmentation. Pattern Recognition, 165. doi:10.1016/j.patcog.2025.111613

Mindat.org is an outreach project of the Hudson Institute of Mineralogy, a 501(c)(3) not-for-profit organization.
Copyright © mindat.org and the Hudson Institute of Mineralogy 1993-2025, except where stated. Most political location boundaries are © OpenStreetMap contributors. Mindat.org relies on the contributions of thousands of members and supporters. Founded in 2000 by Jolyon Ralph.
To cite: Ralph, J., Von Bargen, D., Martynov, P., Zhang, J., Que, X., Prabhu, A., Morrison, S. M., Li, W., Chen, W., & Ma, X. (2025). Mindat.org: The open access mineralogy database to accelerate data-intensive geoscience research. American Mineralogist, 110(6), 833–844. doi:10.2138/am-2024-9486.
Privacy Policy - Terms & Conditions - Contact Us / DMCA issues - Report a bug/vulnerability Current server date and time: August 29, 2025 04:26:08

Go to top of page

Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725

References Listed

See Also