Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725
Reference Type | Journal (article/letter/editorial) | ||
---|---|---|---|
Title | SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos | ||
Journal | Pattern Recognition | ||
Authors | Liu, Dan | Author | |
Meng, Fanrong | Author | ||
Mi, Jinpeng | Author | ||
Ye, Mao | Author | ||
Li, Qingdu | Author | ||
Zhang, Jianwei | Author | ||
Year | 2025 (December) | Volume | 168 |
Publisher | Elsevier BV | ||
DOI | doi:10.1016/j.patcog.2025.111725Search in ResearchGate | ||
Generate Citation Formats | |||
Mindat Ref. ID | 18446599 | Long-form Identifier | mindat:1:5:18446599:6 |
GUID | 0 | ||
Full Reference | Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725 | ||
Plain Text | Liu, Dan; Meng, Fanrong; Mi, Jinpeng; Ye, Mao; Li, Qingdu; Zhang, Jianwei (2025) SAM-Net: Semantic-assisted multimodal network for action recognition in RGB-D videos. Pattern Recognition, 168. doi:10.1016/j.patcog.2025.111725 | ||
In | (2025) Pattern Recognition Vol. 168. Elsevier BV |
References Listed
These are the references the publisher has listed as being connected to the article. Please check the article itself for the full list of references which may differ. Not all references are currently linkable within the Digital Library.
![]() | |
Liu (2024) Neurocomputing Temporal cues enhanced multimodal learning for action recognition in RGB-D videos 594 | |
Wu (2024) Pattern Recognit. Local and global self-attention enhanced graph convolutional network for skeleton-based action recognition | |
Miao (2024) IEEE Trans. Multimed. Adaptive pitfall: Exploring the effectiveness of adaptation in skeleton-based action recognition PP, 1 | |
Radford (2021) Learning transferable visual models from natural language supervision , 8748 | |
Jia (2021) Scaling up visual and vision-language representation learning with noisy text supervision , 4904 | |
Bruce (2022) IEEE Trans. Pattern Anal. Mach. Intell. Mmnet: A model-based multimodal network for human action recognition in rgb-d videos 45, 3522 | |
Simonyan (2014) Adv. Neural Inf. Process. Syst. Two-stream convolutional networks for action recognition in videos 27 | |
Not Yet Imported: IEEE Transactions on Image Processing - journal-article : 10.1109/TIP.2020.2965299 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: 2015 IEEE International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV.2015.510 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00553 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00338 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1109/ACCESS.2020.2983355 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
![]() | |
Not Yet Imported: IEEE Transactions on Circuits and Systems for Video Technology - journal-article : 10.1109/TCSVT.2020.3019293 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR52688.2022.00298 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR.2015.7298714 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1109/TPAMI.2017.2771306 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1109/TIP.2018.2818328 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Yan (2018) Spatial temporal graph convolutional networks for skeleton-based action recognition vol. 32 | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2019.01230 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR42600.2020.00022 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Plizzari (2021) Comput. Vis. Image Underst. Skeleton-based action recognition via spatial and temporal transformer networks 208 | |
Not Yet Imported: - proceedings-article : 10.1145/3581783.3611900 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - proceedings-article : 10.1109/CVPR52729.2023.00634 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Chen (2022) Adv. Neural Inf. Process. Syst. Adaptformer: Adapting vision transformers for scalable visual recognition 35, 16664 | |
Pan (2022) Adv. Neural Inf. Process. Syst. St-adapter: Parameter-efficient image-to-video transfer learning 35, 26462 | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR52729.2023.02206 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
![]() | |
Islam (2020) Hamlet: A hierarchical multimodal attention-based human activity recognition algorithm , 10285 | |
Islam (2022) Mumu: Cooperative multitask learning-based guided multimodal fusion vol. 36, 1043 | |
Ni (2022) Cross-modal knowledge distillation for vision-to-sensor action recognition , 4448 | |
Not Yet Imported: - journal-article : 10.1109/LSENS.2018.2878572 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1007/s12652-019-01239-9 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1109/TIP.2020.2967577 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Bruce (2021) Multimodal fusion via teacher-student network for indoor action recognition vol. 35, 3199 | |
Not Yet Imported: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) - proceedings-article : 10.1109/ICCV48922.2021.00912 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Das (2020) Vpn: Learning video-pose embedding for activities of daily living , 72 | |
Not Yet Imported: - proceedings-article : 10.1109/CVPR.2016.90 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Ouyang (2022) Adv. Neural Inf. Process. Syst. Training language models to follow instructions with human feedback 35, 27730 | |
Not Yet Imported: IEEE Transactions on Image Processing - journal-article : 10.1109/TIP.2019.2937724 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1016/j.neucom.2023.03.070 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Wang (2024) Skeleton-based action recognition with spatial-structural graph convolution , 1 | |
Not Yet Imported: - proceedings-article : 10.1109/ICCV48922.2021.01318 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Chang (2024) Wavelet-decoupling contrastive enhancement network for fine-grained skeleton-based action recognition , 4060 | |
Not Yet Imported: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) - proceedings-article : 10.1109/ICCVW.2017.77 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - journal-article : 10.1109/TIP.2023.3308750 If you would like this item imported into the Digital Library, please contact us quoting Journal ID | |
Not Yet Imported: - proceedings-article : 10.1109/WACV56688.2023.00333 If you would like this item imported into the Digital Library, please contact us quoting Journal ID |
See Also
These are possibly similar items as determined by title/reference text matching only.
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() | |
![]() |