SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding.

Supervised Cross-modal Contrastive Learning for Audio-Visual Coding

2023/10/27 · We propose a Supervised Cross-modal Contrastive Learning Framework for Audio-Visual Coding (SCLAV). Our framework includes an audio-visual coding network.

Supersunn/SCLAV - GitHub

github.com › Supersunn › SCLAV

This is an open source repository for our paper SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding based on the pytorch framework.

Supervised Cross-modal Contrastive Learning for Audio-Visual Coding

www.researchgate.net › ... › Supervision

ADSH treats the query points and database points in an asymmetric way. More specifically, ADSH learns a deep hash function only for query points, while the hash ...

Publications | Never-Ending Travel

2oil.top › publication

SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding. In the 31st ACM International Conference on Multimedia (MM'23). Never-Ending ...

‪Chao Sun‬ - ‪Google Scholar‬

scholar.google.com.hk › citations

SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding. C Sun, M Chen, J Cheng, H Liang, C Zhu, J Chen. Proceedings of the 31st ACM ...

Self-Supervised Audio-Visual Representation Learning with ...

www.connectedpapers.com › search › q=...

We present CrissCross , a self-supervised framework for learning audio-visual representations. A novel notion is introduced in our framework ...

[PDF] Self-Supervised Learning by Cross-Modal Audio-Video Clustering

www.semanticscholar.org › paper › Self-...

A novel self-supervised method that leverages unsupervised clustering in one modality as a supervisory signal for the other modality, is proposed.

Audio-Visual Contrastive Learning with Temporal Self-Supervision

arxiv.org › cs

2023/02/15 · We propose a self-supervised learning approach for videos that learns representations of both the RGB frames and the accompanying audio without human ...

[PDF] Teaser Session M1 - ACM MM 2023

www.acmmm2023.org › 2023/10

2023/10/30 · SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding mmfp3292. MTSN: Multiscale Temporal Similarity Network for ...

CrossVideo: Self-supervised Cross-modal Contrastive Learning ...

arxiv.org › cs

2024/01/17 · This paper introduces a novel approach named CrossVideo, which aims to enhance self-supervised cross-modal contrastive learning in the field of point cloud ...

含まれない: SCLAV: Audio-