Visual Modality Examples

Is visual content modality a limiting factor for social capital? Examining user engagement within Instagram-based brand communities

In the age of virtual cocreation of value by consumers, the role of the content modality in the development of social capital has been largely overlooked. Given that different modalities lead to ...

GitHub

visual_modality_prompt_for_adapting_vision-language_object_detectors.md

[ICCV 2025][Object Detection][Visual Prompt] This paper proposes ModPrompt, an encoder-decoder-based visual prompting strategy that adapts vision-language object detectors (e.g., ...

Microsoft

Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser

Audio-visual learning has been a major pillar of multi-modal machine learning, where the community mostly focused on its modality-aligned setting, i.e., the audio and visual modality are both assumed ...

Modality-Specific Features in Sign Languages

Sign languages (SLs), as natural human languages, operate within the visual-gestural modality, setting them apart from the oral-auditory systems of spoken languages. While SLs share universal ...

GitHub

Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios (WACV 2025)

This paper tackles the domain of multimodal prompting for visual recognition, specifically when dealing with missing modalities through multimodal Transformers. It presents two main contributions: (i) ...

IEEE

Bidirectional Cross-Modal Collaborative Alignment via Semantic-Guided Visual Embeddings for Partially Relevant Video Retrieval

Abstract: Partially Relevant Video Retrieval (PRVR) aims to retrieve videos that match a given textual query only partially. This task is inherently challenging due to the modality gap between text ...

eLife

Modality-Agnostic Decoding of Vision and Language from fMRI

Average decoding scores for modality-agnostic decoders (green), compared to modality-specific decoders trained on data from subjects viewing images (orange) or on data from subjects viewing captions ...
