Overview
Multimodal search and recommendation (MMSR) systems are at the forefront of modern information retrieval, integrating and processing diverse data types such as text, images, audio, and video within a unified framework. This integration enables more accurate and contextually relevant search results and recommendations, significantly enhancing user experiences. For example, e-commerce platforms now support product search through images to streamline the shopping experience. Recent advances in large language models (LLMs) have extended their capabilities to multimodal inputs, allowing a deeper and more nuanced understanding of content and user preferences.
MMSR will be a half-day workshop at ICDM 2025, held on November 12, 2025, in Washington, DC, USA. The workshop will explore the latest advances, challenges, and applications of multimodal search and recommendation.
Important Dates
All deadlines are at 23:59 GMT.
| Task | Deadline |
|---|---|
| Paper submission deadline | September 1, 2025 |
| Notification of acceptance | October 1, 2025 |
| Camera-ready papers due | October 12, 2025 |
| MMSR '25 Workshop | November 12, 2025 |
Call for Papers
Topics of interest include, but are not limited to:
- From Data to Discovery: Using Multimodal Models for Smarter Search and Recommendations (2025 Special Theme)
  - Strategies for building scalable multimodal discovery engines.
  - Lessons learned from productionizing MMSR models in real-world applications.
  - Handling discovery in cold-start scenarios and sparse multimodal data settings.
  - Balancing discovery and relevance in multimodal recommendation systems.
  - Evaluating business impact and user satisfaction of multimodal discovery systems.
  - Emerging trends in using LLMs for multimodal data exploration and discovery.
  - Personalization strategies tailored to multimodal discovery journeys.
  - Bridging research and practical deployment: overcoming challenges in scaling multimodal models for search and recommendation.
- Cross-Modal Retrieval Techniques
  - Efficiently indexing and retrieving multimodal data.
  - Handling large-scale cross-modal data.
  - Developing metrics to measure similarity across different modalities.
  - Zero-shot and few-shot retrieval across unseen modalities.
  - Adapting retrieval architectures (e.g., dual encoders vs. fusion models) for different multimodal tasks.
- Applications of MMSR to Verticals (e.g., E-commerce, Healthcare, Real Estate)
  - MMSR for image-based product search in e-commerce.
  - Multimodal conversational agents for the healthcare, legal, and retail industries.
  - Augmented reality (AR) and multimodal discovery for shopping experiences.
  - Customer service optimization through multimodal search interfaces (e.g., support chat, help centers).
  - Personalized multimodal travel planning and recommendation systems.
  - Video- and text-based multimodal recommendations in media and entertainment domains.
- User-Centric Design Principles for MMSR Interfaces
  - Designing user-friendly interfaces that support multimodal search.
  - Methods for evaluating the usability of MMSR systems.
  - Ensuring MMSR interfaces are accessible to users with disabilities.
  - Visualizations and interactive feedback mechanisms for multimodal search refinement.
  - A/B testing strategies specific to multimodal search UI/UX improvements.
- Ethical and Privacy Considerations of MMSR
  - Identifying and mitigating biases in multimodal algorithms.
  - Ensuring transparency in how multimodal results are generated and presented.
  - Approaches for obtaining and managing user consent for the use of user data.
  - User perception studies of trust and explainability in multimodal search systems.
  - Privacy-preserving multimodal modeling: federated learning and differential privacy for MMSR.
- Modeling for MMSR
  - Multimodal representation learning.
  - Utilizing pre-trained multimodal LLMs.
  - Dimensionality reduction techniques to manage multimodal complexity.
  - Fine-tuning pre-trained vision-language models.
  - Developing and standardizing metrics to evaluate the performance of MMSR models.
  - Alignment challenges in multimodal embeddings across diverse modalities.
Submission Instructions
Papers must be formatted and written according to the Submission Guidelines on the ICDM 2025 conference website. You are strongly encouraged to print and double-check your PDF file before submission, especially if your paper contains Asian or European language symbols (such as Chinese/Korean characters or accented European letters).
- Please note that at least one author of each accepted paper must register for the workshop.
- All accepted workshop papers will appear in the dedicated ICDMW proceedings published by the IEEE Computer Society Press.
- Non-archival submissions are not allowed: all accepted papers will be included and published in the proceedings.