Special Session
Special Session Ⅰ: Multimodal Perception and Its Applications
Session Chair: Prof. Changxin Gao——Huazhong University of Science and Technology, China
Assoc. Prof. Yuanjie Shao——Huazhong University of Science and Technology, China
Information: Multimodal perception has emerged as a crucial research area in artificial intelligence. By integrating information from multiple modalities, such as visual, auditory, and text data, it enables a more accurate and comprehensive understanding of the real-world environment. This special session aims to explore the latest advances in multimodal perception technologies and their wide-ranging application scenarios. Multimodal perception is not only revolutionizing traditional computer vision and audio processing tasks but also finding applications in emerging fields such as virtual reality, robotics, and smart healthcare.
The topics to be covered include, but are not limited to:
Multimodal Fusion
Multimodal Object Recognition
Multimodal Scene Understanding
Multimodal Object Detection
Multimodal Learning Algorithms
Multimodal Alignment
Multimodal Retrieval
Multimodal-guided Image Enhancement
Collaborative Perception
Submission Deadline: June 10, 2025
Special Session Ⅱ: Cross-media Intelligent Analysis and Reasoning
Session Chair: Prof. Libo Liu——Ningxia University, China
Dr. Ruonan Zhang——Ningxia University, China
Keywords: Multi-modal/cross-modal Retrieval, Large model, Representation learning, Labeling and description, Interaction, Alignment, Generation, Analysis
Information: The rapid development of internet and multimedia technologies has led to explosive growth in multimodal data such as images, videos, text, and audio. Efficiently and accurately retrieving the information users need from these heterogeneous data has become a critical challenge in information retrieval. In this context, Cross-media Intelligent Analysis and Reasoning has emerged, aiming to break down the barriers between modalities and achieve deep integration of cross-modal semantic understanding and information retrieval. This session will focus on the core technologies, application scenarios, and future development trends of preprocessing in cross-media intelligent analysis, with the goal of advancing innovation in and application of cross-media intelligent analysis and reasoning technologies.
The topics to be covered include, but are not limited to:
Cross-media Representation Learning
Cross-media Knowledge Graph
Cross-media Reasoning Models
Cross-media Retrieval
Cross-media Generation
Cross-media Sentiment Analysis
Cross-media Event Analysis
Applications of Cross-media in Smart Cities, Healthcare, Education, Finance, Entertainment, etc.
Inference, enhancement, and related applications of large models, etc.
Submission Deadline: May 15, 2025
Special Session Ⅲ: Evolutionary Computation and Swarm Intelligence for Solving Dynamic / Large-Scale / Multi-Objective Optimization Problems
Session Chair: Prof. Wei Song——Jiangnan University, China
Keywords: Evolutionary Computation, Swarm Intelligence, Dynamics, Large-scale, Multi-Objective, Optimization
Information: The field of optimization is undergoing a significant transformation driven by advances in computational techniques and artificial intelligence (AI). Dynamic / large-scale / multi-objective optimization (D/L/MO) algorithms attempt to handle complex optimization problems with multiple objectives and/or large-scale decision variables in a changing environment. This session will provide an in-depth understanding of D/L/MO and its diverse applications. We will:
*Solving D/L/MO Problems: Cover the knowledge, principles, capabilities, and emerging techniques for solving D/L/MO problems.
*Exploring Real-World Applications: Examine how D/L/MO algorithms are applied in domains such as engineering, logistics, finance, and healthcare.
*Seeking Challenges and Opportunities: Identify the challenges facing existing D/L/MO algorithms, such as reducing computational complexity, accelerating convergence, and preventing premature convergence, and explore the opportunities that efficient D/L/MO algorithms present for handling real-world problems.
This session is designed for researchers, practitioners, and students interested in D/L/MO. Whether you are an expert or new to the field, this session will provide valuable insights and foster discussion on how D/L/MO algorithms can address complex optimization problems.
The topics to be covered include, but are not limited to:
Novel algorithms for solving D/L/MO problems, including evolutionary computation, swarm intelligence, reinforcement learning, and transfer learning
Hybrid algorithms combining evolutionary computation with other optimization algorithms or learning techniques
Tailored algorithms for solving specific types of D/L/MO problems, e.g., sparse problems, constrained problems, expensive problems, multimodal problems, and others
Applications of existing algorithms to D/L/MO problems in emerging areas, e.g., machine learning, data mining, manufacturing, scheduling, electrics, economics, bioinformatics, medicine, and others
Performance assessment, theoretical analysis, and benchmarking of algorithms for D/L/MO problems
Future directions in D/L/MO: Emerging trends, future research directions, and potential breakthroughs in D/L/MO
Submission Deadline: June 10, 2025
Special Session Ⅳ: Multirobot Collaboration and Its Applications
Session Chair: Assoc. Prof. Jiyu Cheng——Shandong University, China
Keywords: Multirobot System, Collaborative Perception, Collaborative Decision-making, Autonomous Learning
Information: In recent years, as multirobot systems have been increasingly used in fields such as power inspection, autonomous unmanned vehicle fleets, intelligent warehousing, and logistics, multirobot technology has gradually become a research hotspot in robotics. In many complex application scenarios, multiple robots cannot acquire environmental map information in advance and must operate in unknown environments. In such scenarios, multirobot collaboration becomes particularly important. It is therefore worthwhile to devote more effort to exploring its potential and to shed light on its future development directions.
The topics to be covered include, but are not limited to:
Multirobot Exploration and Navigation Algorithms
Machine Learning and Artificial Intelligence in Multirobot Systems
Sensor Fusion and Perception for Multirobot Systems
Multirobot Systems and Heterogeneous Robot Teams
Applications of Autonomous Robot Systems and Swarm Intelligence
Submission Deadline: June 10, 2025
Special Session Ⅴ: Advances in Image Computing
Session Chair: Assoc. Prof. Huanjie Tao——Northwestern Polytechnical University, China
Keywords: Image Classification, Image Recognition, Image Captioning, Image Segmentation, Image Reconstruction
Information: Image computing refers to technology that uses computers to process, analyze, and understand image data, with the aim of extracting meaningful information, achieving specific functions, or solving practical problems. With the development of deep learning and large models, image computing technologies have made significant progress. This session highlights advances and applications in image computing research.
The topics to be covered include, but are not limited to:
Image classification
Image retrieval
Image registration
Image segmentation
Image captioning
Image watermarking
Image compression
Image generation
Image reconstruction
Image enhancement
Image denoising
Image super-resolution
Image restoration
Image object detection
Image anomaly detection
Image quality assessment
Image depth estimation
Image feature extraction
Image feature learning
Image privacy protection
Image dataset generation
Image computing systems
Image feature representation: Image-level feature representation, dataset-level feature representation
Image-to-text visual question answering
Image dataset quality assessment
Multimodal image fusion
Multi-view image reconstruction
Lightweight image computing models
Medical image processing
Remote sensing image processing
Security and ethical issues in image computing
Large model-based image computing methods
Large-scale image dataset distillation
Traditional image processing methods
Applications of image computing
Submission Deadline: June 15, 2025
Special Session Ⅵ: Mobile Visual Computing
Session Chair: Assoc. Prof. Lingyan Ran——Northwestern Polytechnical University, China
Assoc. Prof. Shizhou Zhang——Northwestern Polytechnical University, China
Keywords: Visual Perception Based on Edge Computing, Scene Understanding, Embodied Vision, Multimodal Sensor Fusion, Multi-modality Cognition
Information: Mobile visual computing refers to the integration of advanced computer vision, image processing, and machine learning technologies into mobile devices (e.g., smartphones, drones, AR/VR headsets) to enable real-time analysis, interpretation, and interaction with visual data. It combines hardware advancements (e.g., high-resolution cameras, GPUs, AI accelerators) with software algorithms to support applications such as augmented reality, object recognition, scene reconstruction, and immersive media experiences. Key challenges include optimizing computational efficiency for resource-constrained environments, ensuring low latency, and balancing power consumption. This field drives innovation in areas such as autonomous navigation, healthcare imaging, and interactive entertainment, reshaping how users perceive and interact with the digital-physical world.
The topics to be covered include, but are not limited to:
Edge-Accelerated Model Compression
Multimodal Sensing and Fusion
Generative AI for Mobile Visual Computing
Energy-Efficient Design for Mobile Visual Systems
Dynamic Environment Adaptation
Submission Deadline: June 15, 2025
Special Session Ⅶ: Intelligent Optimization for Solving Real-Time Constraints and Unsteady Problems in Dynamic System / Large-Scale Scenarios
Session Chair: Prof. Rong Fei——Northwestern Polytechnical University, China
Keywords: Large-Scale Network, Dynamic Optimization, Fault Diagnosis, Data Augmentation, Deep Learning, Performance Evaluation
Information: This special session focuses on the innovation and application of intelligent optimization technologies in dynamic systems and large-scale complex scenarios. With the rapid development of deep learning and large-scale model technologies, intelligent optimization has achieved remarkable progress across multiple domains. The session will center on key technologies such as large-scale networks, dynamic optimization, fault diagnosis, and data augmentation, exploring their innovative applications under core challenges including real-time constraints, non-stationary environments, and limited computational resources.
The topics to be covered include, but are not limited to:
Real-time evolutionary optimization methods for dynamic systems
Novel algorithms tailored for specific application scenarios, such as fault diagnosis, indoor positioning, and gesture recognition
New algorithms addressing real-time constraints and non-stationary problems, including evolutionary computation, swarm intelligence, reinforcement learning, and transfer learning
Hybrid algorithms integrating dynamic optimization with other learning techniques
Performance evaluation, theoretical analysis, and algorithm benchmarking
Submission Deadline: June 15, 2025
Special Session Ⅷ: Multimodal Information Perception and Human-Machine Interaction
Session Chair: Prof. Lin Gan——Northwestern Polytechnical University, China
Dr. Zhongjie Li——Northwestern Polytechnical University, China
Keywords: Visual, Auditory, Brain, Information, Perception, Human-Machine Integration
Information: This special session focuses on frontier technologies for human-machine multimodal information fusion and perception. One example is the construction of data fusion models based on visual and auditory perception mechanisms.
The topics to be covered include, but are not limited to:
Visual and auditory perception and human-machine interaction modes
Human-machine integrated information perception
Submission Deadline: June 15, 2025