DEV Community

Dr. Carlos Ruiz Viquez
Dr. Carlos Ruiz Viquez

Posted on

**Challenge: "Unsupervised Discovery of Multimodal Dynamics

Challenge: "Unsupervised Discovery of Multimodal Dynamics in Time-Series Sequences"

Background: As AI systems become increasingly ubiquitous, monitoring and analyzing complex dynamic systems is more crucial than ever. Computer vision has traditionally been applied to static or low-complexity time-series sequences. However, many real-world scenarios involve multimodal, high-dimensional, and time-varying information flows.

Task: Develop a novel computer vision approach that discovers and models multimodal dynamics in high-dimensional, time-series sequences, where:

  1. Multimodality: Each sequence contains multiple heterogeneous modalities (e.g., audio, video, text, and sensor data) that coexist and interact in complex ways.
  2. High-dimensionality: Sequences have a high number of features and time steps, leading to a significant computational and analytical challenge.
  3. Dynamics: Sequences exhibit complex temporal patterns, including both deterministic and stochastic components.

Constraints:

  • Unsupervised learning: The system must learn from raw, unlabeled data.
  • Real-time processing: The approach should be capable of processing sequences in near real-time.
  • Interpretability: The system should provide insights into the underlying dynamics and relationships between modalities.

Evaluation Metrics:

  1. Quality of discovered patterns: Measures of similarity and coherence between predicted and actual dynamics.
  2. Efficiency: Processing speed and computational complexity.
  3. Robustness: Ability to handle noisy or missing data.

Dataset: We provide a unique dataset consisting of multimodal time-series sequences from various real-world domains (e.g., industrial processes, environmental monitoring, and financial markets).

Submission Guidelines:

  1. Participating teams will submit a technical paper describing their approach, methodology, and results.
  2. A code repository (e.g., GitHub) containing the implemented solution and any necessary dependencies.
  3. A short video (~5 minutes) demonstrating the system's performance on the provided dataset.

Prizes:

  • Best paper award: $5,000
  • Efficiency and robustness award: $3,000
  • Interpretable insights award: $2,000

Timeline:

  • Registration deadline: March 15, 2026
  • Submission deadline: May 15, 2026
  • Review and evaluation: June 1 - August 15, 2026
  • Award announcement and publication: September 2026

Join us in pushing the boundaries of computer vision and time-series analysis!


Publicado automáticamente

Top comments (0)