Busy season, huh?
#ICLR decisions are out and
#CVPR rebuttals are flying... but donโt miss this! ๐
๐ฃ ๐๐ฎ๐น๐น ๐ณ๐ผ๐ฟ ๐๐ผ๐ป๐๐ฟ๐ถ๐ฏ๐๐๐ถ๐ผ๐ป๐: We're organizing a new edition of the ๐ ๐๐น๐๐ถ๐บ๐ผ๐ฑ๐ฎ๐น ๐๐น๐ด๐ผ๐ฟ๐ถ๐๐ต๐บ๐ถ๐ฐ ๐ฅ๐ฒ๐ฎ๐๐ผ๐ป๐ถ๐ป๐ด Workshop @
#CVPR2026 (Denver)!
โ
๐ง๐ต๐ฒ ๐๐๐ฏ๐บ๐ถ๐๐๐ถ๐ผ๐ป ๐ฝ๐ผ๐ฟ๐๐ฎ๐น ๐ถ๐ ๐ป๐ผ๐ ๐ผ๐ฝ๐ฒ๐ป, and we welcome both new and previously published work.
๐ ๐ฆ๐๐ฏ๐บ๐ถ๐๐๐ถ๐ผ๐ป ๐ด๐๐ถ๐ฑ๐ฒ๐น๐ถ๐ป๐ฒ๐ (details on the workshop website):
We accept three types of submissions:
ย ย โข Original papers (โค 8 pages, in proceedings)
ย ย โข Short papers (โค 4 pages, workshop website only)
ย ย โข Previously published papers (โค 8 pages, workshop website only)
๐๏ธ ๐๐ฒ๐ ๐ฑ๐ฎ๐๐ฒ๐:
๐ฆ๐๐ฏ๐บ๐ถ๐๐๐ถ๐ผ๐ป ๐ฑ๐ฒ๐ฎ๐ฑ๐น๐ถ๐ป๐ฒ: ๐๐ฒ๐ฏ ๐ฎ๐ณ, ๐ฎ๐ฌ๐ฎ๐ฒ
Notification: Mar 20, 2026
Camera-ready: Apr 10, 2026
๐ ๐ช๐ฒ๐ฏ๐๐ถ๐๐ฒ:
marworkshop.github.io/cvpr26โฆ
๐ ๐ช๐ผ๐ฟ๐ธ๐๐ต๐ผ๐ฝ ๐ณ๐ผ๐ฐ๐๐:
This workshop focuses on multimodal algorithmic reasoning, where ๐ฎ๐ป ๐ฎ๐ด๐ฒ๐ป๐ ๐บ๐๐๐ ๐ฎ๐๐๐ถ๐บ๐ถ๐น๐ฎ๐๐ฒ ๐ถ๐ป๐ณ๐ผ๐ฟ๐บ๐ฎ๐๐ถ๐ผ๐ป ๐ณ๐ฟ๐ผ๐บ ๐บ๐๐น๐๐ถ๐ฝ๐น๐ฒ ๐บ๐ผ๐ฑ๐ฎ๐น๐ถ๐๐ถ๐ฒ๐ ๐ณ๐ผ๐ฟ ๐ฐ๐ผ๐บ๐ฝ๐น๐ฒ๐
๐ฝ๐ฟ๐ผ๐ฏ๐น๐ฒ๐บ ๐๐ผ๐น๐๐ถ๐ป๐ด. Real-world examples of such problems include: (i) chain-of-thought reasoning across modalities, (ii) vision-and-language problem solving, (iii) agentic reasoning and tool use, and (iv) reasoning under physical constraints, among others.
๐ง๐ต๐ฒ ๐๐ผ๐ฝ๐ถ๐ฐ๐ ๐ณ๐ผ๐ฟ ๐ ๐๐ฅ-๐๐ฉ๐ฃ๐ฅ ๐ฎ๐ฌ๐ฎ๐ฒ ๐ถ๐ป๐ฐ๐น๐๐ฑ๐ฒ, ๐ฏ๐๐ ๐ฎ๐ฟ๐ฒ ๐ป๐ผ๐ ๐น๐ถ๐บ๐ถ๐๐ฒ๐ฑ ๐๐ผ:
๐น Multimodal structured and multi-step reasoning across vision, language, audio, and other modalities, including compositional and programmatic inference.
๐น Multimodal foundation models and world models for reasoning, planning, and decision-making, and their connections to general intelligence.
๐น Reasoning under physical, geometric, and causal constraints, including embodied agents, simulators, and digital twins.
๐น Multi-agent reasoning and collaboration, including debate, coordination, mixture-of-experts, and reward- or critique-based aggregation.
๐น Extreme generalization and concept learning, including few-shot, zero-shot, and out-of-distribution multimodal reasoning.
๐น Scaling laws, efficiency, and test-time reasoning, including inference-time optimization, self-refinement, and tool-augmented reasoning.
๐น Benchmarks, datasets, diagnostics, and evaluation, including synthetic data, interpretability, and systematic analysis of shortcomings and failure modes in multimodal AI models.
๐น Theoretical and cognitive perspectives on multimodal reasoning, including limits of current models and insights from human cognition.
๐น HumanโAI reasoning comparisons and foundations, including perspectives from psychology, neuroscience, and child development; theoretical limits of reasoning in large models; and position papers on how current multimodal AI reasoning differs from human cognition.
#MultimodalReasoning
#Reasoning
#AlgorithmicReasoning
#Multimodal
#AI
#VisionLanguage
#VisionLanguageModels
#VLM
#Agents
#ToolUse
#LLM
#FoundationModels
#Research
#MachineLearning
#DeepLearning
#CallForPapers