Sign in
← Back to search

amazon-science/mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Stars
3,991
Forks
331
Commits
14
Language
Python
Awesome lists
1

Similar repositories

Gen-Verse/MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)

1642 stars
Python 1 awesome list

NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

3799 stars
Python 1 awesome list

SkyworkAI/Skywork-R1V

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

3161 stars
Python 1 awesome list

microsoft/MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

5807 stars
Python 1 awesome list

EvolvingLMMs-Lab/Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

3390 stars
Python 1 awesome list

EvolvingLMMs-Lab/multimodal-search-r1

[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

447 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:22

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-02-02
  • First commit: 2023-02-02
  • Last pushed: 2024-06-12
  • Website: https://arxiv.org/abs/2302.00923
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.