Here's a kit of 8 amazing open-source multimodal projects.

by shweta10 · Updated: Mar 25, 2024

Apple Releases New Multimodal Models: MM1 Family 

LAVIS by salesforce

Python · 5474 stars · Version: v1.0.2
License: Permissive (BSD-3-Clause)

LAVIS - A One-stop Library for Language-Vision Intelligence

BentoML by bentoml

Python · 5022 stars · Version: v1.0.22
License: Permissive (Apache-2.0)

Unified Model Serving Framework 🍱

pytorch-widedeep by jrzaurin

Python · 1049 stars · Version: v1.2.2
License: Permissive (Apache-2.0)

A flexible package for multimodal deep learning that combines tabular data with text and images using Wide and Deep models in PyTorch

jina by jina-ai

Python · 18565 stars · Version: v3.17.0
License: Permissive (Apache-2.0)

🔮 Build multimodal AI services via cloud native technologies

unilm by microsoft

Python · 12771 stars · Version: s2s-ft.v0.3
License: Permissive (MIT)

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

NeMo by NVIDIA

Python · 7027 stars · Version: v1.19.0
License: Permissive (Apache-2.0)

NeMo: a toolkit for conversational AI

mmf by facebookresearch

Python · 5241 stars · Version: v0.3.1
License: Others (Non-SPDX)

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

LLaVA by haotian-liu

Python · 3169 stars · Version: Current
License: Permissive (Apache-2.0)

Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
