Skip to content
Change the repository type filter

All

    Repositories list

    • SEED-Story: Multimodal Long Story Generation with Large Language Model
      Python
      Other
      5672320Updated Oct 11, 2024Oct 11, 2024
    • Open-MAGVIT2: Democratizing Autoregressive Visual Generation
      Python
      Apache License 2.0
      2664750Updated Sep 27, 2024Sep 27, 2024
    • Official Code for MotionCtrl [SIGGRAPH 2024]
      Python
      Apache License 2.0
      711.3k240Updated Sep 20, 2024Sep 20, 2024
    • ST-LLM

      Public
      [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
      Python
      Apache License 2.0
      411790Updated Sep 10, 2024Sep 10, 2024
    • mllm-npu

      Public
      mllm-npu: training multimodal large language models on Ascend NPUs
      Python
      Apache License 2.0
      27920Updated Aug 29, 2024Aug 29, 2024
    • MasaCtrl

      Public
      [ICCV 2023] Consistent Image Synthesis and Editing
      Python
      Apache License 2.0
      26720212Updated Aug 19, 2024Aug 19, 2024
    • Plot2Code

      Public
      Python
      21500Updated Aug 17, 2024Aug 17, 2024
    • PhotoMaker [CVPR 2024]
      Jupyter Notebook
      Other
      7539.4k1423Updated Aug 15, 2024Aug 15, 2024
    • GFPGAN

      Public
      GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
      Python
      Other
      5.9k36k34623Updated Jul 26, 2024Jul 26, 2024
    • CustomNet

      Public
      Python
      Apache License 2.0
      1026261Updated Jul 22, 2024Jul 22, 2024
    • BrushNet

      Public
      [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
      Python
      Other
      1141.4k431Updated Jul 17, 2024Jul 17, 2024
    • ViT-Lens

      Public
      [CVPR 2024] ViT-Lens: Towards Omni-modal Representations
      Python
      Other
      1015630Updated Jul 2, 2024Jul 2, 2024
    • T2I-Adapter
      Python
      2043.4k846Updated Jun 21, 2024Jun 21, 2024
    • SmartEdit

      Public
      Official code of SmartEdit [CVPR-2024 Highlight]
      Python
      8239100Updated Jun 21, 2024Jun 21, 2024
    • InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
      Python
      Apache License 2.0
      3313.2k972Updated Jun 20, 2024Jun 20, 2024
    • LLaMA-Pro

      Public
      [ACL 2024] Progressive LLaMA with Block Expansion.
      Python
      Apache License 2.0
      35470220Updated May 20, 2024May 20, 2024
    • NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
      Python
      Other
      1839071Updated May 14, 2024May 14, 2024
    • BTS

      Public
      BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
      Other
      02540Updated Apr 16, 2024Apr 16, 2024
    • UMT

      Public
      UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.
      Python
      Other
      1819010Updated Apr 15, 2024Apr 15, 2024
    • BEBR

      Public
      Official code for "Binary embedding based retrieval at Tencent"
      Python
      Apache License 2.0
      14320Updated Mar 7, 2024Mar 7, 2024
    • DeSRA

      Public
      Official codes for DeSRA (ICML 2023)
      Python
      012450Updated Feb 2, 2024Feb 2, 2024
    • ViSFT

      Public
      Python
      Apache License 2.0
      23310Updated Jan 20, 2024Jan 20, 2024
    • MM-RealSR

      Public
      Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"
      Python
      BSD 3-Clause "New" or "Revised" License
      12155100Updated Jan 16, 2024Jan 16, 2024
    • HOSNeRF

      Public
      HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
      Python
      Apache License 2.0
      76631Updated Dec 12, 2023Dec 12, 2023
    • VTLayout

      Public
      0310Updated Oct 23, 2023Oct 23, 2023
    • TVTS

      Public
      Turning to Video for Transcript Sorting
      Jupyter Notebook
      Other
      24410Updated Aug 27, 2023Aug 27, 2023
    • AnimeSR

      Public
      Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"
      Python
      Other
      3433181Updated Aug 18, 2023Aug 18, 2023
    • pi-Tuning

      Public
      Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.
      Python
      Other
      13220Updated Jul 21, 2023Jul 21, 2023
    • SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
      Apache License 2.0
      67640Updated Jul 10, 2023Jul 10, 2023
    • GVT

      Public
      Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".
      Python
      Apache License 2.0
      05750Updated Jun 27, 2023Jun 27, 2023