Skip to content
Change the repository type filter

All

    Repositories list

    • COIN-Detours: Context-Aware Retrieval and Generative Temporal Grounding for Instructional Video Detours
      Python
      Apache License 2.0
      0000Updated Jun 25, 2026Jun 25, 2026
    • NaviGen

      Public
      Python
      Apache License 2.0
      0110Updated Jun 23, 2026Jun 23, 2026
    • Python
      1400Updated Jun 23, 2026Jun 23, 2026
    • Python
      MIT License
      0100Updated Jun 23, 2026Jun 23, 2026
    • Python
      MIT License
      0110Updated Jun 23, 2026Jun 23, 2026
    • [NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
      Python
      MIT License
      1218640Updated Jun 17, 2026Jun 17, 2026
    • Latest Papers, Codes and Datasets on VTG-LLMs.
      39400Updated Jun 12, 2026Jun 12, 2026
    • [CVPRW26] Official Implementation for "ParseFixer: An Agentic Framework for Document Parsing via Selective Multimodal Correction"
      Python
      Apache License 2.0
      0100Updated Jun 11, 2026Jun 11, 2026
    • [SIGIR'25] FiRE: enhancing mllms with fine-grained context learning for complex image retrieval
      Python
      0000Updated Jun 10, 2026Jun 10, 2026
    • [CVPRW26] Official Implementation for "ChartLens: A Dual-Branch Framework for Chart Data Correction and Factual Summary Refinement"
      Python
      Apache License 2.0
      0100Updated Jun 10, 2026Jun 10, 2026
    • Python
      78582Updated Jun 9, 2026Jun 9, 2026
    • Collection of Composed Image Retrieval (CIR) papers.
      2235611Updated Jun 8, 2026Jun 8, 2026
    • ASPNet

      Public
      Python
      0000Updated Jun 7, 2026Jun 7, 2026
    • [ACM MM 2025] PUMA: Layer-Pruned Language Model for Efficient Unified Multimodal Retrieval with Modality-Adaptive Learning
      Python
      Apache License 2.0
      01800Updated Jun 6, 2026Jun 6, 2026
    • [CVPR 2026] Official Implementation for Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulat…
      Python
      MIT License
      12200Updated Jun 4, 2026Jun 4, 2026
    • NovelClaw

      Public
      Dynamic-memory-first collaborative AI framework for long-form story generation, chapter planning, and coherent narrative writing
      Python
      MIT License
      4533800Updated May 31, 2026May 31, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval.
      Python
      Apache License 2.0
      0500Updated May 25, 2026May 25, 2026
    • [ACM MM 2025] Official repository of ACM MM 2025 - OFFSET: Segmentation-based Focus Shift Revision for Composed Image Retrieval.
      Python
      Apache License 2.0
      0400Updated May 25, 2026May 25, 2026
    • MM25-HUD

      Public
      [ACM MM 2025] Official repository of ACM MM 2025 - HUD: Hierarchical Uncertainty-Aware Disambiguation Network for Composed Video Retrieval.
      Python
      Apache License 2.0
      0400Updated May 25, 2026May 25, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval.
      Python
      Apache License 2.0
      1900Updated May 25, 2026May 25, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - ReTrack: Evidence-Driven Dual-Stream Directional Anchor Calibration Network for Composed Video Retrieval.
      Python
      Apache License 2.0
      0800Updated May 25, 2026May 25, 2026
    • [AAAI 2025] Official repository of AAAI 2025 - ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval
      Python
      Apache License 2.0
      0700Updated May 25, 2026May 25, 2026
    • [CVPR 2026] Official repository of CVPR 2026 - ConeSep: Cone-based Robust Noise-Unlearning Compositional Network for Composed Image Retrieval.
      Python
      Apache License 2.0
      0610Updated May 25, 2026May 25, 2026
    • [CVPR 2026] Official repository of Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval
      Python
      Apache License 2.0
      0600Updated May 25, 2026May 25, 2026
    • [ACL 2026 main] TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval.
      Python
      Apache License 2.0
      0400Updated May 25, 2026May 25, 2026
    • [TIP 2026] Official repository of TIP 2026 - COMBINER: Composed Image Retrieval Guided by Attribute-based Neighbor Relations.
      Python
      1500Updated May 25, 2026May 25, 2026
    • Long Short-Term Imputer: Handling Consecutive Missing Values in Time Series
      Python
      MIT License
      0000Updated May 24, 2026May 24, 2026
    • Meta Guidance: Incorporating Inductive Biases into Deep Time Series Imputers
      Python
      MIT License
      0000Updated May 24, 2026May 24, 2026
    • Official Implementation of the Paper: A Transferable Augmentation Framework to Combat Distribution Shifts(TMLR25)
      Python
      Apache License 2.0
      0000Updated May 22, 2026May 22, 2026
    • Official repository for "Boosting Visual Reprogramming for CLIP with Dual Granularity Alignment" [CVPR 2026 Highlight]
      Python
      MIT License
      0300Updated May 15, 2026May 15, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.