Skip to content
Change the repository type filter

All

    Repositories list

    • Official repository for "Boosting Visual Reprogramming for CLIP with Dual Granularity Alignment" [CVPR 2026 Highlight]
      Python
      MIT License
      0300Updated Apr 12, 2026Apr 12, 2026
    • [ACM MM 2021] Official Implementation for Complementary Factorization towards Outfit Compatibility Modeling.
      Python
      MIT License
      0100Updated Apr 11, 2026Apr 11, 2026
    • [ACL 2026 main] PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records
      Python
      01010Updated Apr 11, 2026Apr 11, 2026
    • Python
      MIT License
      01190Updated Apr 10, 2026Apr 10, 2026
    • Python
      1300Updated Apr 10, 2026Apr 10, 2026
    • (ICASSP 2026) MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
      Python
      Apache License 2.0
      1100Updated Apr 10, 2026Apr 10, 2026
    • NovelClaw

      Public
      Dynamic-memory-first collaborative AI framework for long-form story generation, chapter planning, and coherent narrative writing
      Python
      MIT License
      2417910Updated Apr 10, 2026Apr 10, 2026
    • [CVPRW26] Official Implementation for "ViSAGE @ NTIRE 2026 Challenge on Video Saliency Prediction"
      Python
      Apache License 2.0
      0600Updated Apr 10, 2026Apr 10, 2026
    • Python
      Apache License 2.0
      0000Updated Apr 10, 2026Apr 10, 2026
    • Python
      Apache License 2.0
      0000Updated Apr 10, 2026Apr 10, 2026
    • Python
      Apache License 2.0
      0000Updated Apr 10, 2026Apr 10, 2026
    • Python
      Apache License 2.0
      0000Updated Apr 10, 2026Apr 10, 2026
    • Python
      Apache License 2.0
      0000Updated Apr 10, 2026Apr 10, 2026
    • Python
      0300Updated Apr 10, 2026Apr 10, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval.
      Python
      Apache License 2.0
      0300Updated Apr 10, 2026Apr 10, 2026
    • Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
      Python
      12600Updated Apr 9, 2026Apr 9, 2026
    • Discover Micro-influencers for Brands via Better Understanding
      Python
      21710Updated Apr 9, 2026Apr 9, 2026
    • [CVPR-2026] Official implementations of "TINA: Text-Free Inversion Attack for Unlearned Text-to-Image Diffusion Models".
      Python
      0300Updated Apr 9, 2026Apr 9, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - ReTrack: Evidence-Driven Dual-Stream Directional Anchor Calibration Network for Composed Video Retrieval.
      Python
      Apache License 2.0
      0400Updated Apr 9, 2026Apr 9, 2026
    • [ICASSP 2026] Official repository of ICASSP 2026 - HINT: Composed Image Retrieval with dual-patH composItional coNtextualized neTwork.
      Python
      Apache License 2.0
      0100Updated Apr 9, 2026Apr 9, 2026
    • Python
      0000Updated Apr 9, 2026Apr 9, 2026
    • [AAAI 2026] Official repository of AAAI 2026 - HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval.
      Python
      Apache License 2.0
      1400Updated Apr 9, 2026Apr 9, 2026
    • [NeurIPS 2025] Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization
      Python
      1700Updated Apr 9, 2026Apr 9, 2026
    • Python
      0100Updated Apr 9, 2026Apr 9, 2026
    • [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning
      Python
      MIT License
      13370Updated Apr 9, 2026Apr 9, 2026
    • Official repository of CVPR 2025 - Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
      Python
      Apache License 2.0
      0100Updated Apr 8, 2026Apr 8, 2026
    • [ACL 2026 main] TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval.
      Python
      Apache License 2.0
      0000Updated Apr 8, 2026Apr 8, 2026
    • Collection of Composed Image Retrieval (CIR) papers.
      2132411Updated Apr 8, 2026Apr 8, 2026
    • SlideRAG

      Public
      PPT-Centric Multimodal RAG for Study Preview & Exam Review
      Python
      MIT License
      0800Updated Apr 8, 2026Apr 8, 2026
    • A framework for filtering noisy correspondence that applies to any pre-trained model performing image-text matching tasks
      Python
      Apache License 2.0
      0000Updated Apr 8, 2026Apr 8, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.