🐷
Fighting
Research Fellow at MMLab@NTU on Large Multi-Modality Models for Perception and Generation.
-
Nanyang Technological University
- Paranioar.github.io/
- @paranioar
- in/haiwen-diao-95987a281
Pinned Loading
-
OpenSenseNova/SenseNova-U1
OpenSenseNova/SenseNova-U1 PublicSenseNova-U series: Native Unified Paradigm with NEO-Unify from the First Principles
-
EvolvingLMMs-Lab/NEO
EvolvingLMMs-Lab/NEO PublicNEO Series: Native Vision-Language Models from First Principles
-
baaivision/EVE
baaivision/EVE PublicEVE Series: Encoder-Free Vision-Language Models from BAAI
-
baaivision/NOVA
baaivision/NOVA Public[ICLR 2025] Autoregressive Video Generation without Vector Quantization
-
Awesome_Matching_Pretraining_Transfering
Awesome_Matching_Pretraining_Transfering PublicThe Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Ins…
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

