🎗️
Figuring out LLMs for Vision
CS Ph.D. Student at Georgia Tech | GRA @SHI-Labs | IIT Roorkee CSE 2023
-
Georgia Tech
-
05:24
(UTC -05:00) - https://praeclarumjj3.github.io/
- @praeclarumjj
Highlights
- Pro
Pinned Loading
-
SHI-Labs/OneFormer
SHI-Labs/OneFormer PublicOneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
-
SHI-Labs/OLA-VLM
SHI-Labs/OLA-VLM PublicOLA-VLM: Elevating Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024
Python 5
-
SHI-Labs/VCoder
SHI-Labs/VCoder PublicVCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024
-
Picsart-AI-Research/SeMask-Segmentation
Picsart-AI-Research/SeMask-Segmentation Public[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
-
SHI-Labs/FcF-Inpainting
SHI-Labs/FcF-Inpainting Public[WACV 2023] Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.