- LVLM: ARC Prize, Chatbot, LLM Benchmark, OmniSpatial
- Detection: Zero-Shot COCO, Open-Voc COCO, Open-Voc LVIS
- Text-encoder: MTEB, SuperGLUE
- 6D Object Pose: BOP
- other: KITTI, Mip-NeRF 360
Install the basic requirements:
pip install -r requirements.txtSelect preferences and run the command to install PyTorch >= 2.5.1 locally.
pip install open_clip_torchpip install mss git+https://github.com/facebookresearch/detectron2.git git+https://github.com/openai/CLIP.gitpip install --no-build-isolation provider pyrender torch-clusterpip install xformers==0.0.29 # for torch 2.5.1
pip install --no-deps git+https://github.com/ByteDance-Seed/Depth-Anything-3.git
pip install e3nn evo "moviepy<2" pillow-heif plyfile pycolmappip install git+https://github.com/IDEA-Research/GroundingDINO.gitpip install --no-deps git+https://github.com/robbyant/lingbot-depthpip install git+https://github.com/facebookresearch/sam2.gitpip install diffusersInstall the following packages to run the web server:
pip install fastapi uvicornYou need to create a Python file that stores the API in a dictionary format within a variable named FUNCTIONS, and set the file path in server.py.
Then, run the server using the command below:
uvicorn server:app --port 8000