Instructions to use LanguageBind/LanguageBind_Video with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use LanguageBind/LanguageBind_Video with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("zero-shot-image-classification", model="LanguageBind/LanguageBind_Video") pipe( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png", candidate_labels=["animals", "humans", "landscape"], )# Load model directly from transformers import AutoModelForZeroShotImageClassification model = AutoModelForZeroShotImageClassification.from_pretrained("LanguageBind/LanguageBind_Video", dtype="auto") - Notebooks
- Google Colab
- Kaggle
| { | |
| "add_prefix_space": false, | |
| "bos_token": { | |
| "__type": "AddedToken", | |
| "content": "<|startoftext|>", | |
| "lstrip": false, | |
| "normalized": true, | |
| "rstrip": false, | |
| "single_word": false | |
| }, | |
| "do_lower_case": true, | |
| "eos_token": { | |
| "__type": "AddedToken", | |
| "content": "<|endoftext|>", | |
| "lstrip": false, | |
| "normalized": true, | |
| "rstrip": false, | |
| "single_word": false | |
| }, | |
| "errors": "replace", | |
| "model_max_length": 77, | |
| "name_or_path": "lb203/LanguageBind_Video", | |
| "pad_token": "<|endoftext|>", | |
| "special_tokens_map_file": "./special_tokens_map.json", | |
| "tokenizer_class": "LanguageBindVideoTokenizer", | |
| "unk_token": { | |
| "__type": "AddedToken", | |
| "content": "<|endoftext|>", | |
| "lstrip": false, | |
| "normalized": true, | |
| "rstrip": false, | |
| "single_word": false | |
| } | |
| } | |