LIVE-Last scan updating-53 sources active-129 signals today-RESEARCH PGaussDet: Open-Vocabulary and Referring Segmentation for 3D Gaussians Using 2D Detectors
Automated alternatives

Best Hugging Face Inference Providers alternatives.

Live source-backed alternatives to Hugging Face Inference Providers for Vision-language. Alternatives are selected from the same task category and update whenever the best-of index rebuilds.

Alternatives
7
same task category
Sources
15
distinct URLs
Modules
6
indexable
Updated
Jun 26, 2026
from radar data
Reference option

Hugging Face Inference Providers

Official Hugging Face Inference Providers catalog for running model API and serverless inference workflows across text, vision, image generation, speech, embedding, and multimodal model tasks. official_inference_catalog llm api model api inference api serverless inference image generation object detection vision-language embedding model speech-to-text text-to-speech text image audio vision embedding multimodal api hosted model hub serverless inference model hub provider routing task catalog developer inference

RDR81Paid APIHugging Face
Alternative

NVIDIA NIM Model Catalog

Matched vision-language, vision language, multimodal; 3 source links; official inference catalog signal; access model: Free endpoint

RDR84Free endpoint
Alternative

A Unified Framework for Efficient Remote Sensing Visual Question Answering: Adapting Dual, Hybrid, and Encoder-Decoder Architectures

Matched vision-language, vision language, vlm; 1 source link; access model: Research-only

RDR80Research-only
#AlternativeKindAccessFitWhy it appearsSource
01NVIDIA NIM Model Catalog serviceFree endpointRDR84Matched vision-language, vision language, multimodal; 3 source links; official inference catalog signal; access model: Free endpointbuild.nvidia.com
02A Unified Framework for Efficient Remote Sensing Visual Question Answering: Adapting Dual, Hybrid, and Encoder-Decoder ArchitecturespaperResearch-onlyRDR80Matched vision-language, vision language, vlm; 1 source link; access model: Research-onlyarxiv.org
03Fireworks AI Serverless Models servicePaid APIRDR80Matched vision-language, vision language, multimodal; 2 source links; official inference catalog signal; access model: Paid APIdocs.fireworks.ai
04Together AI Serverless Models servicePaid APIRDR80Matched vision-language, vision language, multimodal; 2 source links; official inference catalog signal; access model: Paid APIdocs.together.ai
05Paying More Attention to Visual Tokens in Self-Evolving Large Multimodal ModelspaperResearch-onlyRDR75Matched vision-language, vision language, multimodal; 1 source link; access model: Research-only; freshly updatedarxiv.org
06SARLO-80: Worldwide Slant SAR Language Optic Dataset 80cmpaperResearch-onlyRDR75Matched vision-language, vision language, multimodal; 2 source links; access model: Research-onlyarxiv.org
07RSICCLLM: A Multimodal Large Language Model for Remote Sensing Image Change CaptioningpaperResearch-onlyRDR75Matched vision-language, vision language, multimodal; 2 source links; access model: Research-only; freshly updatedarxiv.org