MCP · A2A · x402 · agentndx.ai llms.txt MCP endpoint
10 Model Providers · 6 Infrastructure · 1 Local Inference · Protocol: MCP
Model Providers
LLM inference accessible as MCP tools. Call models directly from any agent workflow.
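As a rough illustration of what "calling a model as an MCP tool" looks like on the wire, the sketch below builds a JSON-RPC 2.0 `tools/call` request, the message type MCP uses for tool invocation. The tool name `generate_text` and its arguments are hypothetical placeholders; each server in the table defines its own tools, which a client would normally discover with `tools/list` after the `initialize` handshake.

```python
import json
from itertools import count

# Monotonic request ids; JSON-RPC only requires that they are unique per session.
_request_ids = count(1)


def tools_call_request(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(message)


# Hypothetical tool name and arguments, for illustration only.
request = tools_call_request(
    "generate_text",
    {"model": "llama3", "prompt": "Summarize MCP in one sentence."},
)
print(request)
```

How the message is delivered (stdio or HTTP) depends on the server and is left out of this sketch.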
Provider | Models | Speed | Deployment | Auth | Status
Replicate MCP
Official MCP server for Replicate, enabling agents to run ML model inference across thousands of open-source models.
Models: 1000s of open models | Speed: variable | Deployment: cloud | Auth: api-key | Status: verified
Perplexity MCP
MCP server for Perplexity's AI-powered search API, providing sourced answers with citations for agent research tasks.
Models: Sonar (search-augmented) | Speed: standard | Deployment: cloud | Auth: api-key | Status: verified
OpenAI MCP
Access OpenAI models including GPT-4o, o1, DALL-E, and Whisper from MCP-compatible agents. Use OpenAI capabilities as a tool inside any MCP workflow.
Models: GPT-4o, o1, DALL-E, Whisper | Speed: standard | Deployment: cloud | Auth: api-key
Groq MCP
Ultra-fast LLM inference via Groq. Access Llama 3, Mixtral, and Gemma models at speeds up to 800 tokens/sec from any MCP-compatible agent.
Models: Llama 3, Mixtral, Gemma | Speed: 800+ tok/s | Deployment: cloud | Auth: api-key
HuggingFace MCP
Access 900,000+ models on HuggingFace Hub. Run inference, search models, retrieve datasets, and interact with Spaces from AI agent workflows.
Models: 900K+ models on Hub | Speed: variable | Deployment: both | Auth: api-key | Status: verified
Cohere MCP
Access Cohere Command and Embed models from AI agents. Generate text, create embeddings, rerank search results, and build RAG pipelines via Cohere API.
Models: Command R+, Embed, Rerank | Speed: standard | Deployment: cloud | Auth: api-key | Status: verified
Mistral MCP
Access Mistral AI models including Mistral Large, Codestral, and Pixtral from MCP-compatible agents for text generation, code completion, and vision tasks.
Models: Mistral Large, Codestral, Pixtral | Speed: standard | Deployment: cloud | Auth: api-key
Anthropic Claude MCP
Call Anthropic Claude models as tools within MCP-compatible agent workflows. Access Claude 3.5 Sonnet, Haiku, and Opus for text generation, analysis, and reasoning.
Models: Claude 3.5 Sonnet, Haiku, Opus | Speed: standard | Deployment: cloud | Auth: api-key
Together AI MCP
Together AI inference API via MCP. Run open-source LLMs (Llama, Mistral, DBRX, and more) with fast parallel inference at scale.
Models: Llama, Mistral, DBRX, Qwen | Speed: 800+ tok/s | Deployment: cloud | Auth: api-key | Status: verified
Ollama MCP
Local LLM inference via Ollama. Run Llama, Mistral, Gemma, and other models locally, with no API keys and no data leaving the machine.
Models: Llama, Mistral, Phi, Qwen (local) | Speed: local hw | Deployment: local | Auth: none | Status: verified
LOCAL
Ollama MCP is the only local inference provider in this set. No API key, no egress, no token costs. Runs Llama, Mistral, Phi, Qwen on your hardware. Requires Ollama installed locally.
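The deployment and auth columns of the provider table can be captured as data and filtered, which is one way to check the claim above programmatically. This is only a sketch: the `Provider` type and the helper names are illustrative and not part of any listed server, while the values are copied from the table.

```python
from typing import NamedTuple


class Provider(NamedTuple):
    name: str
    deployment: str  # "cloud", "local", or "both", as listed in the table
    auth: str        # "api-key" or "none"


# Values transcribed from the Model Providers table on this page.
PROVIDERS = [
    Provider("Replicate MCP", "cloud", "api-key"),
    Provider("Perplexity MCP", "cloud", "api-key"),
    Provider("OpenAI MCP", "cloud", "api-key"),
    Provider("Groq MCP", "cloud", "api-key"),
    Provider("HuggingFace MCP", "both", "api-key"),
    Provider("Cohere MCP", "cloud", "api-key"),
    Provider("Mistral MCP", "cloud", "api-key"),
    Provider("Anthropic Claude MCP", "cloud", "api-key"),
    Provider("Together AI MCP", "cloud", "api-key"),
    Provider("Ollama MCP", "local", "none"),
]


def local_only(providers: list[Provider]) -> list[str]:
    """Names of providers that run exclusively on local hardware."""
    return [p.name for p in providers if p.deployment == "local"]


def keyless(providers: list[Provider]) -> list[str]:
    """Names of providers that need no API key."""
    return [p.name for p in providers if p.auth == "none"]


print(local_only(PROVIDERS))
print(keyless(PROVIDERS))
```

Both filters return only Ollama MCP for the table as listed; HuggingFace MCP is excluded from the local-only result because its deployment is "both".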
AI Infrastructure
Memory, reasoning, observability, and search tools for building reliable AI systems.
Tool | Purpose | Protocols | Auth | Status
Context7 MCP
Live library documentation fetcher for LLMs. Resolves library names to current API docs, preventing hallucinations on outdated APIs.
Purpose: documentation · libraries | Protocols: MCP | Auth: none | Status: verified
Sequential Thinking MCP
Structured multi-step reasoning for complex problems. Enables agents to break tasks into explicit thought chains, revise reasoning, and build toward solutions methodically.
Purpose: reasoning · thinking | Protocols: MCP | Auth: none | Status: verified
Qdrant MCP
Official Qdrant MCP server for storing, retrieving, and searching vector embeddings in the Qdrant vector database.
Purpose: qdrant · vector-search | Protocols: MCP | Auth: api-key | Status: verified
Mem0 MCP
Persistent memory layer for AI agents. Store, search, and retrieve user preferences, conversation history, and learned facts across sessions. Official Mem0 MCP server.
Purpose: memory · persistence | Protocols: MCP | Auth: api-key | Status: verified
LangSmith MCP
LangSmith LLM observability via MCP. Trace agent runs, inspect prompts and outputs, evaluate quality, and debug complex chain failures.
Purpose: langsmith · observability | Protocols: MCP | Auth: api-key | Status: verified
Weights & Biases MCP
Weights & Biases ML experiment tracking via MCP. Log runs, compare metrics, manage model artifacts, and query training history from agents.
Purpose: wandb · mlops | Protocols: MCP | Auth: api-key | Status: verified
Other AI Servers
Name | Protocols | Auth | Status
Langfuse MCP
Official MCP server for Langfuse LLM observability. Access and manage prompts, traces, and datasets through the Model Context Protocol.
Protocols: MCP | Auth: api-key
Opik MCP
MCP server for Comet Opik, providing unified access to LLM prompts, projects, traces, and evaluation metrics from your IDE.
Protocols: MCP | Auth: api-key
Arize Phoenix MCP
MCP server for Arize Phoenix AI observability. Explore projects, traces, spans, prompts, datasets, and experiments via the Model Context Protocol.
Protocols: MCP | Auth: api-key
Confident AI MCP
Official MCP server for Confident AI and DeepEval. Run LLM evaluations, manage prompt templates, pull datasets, and trigger cloud evals from your editor.
Protocols: MCP | Auth: api-key
Braintrust MCP
MCP server for Braintrust AI evaluation and observability. Access experiments, datasets, scoring functions, and production logs for LLM quality management.
Protocols: MCP | Auth: api-key
MCP LLM Eval
Local MCP server that packages LLM evaluation gates as reusable CI/CD primitives. Run datasets against models, score with LLM-as-judge, and enforce quality thresholds.
Protocols: MCP | Auth: api-key
MCP Bench
Benchmarking framework by Accenture for evaluating LLM tool use via MCP. End-to-end pipeline assessing how effectively models discover, select, and use tools.
Protocols: MCP | Auth: none
Promptfoo MCP
MCP server exposing Promptfoo eval and red-team testing tools to AI agents. Run prompt evaluations, security tests, and quality checks from your IDE.
Protocols: MCP | Auth: none
Google AI (Gemini) MCP
Access Google Gemini models via MCP. Run text generation, multimodal inputs, code execution, and grounding with Google Search through the Gemini API.
Protocols: MCP | Auth: api-key
xAI (Grok) MCP
Integrate xAI's Grok models into agent workflows. Access Grok's real-time knowledge, large context window, and reasoning capabilities via the xAI API.
Protocols: MCP | Auth: api-key
OpenRouter MCP
Route LLM requests across 200+ models via a single MCP interface. Access GPT, Claude, Gemini, Llama, Mistral, and more with unified pricing, fallbacks, and load balancing.
Protocols: MCP | Auth: api-key
Amazon Bedrock MCP
Access Amazon Bedrock foundation models via MCP. Invoke Claude, Titan, Llama, Mistral, and other models hosted on AWS Bedrock directly from agent workflows.
Protocols: MCP | Auth: api-key | Status: verified
LiteLLM MCP
Universal LLM gateway via MCP. Route agent requests to 100+ LLM providers (including OpenAI, Anthropic, Bedrock, Gemini, and Groq) through a single interface with cost tracking and load balancing.
Protocols: MCP | Auth: api-key | Status: verified
Azure OpenAI MCP
Access Azure-hosted OpenAI models via MCP. Use GPT-4o, o1, and other models deployed in your Azure OpenAI resource with enterprise compliance and private networking.
Protocols: MCP | Auth: api-key | Status: verified
DeepSeek MCP
Access DeepSeek reasoning and chat models via MCP. Run DeepSeek-R1 and DeepSeek-V3 for cost-efficient reasoning tasks, long-context analysis, and code generation.
Protocols: MCP | Auth: api-key
LlamaIndex MCP
MCP server for LlamaIndex, a data framework for LLM applications. Query knowledge bases, run RAG pipelines, manage document indexes, and connect agents to structured data.
Protocols: MCP | Auth: api-key
Haystack MCP
MCP server for Haystack by deepset, an open-source NLP framework for building production-grade RAG and search pipelines. Run pipelines, query document stores, and manage AI components.
Protocols: MCP | Auth: api-key
Cerebras MCP
MCP server for the Cerebras Inference API. Run Llama and other open models at 1,800+ tokens/second for latency-sensitive agentic workloads.
Protocols: MCP | Auth: api-key
MLflow MCP
MCP server for MLflow, the open-source ML lifecycle platform created by Databricks. Log experiments, compare runs, register models, manage the model registry, and track artifacts from AI and ML pipeline agents.
Protocols: MCP | Auth: api-key
Guidepoint MCP
MCP server from Guidepoint giving AI research workflows access to 100,000+ expert interview transcripts, industry insights, and primary research intelligence directly inside Claude and other MCP-compatible tools.
Protocols: MCP | Auth: api-key
Google AI Edge Gallery MCP Server
MCP integration for Google AI Edge Gallery that lets on-device AI models running locally on Android connect to external tools and data sources. Extends local LLM capabilities with tool use, persistent chat history, and notification reminders without cloud dependency.
Protocols: MCP | Auth: none
FunASR MCP Server
Industrial-grade speech recognition toolkit from ModelScope with MCP integration. Delivers 170x realtime transcription speed across 50+ languages with speaker diarization, emotion detection, and streaming support. Provides an OpenAI-compatible API and exposes ASR capabilities as MCP tools for AI agent workflows.
Protocols: MCP | Auth: none
AgentDB
Self-learning vector memory MCP server that improves search quality by up to 36% through agent feedback loops. Consolidates vectors, indexes, learning state, and a cryptographic audit trail into a single portable .rvf file. Provides tiered memory with automatic quality refinement for AI agent workflows.
Protocols: MCP | Auth: none
Hippocamp
Tiered memory system MCP server for AI agents that stores and recalls episodes, facts, preferences, and reflections. Provides local-first portable memory that works across devices. Organizes agent knowledge into structured tiers for efficient retrieval and long-term context preservation.
Protocols: MCP | Auth: none
Machine-readable: /api/search?category=ai · llms.txt
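A minimal sketch of querying the machine-readable endpoint from Python. It assumes the path is served over HTTPS from the `agentndx.ai` host named in the page header; the response format is not documented on this page, so `fetch_listing` simply returns the raw body rather than parsing it.

```python
from urllib.parse import urlencode, urlunsplit
from urllib.request import urlopen

HOST = "agentndx.ai"        # taken from the page header; HTTPS is an assumption
SEARCH_PATH = "/api/search"  # path as listed on this page


def search_url(category: str) -> str:
    """Build the search URL for one category, with the query string encoded."""
    query = urlencode({"category": category})
    return urlunsplit(("https", HOST, SEARCH_PATH, query, ""))


def fetch_listing(category: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body; the schema is unknown, so nothing is parsed."""
    with urlopen(search_url(category), timeout=timeout) as response:
        return response.read().decode("utf-8")


print(search_url("ai"))
```

Calling `fetch_listing("ai")` performs the actual network request; it is left out of the example run so the sketch works offline.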