Image & Video Processing MCP Servers
Explore our curated list of the best Image & Video Processing MCP servers. Browse 40 servers, compare features, and connect to the one that fits your project.
Algolia
Use AI agents to provision, configure, and query your [Algolia](https://algolia.com) search indices.
AgentMode
Connect to dozens of databases, data warehouses, GitHub & more, from a single MCP server. Run the Docker image locally, in the cloud, or on-premises.
AWS Cost Explorer
Optimize your AWS spend (including Amazon Bedrock spend) with this MCP server by examining spend across regions, services, instance types and foundation models ([demo video](https://www.youtube.com/watch?v=WuVOmYLRFmI&feature=youtu.be)).
Azure OpenAI DALL-E 3 MCP Server
An MCP server for the Azure OpenAI DALL-E 3 service that generates images from text.
Bilibili
This MCP server provides tools to fetch Bilibili user profiles and video metadata, search videos, and more.
Browser MCP
(by UI-TARS) A fast, lightweight MCP server that gives LLMs browser automation through Puppeteer’s structured accessibility data, with an optional vision mode for complex visual understanding and flexible, cross-platform configuration.
CRASH
MCP server for structured, iterative reasoning and thinking with flexible validation, confidence tracking, revision mechanisms, and branching support.
Creatify
MCP Server that exposes Creatify AI API capabilities for AI video generation, including avatar videos, URL-to-video conversion, text-to-speech, and AI-powered editing tools.
DaVinci Resolve
MCP server integration for DaVinci Resolve providing powerful tools for video editing, color grading, media management, and project control.
Dicom
An MCP server for querying and retrieving medical images and for parsing and reading DICOM-encapsulated documents (PDF, etc.).
Docker
Integrate with Docker to manage containers, images, volumes, and networks.
Docker
Docker MCP Server provides advanced, unified Docker management via CLI and MCP workflows, supporting containers, images, volumes, networks, and orchestration.
FoundationModels
An MCP server that integrates Apple's [FoundationModels](https://developer.apple.com/documentation/foundationmodels) for text generation.
Home Assistant
Docker-ready MCP server for Home Assistant with entity management, domain summaries, automation support, and guided conversations. Includes pre-built container images for easy installation.
HuggingFace Spaces
Server for using HuggingFace Spaces, supporting open-source image, audio, and text models and more. Includes a Claude Desktop mode for easy integration.
IIIF
Comprehensive IIIF (International Image Interoperability Framework) protocol support for searching, navigating, and manipulating digital collections from museums, libraries, and archives worldwide.
Image Generation
This MCP server provides image generation capabilities using the Replicate Flux model.
ImageSorcery MCP
Computer-vision-based 🪄 sorcery: image recognition and editing tools for AI assistants.
JSON2Video MCP
A Model Context Protocol (MCP) server implementation for programmatically generating videos using the json2video API. This server exposes powerful video generation and status-checking tools for use with LLMs, agents, or any MCP-compatible client.
mcp-screenshot-website-fast
High-quality screenshot capture optimized for Claude Vision API. Automatically tiles full pages into 1072x1072 chunks (1.15 megapixels) with configurable viewports and wait strategies for dynamic content.
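The tiling arithmetic in that description can be illustrated with a short sketch. This is not the server's actual code: the function name `tile_boxes`, the row-by-row order, and the choice to clip the last tile at the page edge (rather than pad or overlap it) are all assumptions made for illustration. Only the 1072-pixel tile edge comes from the description.

```python
TILE = 1072  # tile edge in pixels, taken from the description above


def tile_boxes(page_width, page_height, tile=TILE):
    """Return (left, top, right, bottom) boxes that cover the page, row by row.

    Hypothetical helper: edge tiles are clipped to the page bounds, which is
    an assumption about how partial tiles might be handled.
    """
    boxes = []
    for top in range(0, page_height, tile):
        for left in range(0, page_width, tile):
            right = min(left + tile, page_width)
            bottom = min(top + tile, page_height)
            boxes.append((left, top, right, bottom))
    return boxes


# A page 1072 px wide and 3000 px tall needs three tiles stacked vertically.
boxes = tile_boxes(1072, 3000)
print(len(boxes))   # 3
print(boxes[-1])    # (0, 2144, 1072, 3000)

# One full tile is about 1.15 megapixels, matching the figure quoted above.
print(round(TILE * TILE / 1_000_000, 2))  # 1.15
```

Each box could then be passed to whatever cropping routine the screenshot tool uses; the sketch only shows how a full-page capture breaks down into fixed-size chunks.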