Media MCP Servers
MCP servers for image generation, video processing, audio transcription, and media management with AI agents.
93 servers · sorted by GitHub stars
What you can do
- ✓ Generate images from text descriptions
- ✓ Transcribe audio and video files
- ✓ Process and resize images in bulk
- ✓ Extract metadata from media files
Top Media MCP servers by GitHub stars
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to
Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction an
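Tianshu (天枢) - Enterprise-grade all-in-one AI data preprocessing platform | PDF/Office to Markdown | MCP protocol support for AI assistant integration | Vue3 + FastAPI full-stack solution | Document parsing | Multimodal information extraction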
MCP server that retrieves transcripts of YouTube videos.
Create professional visuals from text, URLs, or PDFs. Carousels, presentations, posts. 100+ formats
Image processing for AI agents. Resize, convert, compress, and pipeline images.
ROC biometrics & computer vision: face, LPR, OCR, pedestrian, vehicle, gun detection.
Analyze images and videos with Gemini to get fast, reliable visual insights. Handle content from U…
Create images and videos from prompts, with options for image mixing, reference images, and start/…
Analyze images from multiple angles to extract detailed insights or quick summaries. Describe visu…
Generate professional PowerPoint presentations from text, YouTube videos, or structured JSON data.…
Generate polished PowerPoint presentations from text prompts, YouTube videos, or structured outlin…
Find academic papers across major sources like arXiv, PubMed, bioRxiv, and more. Download PDFs whe…
Generate, manage and explore your Switch AI image and video library, scoped to your account.
Detect grooming, bullying, fraud, and 16+ online threats across text, voice, image, and video.
Clip videos into published YouTube Shorts with real measured 48h/7d receipts.
Edit, draft, summarize, export styled .docx/PDF/HTML/MD/RTF via 21 MCP tools + 4 prompts.
Manage discussion topics, gather votes and video rationale, and read AI insights.
Bitcoin and YouTube video intelligence for AI agents. Pay-per-call via x402 USDC on Base.
QC videos, podcasts, and clips before upload — timestamped flags with agent-ready repair prompts.