3,611 个媒体任务的工具和技能
MiniMax multimodal skill — TTS, voice cloning, image generation for OpenClaw
OpenClaw skill: fetch YouTube video transcripts via TranscriptAPI.com
Create striking HTML presentations from scratch, by converting PPT/PPTX files, or by enhancing existing slide decks. Use
Command-line news research tool for people who don’t use AI clients. It runs Last30Days-style multi-source search (Reddi
A Claude Code skill that generates structured meeting minutes with traffic-light initiative tracking, per-participant ta
Generate diagram/animation videos from natural language using Manim. Agent skill for Claude Code, Codex, and other AI co
An OpenClaw skill that uses faster-whisper and Parakeet to transcribe audio quickly, with additional features such as sp
Agentic motion videos built with Remotion
Local image generation with FLUX or Stable Diffusion. Generate images from text prompts on your GPU.
Gemini + Sora-2-all automated short video generation skill for Claude Code
Claude Code skill to convert Word documents to Markdown with image preservation
A comprehensive PDF document translation tool that enables multilingual translation of PDF documents while preserving th
Claude Code skill for MLX-Audio: TTS, STT, and speech-to-speech on Apple Silicon
Claude Code skill: researches topics across 5-8 real sources and writes data-backed LinkedIn posts in your voice — not A
Claude Code skill: Transcribe audio/video/YouTube to Markdown with speaker diarization (MLX Whisper + SpeechBrain)
An autonomous AI agent that executes your final wishes and preserves your voice, because nothing should be left unsaid,
Claude Code skill that transforms transcripts into ready-to-publish Jike posts with rigorous fact-checking
Claude Code skill that organizes hundreds of Apple Voice Memos into a searchable archive with transcriptions, summaries,
Zernio CLI - Schedule and manage social media posts across 13 platforms from the terminal
通过 869 客户端发送微信消息(私聊/群聊):纯文本、图片、视频、语音(音乐=语音)、链接、文件(附件)。语音支持 amr/wav/mp3;其中 wav/mp3 会在本地按 allbot 兼容方式转为 silk,单条最长 59 秒,超长自
OpenClaw TTS Provider for Xiaomi MiMo (mimo-v2-tts)
POST.devad.io Agent Skill - connect it to Claude / OpenClaw / etc, to schedule social media posts 🤖 Cheapest Social Med
Fine-tune Qwen3-TTS for text-to-speech with custom voices
Standalone TTS CLI for Apple Silicon using Qwen3-TTS and MLX. No server needed.
OpenClaw skill for organizing scanned documents using local AI models (OCR + classification)
Skill for content creators designing educational media: plan objectives, outline, and scripts that are platform-aware an
Skills for transcript video.
'Generate or edit images with Gemini using the Google GenAI SDK. Use when the user asks to create, transform, render, or
image-creation skills
Tessl tile: Write developer blog posts from video transcripts, meeting notes, or rough ideas
Seedream 5.0 image generation skill for AI agents (Claude Code, Codex, OpenClaw)
全平台视频下载工具,支持抖音、X、B站、YouTube、小红书
让agent能分析视频内容,需要LLM支持分析图片
Nano Banana - Claude Code skill for AI image generation with Gemini. Optimizes Chinese prompts to English and generates
All-in-one: Torrents + Subtitles + Player - Streamlined media experience
跨平台媒体分析报告生成技能
ArcaneaClaw — AI-powered Creator Media Engine. Scans, classifies, scores, and publishes creative media 24/7. Works with
AI agent skill for signing PDFs on behalf of Richard Atkinson — PyHanko + vision-based field detection
A creative OS for kids where OpenClaw turns ideas into games, stories, music, art, science apps, and more.
从视频中自动提取高光片段,支持 AI 语义分析(Whisper+CLIP)和音频节奏分析两种模式。自动检测平台/GPU/Miniconda。
OpenClaw Skill for Veevid AI Video Generator — generate videos from text or images with Kling 3.0, Sora 2, Veo 3.1, LTX
Copilot CLI skill for automated Azure portal screenshot capture with PII redaction
面向 Codex 的多后端图像与视频生成 skill。用于配置 `novelai_official`、`novelai_compatible`、`nanobanana`、`grok_imagine`,检查能力矩阵,并通过统一的 `media
OpenClaw skill for joining meetings and transcription
Take screenshots of desktop/Electron apps and automatically blur sensitive information using AI vision + ImageMagick
OpenClaw skill for Smallest AI — sub-100ms TTS (Lightning v3.1) and 64ms STT (Pulse) with 30+ languages
Frigate skill for Openclaw
Agent skills for building voice AI with Synthflow
A development skill for documentation-grounded Unitree G1 SDK, ROS2, DDS, service-interface, and RealSense camera workfl
"Local PDF extraction skill for OpenClaw using OpenDataLoader CLI. Use when: extracting text from PDFs, improving t