3,611 tools and skills for media tasks
A CLI for Bilibili — browse videos, users, favorites from the terminal 📺
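ppt-svg-generator is a Skill that helps you quickly convert Markdown drafts into PPT or PDF, with multiple preset styles to choose from and attractive, controllable results. For the usage workflow, see the WeChat official account: 懂点儿 AI 👇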
Automated video podcast creation skill
Claude Code skill that translates entire books (PDF/DOCX/EPUB) into any language using parallel subagents
Skill to talk to Claude about your projects over the phone
A CLI for Bilibili — browse videos, users, search, and feeds from the terminal
📸 Structured screenshot intelligence for AI coding tools. Give your AI X-ray vision for UI screenshots.
An MCP implementation to allow OpenAI, Claude, or xAI to interact with Smalltalk images
Open-source AI image generator skill and AI video generator skill for Codex, Claude Code, OpenClaw, Cursor, and more. Po
Standalone CLI & library for Google NotebookLM — generate audio podcasts, analyze content, manage notebooks via reve
Zero-setup bash CLI that downloads full-resolution images from iCloud/Dropbox/Google Photos share links, bridging iPhone
MCP server for Apple Music - manage playlists, control playback, browse your library.
Interactive code walkthrough skill with VS Code highlighting and AI-powered voice narration
Control a remote macOS machine from Linux/VPS via SSH — screenshots, commands, app control, file transfer
Voice interface for OpenClaw with speaker recognition, voice-gated security, real-time barge-in, and multi-provider stre
Fast local PDF parsing with PyMuPDF (fitz) for Markdown/JSON outputs and optional images/tables. Use when speed matters
AI image, video, and music generation skill for Claude Code. Flux, Veo 3.1, Suno V5.
MCP server for RustChain blockchain and BoTTube video platform — AI agent tools for earning RTC tokens. Built on createk
Text-to-speech workflow for turning user-provided text or markdown into pleasant `.opus` audio with `chunktts`. Use when
Third-party Nano-banana image generation and editing Skill for OpenClaw - supports text-to-image and image-to-image
UniImage — Unified multi-platform AI image generation skill for OpenClaw. Supports Volcengine Seedream, Alibaba Qwen Ima
AI Agent's document beautifier — one command to turn Markdown into beautiful web pages, Word, PDF & slideshows
Tianphoto: intelligent image-and-text generation studio | Claude Code Skill for WeChat-style article image generation
Turn docs, posts, and codebases into interactive walkthroughs — narrated animation, quizzes, and hands-on widgets.
Extract individual assets from images as transparent PNGs. Zero ML models, pure classical CV. This is used in pdf2ppt on
Codex/Claude Code skill that batch-reads paper PDFs, generates structured summaries, and exports them to Excel
AI Agent skill for narrator-ai-cli — CLI client for Narrator AI video narration API
🚀 An OpenClaw-native health management agent. It tracks diet, hydration & exercise via LLM, and automates the gener
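🎙 OpenClaw skill: transcribes online video URLs and local audio/video files into Markdown transcripts via the Tongyi Tingwu (通义听悟) API, then archives them to Alibaba Cloud OSS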
LLM-powered voice command agent that connects to OpenClaw
An OpenCode skill that generates stunning, professional HTML slide presentations. 18 style presets, PDF/PPTX export, pre
Connect Even Realities G2 smart glasses to OpenClaw AI agents. Voice commands → full agent capabilities via Cloudflare W
Give your AI agents memory across sessions. Lightweight session transcript search for OpenClaw.
Generate, edit, and restore images using Google Gemini -- from any AI agent.
Codex skill for turning Bilibili videos into Obsidian study notes with subtitle download, keyframe capture, formula corr
A Claude Code skill that generates AI podcast scripts and audio from source content. NotebookLM-style two-person convers
PowerShell-based GDI+ screenshot tool for OpenClaw Windows headless nodes. Pure PowerShell, no external dependencies, su
Claude Code skill: Download all images from an esa post for local viewing
Docker container for playing Amiga music modules using UADE
Fetch and read transcripts from YouTube and Bilibili videos. Use when you need to summarize a video, answer questions ab
Transform YouTube videos into structured knowledge — TL;DR, key takeaways, timestamped claims, topic timeline, and notab
AI Agents skill that gives agents the capability to render PlantUML diagrams as images
Video analysis skill for Claude Code — frame extraction, scene detection, transcription, parallel subagent analysis
Claude Code skill for generating Excalidraw diagrams and exporting to PNG/PDF/SVG locally
OpenClaw skill for local text-to-speech using Edge TTS
article-to-media-image
Unified NVIDIA NIM skill for OCR, layout, tables, charts, and reranking across Codex, Claude, and OpenClaw
AI agent skill for Plex Media Server — search, watchlist, libraries, sessions via CLI
Clawdess, your OpenClaw assistant that sends you photos, videos, and voice messages.
UGC reel production engine for AI coding agents. Create scroll-stopping short-form videos at scale.