Media AI Skills - 3,611 Tools

AI Video Generation

Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, See

by okaris · community · Quality: medium · 1 stars

Agent Browser

Browser automation for AI agents via inference.sh. Navigate web pages, interact with elements using @e refs, take screen

by okaris · community · Quality: medium · 1 stars

ACE Music - Free Suno Alternative

Generate unlimited AI music for free using ACE-Step 1.5. Full songs with vocals, lyrics, any genre, any language. No subscription, no credits, no limits. The open-source Suno alternative, powered by ACE Music's free API.

Generate AI music using ACE-Step 1.5 via ACE Music's free API. Use when the user asks to create, generate, or compose mu

by fspecii · community · Quality: medium · 1 stars

Bilibili & YouTube Watcher

Fetch and read transcripts from YouTube and Bilibili videos. Use when you need to summarize a video, answer questions ab

by donnycui · community · Quality: medium · 1 stars

vsum

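Video summarizer that automatically fetches subtitles for YouTube and Bilibili videos and produces an AI summary in Markdown format. Use when the user provides a video link and wants its content summarized.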

by Chrischaan · community · Quality: medium · 1 stars

OpenClaw ComfyUI

Connect and control ComfyUI API efficiently using template mapping and auto-asset management for image generation and ed

by clawhub · community · Quality: medium · 1 stars

YouTube AI Videos

Fetch latest AI-related YouTube videos from curated channels using YouTube Data API v3 and filter by keywords

by clawhub · community · Quality: medium · 1 stars

Peekaboox

Control and automate the Linux desktop GUI on X11. Use this skill to take screenshots, find and click UI elements, type

by clawhub · community · Quality: medium · 1 stars

Video Captions

Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and bu

by ivangdavila · community · Quality: medium · 1 stars

Gemini Image Generator

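Generate or edit images with Gemini models, with support for custom third-party API endpoints (baseUrl) and keys. Defaults to the OpenAI-compatible format and also supports Google's native format. Triggers: text-to-image, image editing, image compositing, drawing requests, generating illustrations/photos/posters, AI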

by clawhub · community · Quality: medium · 1 stars

PrintPal 3D Generation

Generate 3D models for 3D printing from images or text prompts using PrintPal API. Use when the user wants to create 3D

by plebbyd · community · Quality: medium · 1 stars

Taobao Image Search

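Use Taobao to search for matching products by image, compare candidates, and add items to the cart. Use when the user provides a product image and asks to "find the same item / find similar items / compare prices / add to cart". Prefer running the local scripts (save-taobao-cookie.js, verify-taobao-runner.js) to complete the full workflow; when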

by clawhub · community · Quality: medium · 1 stars

Veryfi Documents AI

Real-time OCR and data extraction API by Veryfi. Extract structured data from receipts, invoices, bank statements, W-9s,

by dbirulia · community · Quality: medium · 6 stars

Cheapest Image Generation

Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.

by EvoLinkAI · community · Quality: medium · 1 stars

Dropbox KB Auto

Automatically index and semantically search Dropbox files using OCR and Office file parsing with efficient delta-based s

by clawhub · community · Quality: medium · 1 stars

Best Image

Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLink A

by clawhub · community · Quality: medium · 1 stars

Cheapest Image

Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.

by clawhub · community · Quality: medium · 1 stars

video-copy-analyzer

Video Copy Analyzer - AI-powered video transcription and copywriting analysis skill

by ALBEDO-TABAI · community · Quality: medium · 82 stars

Vajra

Analyze URLs, YouTube videos, tweets, or text for quality, bias, and reliability using the Vajra API (vajra.to). Use whe

by clawhub · community · Quality: medium · 1 stars

YouTube Ultimate

Free transcripts, 4K downloads, and video exploration — zero API quotas burned.

by clawhub · community · Quality: medium · 2 stars

Best Image Generation

Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLink A

by clawhub · community · Quality: medium · 1 stars

nano-banana-pdf-edit

Edit PDF files visually using natural language with the nano-pdf CLI tool, powered by Google's Gemini 3 Pro Image (Nano

by ps06756 · community · Quality: medium · 1 stars

Gemini Voice Assistant

Voice-to-voice AI assistant using Gemini Live API. Speak to the AI and get spoken responses. Use when you want to have n

by clawhub · community · Quality: medium · 1 stars

Voice Log

Background voice journaling with Soniox realtime STT for OpenClaw. Requires SONIOX_API_KEY. Get/create your Soniox API k

by easwee · community · Quality: medium · 1 stars

Microsoft Foundry image generation

Azure Foundry image generation skill for OpenClaw; generates images via a Foundry deployment and returns image bytes or

by clawhub · community · Quality: medium · 1 stars

YouTube Uploader

Upload videos and custom thumbnails to YouTube. Use when the user wants to publish, upload, or post a video to YouTube,

by clawhub · community · Quality: medium · 1 stars

Browser Automation

Control Chrome browser with AI using MCP protocol. Use when users want to automate browser tasks, take screenshots, fill

by clawhub · community · Quality: medium · 1 stars

Moltgram

Post to Moltgram — Instagram for AI Agents. Register, generate images, post, like, follow, and comment.

by clawhub · community · Quality: medium · 1 stars

mcp-chrome

Control Chrome browser with AI using MCP protocol. Use when users want to automate browser tasks, take screenshots, fill

by clawhub · community · Quality: medium · 1 stars

stats.fm

Query stats.fm (Spotify listening stats) via the public REST API. Provides music listening data, Spotify stats, top arti

by clawhub · community · Quality: medium · 1 stars

Open Animate

Open Animate — the creative suite for AI agents. Create professional motion graphics, generate images, and render MP4 vi

by jacobcwright · community · Quality: medium · 1 stars

Web2Labs Studio

Edit my recording, turn a long video into shorts, generate captions and thumbnails, estimate cost before processing. Upl

by Vinlow · community · Quality: medium · 1 stars

openlesson

Interact with the openLesson tutoring API to generate learning plans, start audio-based sessions, analyze reasoning gaps

by clawhub · community · Quality: medium · 1 stars

AI Video Editor

Use when editing videos, creating Reels/Shorts/TikTok, cutting long videos into clips, adding AI captions or commentary,

by clawhub · community · Quality: medium · 1 stars

ultraplan

CLI tool for recording multi-modal context (audio, keystrokes, clipboard, screenshots) locally

by definite-app · community · Quality: medium · 22 stars

code2animation

Produce complete code-based animated videos by scripting, generating narration, creating visual assets, and rendering fi

by clawhub · community · Quality: medium · 1 stars

LOCAL WHISPER API

Transcribe audio via the Whisper API using any compatible local server.

by clawhub · community · Quality: medium · 1 stars

WiseDiag MedOCR

Convert PDF files to Markdown using WiseDiag MedOcr API. Supports table recognition, multi-column layouts, and medical d

by fmdmm · community · Quality: medium · 1 stars

poidh-bounty

Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Arbitrum, Base, or Degen Ch

by picsoritdidnthappen · community · Quality: medium · 25 stars

Deepgram Transcribe

Transcribe audio via Deepgram Nova-3 API (5.26% WER, 40x faster than Whisper, built-in speaker diarization). Use when us

by clawhub · community · Quality: medium

Meeting Transcripts

Capture meeting transcripts from Fireflies.ai via polling or webhooks. Auto-fetches transcripts, extracts action items/d

by clawhub · community · Quality: medium

Batch File Processing Master (文件批量处理大师)

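Batch-process all your files in one click: rename, compress images, convert to PDF, and sort automatically. No software to install, runs locally with 小龙虾 ("Crayfish"), safe and ad-free, a must-have tool for office workers, students, and shop owners. Use cases: batch file renaming, batch image compression, batch PDF conversion, automatic sorting by file type. Suitable for Window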

by clawhub · community · Quality: medium

Research to WeChat

An end-to-end WeChat article orchestrator that turns a keyword, article, URL, or video transcript into a researched arti

by clawhub · community · Quality: medium

universal-pdf-vision-parser

Extract multilingual document content and language learning notes (French, German, Japanese, Spanish, etc.) from PDFs us

by clawhub · community · Quality: medium

Narrative Voice

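Narrative conversation skill. Produces responses in everyday conversation that feel story-like, warm, and lingering. Inspired by Neil Gaiman's interview style, where a few words unfold into rich, high-dimensional information. Uses two depth levels (light/deep) and switches between them automatically.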

by clawhub · community · Quality: medium

Morfeo Content Engine Pipeline

Autonomous pipeline generating TikTok videos simulating real Argentine brands with a final AI reveal by Morfeo Labs, pos

by clawhub · community · Quality: medium

Save Douyin Video To Feishu Drive

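Parse a Douyin share link or video page URL into a downloadable direct video link, title, and description, then optionally download it locally or upload it to Feishu Drive. Use when you need to parse Douyin URLs (short links, /video/, /note/, modal_id, etc.) and obtain the real playback address or download the video.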

by clawhub · community · Quality: medium

Ads Audience Targeting

Build audience segmentation and targeting plans for Meta (Facebook/Instagram), Google Ads, TikTok Ads, YouTube Ads, and

by clawhub · community · Quality: medium

Ads Q&A Assistant

Answer ads operations questions quickly for Meta (Facebook/Instagram), Google Ads, TikTok Ads, YouTube Ads, Amazon Ads,

by clawhub · community · Quality: medium

Ads Data Query

Run natural-language data query workflows for Meta (Facebook/Instagram), Google Ads, TikTok Ads, YouTube Ads, Amazon Ads

by clawhub · community · Quality: medium