メディアタスク向けの 3,620 件のツールとスキル
Automatically back up images to stardots.io cloud storage with secure MD5 authentication and configurable credentials.
Video translation / dubbing skill. Translate user-provided video (file or URL) and return preview_url. 适用于视频翻译、视频配音、字幕翻译
AI face swap service - Use verging.ai AI face swap directly from command line. Supports local video files and images, re
调用 OCR.space 免费 API 识别图片中的文字
全自动"重生爽文"短视频流水线。给定题材,依次完成: AI生成小说 → TTS语音合成 → FFmpeg竖屏视频合成。 触发关键词:生成爽文、生成小说视频、重生爽文流水线、 novel pipeline、tts 合成视频
Designs product option structures and live personalization preview flows for custom gift stores (e.g. engraved necklaces
Use this skill when users need to extract text from images, PDFs, or documents. Supports URLs and local files. Returns s
Automatically converts received voice messages to text via an external ASR service, supporting multiple audio formats an
Automatically upload images to Stardots.io cloud storage, manage files, and obtain secure access links using API authent
Professional AI video generation with cinematic prompt optimization, auto-detection of optimal generation backends (Comf
MinerU document parsing CLI with layout.json post-processing and S3 integration. Parse PDF/Word/PPT/images to structured
将PDF文件的每一页转换为图片文件;支持自定义图片格式(PNG/JPG)和分辨率;适用于文档处理、图片化存档等场景
Use PoYo AI's Sora 2 video generation models through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a
图片尺寸调整和压缩工具技能。用于按指定像素宽高、比例或最大尺寸限制调整图片大小,并支持智能压缩到指定文件大小。适用于需要批量处理图片、生成特定尺寸缩略图、压缩图片以满足文件大小限制等场景。
Manage your entire social media from AI — post, schedule, and analyze across Facebook, Instagram, TikTok, YouTube, Linke
Download videos, images, and audio without watermarks from 999+ platforms (TikTok, YouTube, Instagram, Twitter, Bilibili
Async AI image generation (text-to-image and image-to-image). Submit a job to get a task_id, then poll status to get an
Generate images with FLUX models (Black Forest Labs) via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA with
使用科大讯飞 API 将音频/视频转换为文字。支持本地音频文件转录、YouTube 视频下载并转文字。适用于会议记录、视频字幕、语音笔记等场景。当用户需要语音转文字、音频转录、YouTube 视频转文字时触发。
Use when the user wants to generate product detail images or carousel/main images for e-commerce platforms like Taobao.
提供使用摄像头拍照, 录制视频或直接生成gif的能力。何时触发: 需要拍照时, 需要观察一段时间当前视野时, 需要关注某件事情的进展时.
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers i
Convert text to podcast audio using Tencent Cloud TTS. Supports both short and long text processing, generates up to 30-
Convert multiple HTML page elements into separate high-resolution images with customizable settings and automatic file n
Parse PDF, DOC, DOCX, and image files to Markdown or JSON using UniDoc API with sync or async mode and automatic status
Control Android devices via adbclaw CLI — tap, swipe, type, screenshot, UI inspection, and app management. Use when: (1)
Assist macOS users in preparing, converting, exporting, and troubleshooting Word, PDF, Markdown, PowerPoint, and Excel f
智能简历解析系统,支持PDF/Word/图片格式简历的结构化信息提取、岗位匹配度分析、优化建议生成。完全本地运行,无需外部API。使用场景:(1) 解析上传的简历文件提取核心信息,(2) 输入岗位JD计算简历匹配度,(3) 生成简历优化建议
Use PoYo AI Flux 2 through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a user wants to generate or
Use PoYo AI Grok Imagine Video for short text-to-video and image-to-video generation with motion-style controls through
支持一键将视频批量上传至抖音、快手、视频号、B站、YouTube 和 TikTok,具备凭证管理和失败自动重试功能。
Use PoYo AI Hailuo 02 for prompt-optimized video generation and image-to-video workflows through the `https://api.poyo.a
Use PoYo AI GPT Image 1.5 through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a user wants to gener
Use PoYo AI GPT-4o Image through the `https://api.poyo.ai/api/generate/submit` endpoint. Use when a user wants to genera
AI voice call agent — make outbound calls, generate browser call links, accept inbound calls, and retrieve full transcri
Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).
Video intelligence and content analysis using Memories.ai LVMM. Discover videos on TikTok, YouTube, Instagram by topic o
Download Sentinel satellite imagery (Sentinel-1/2/5P) via STAC API with cloud cover filtering and batch download support
Intelligent workplace inspection system with guided setup, configurable inspection tasks, AI-powered image analysis, and
Daily AI image generation from Wikipedia On This Day events using local ComfyUI. Use when user wants daily historical im
Local TTS router for Apple Silicon — pull models, serve OpenAI-compatible API, synthesize speech, clone voices. Use when
Organize a video folder by cleaning non-video files, removing short/bad videos, and classifying videos into numbered sub
Generate subtitles with automatic time alignment using Volcengine ATA API. Use when the user wants to: (1) add time-alig
Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM
Organize a photo folder by cleaning non-photo files, removing bad exposures, detecting blur and burst shots, and classif
CLI for VibeSKU — an AI-powered creative automation platform that turns product SKU photos into professional e-commerce
Generate and customize shareable personal pages from memory profiles that evolve with your experiences and include rich
Generate AI videos for mature creative projects using Wan 2.6, Seedance 1.5, Vidu Q3-Pro, and other models with relaxed
Automate posting to WeChat Moments on Windows desktop (open Moments window, trigger publish entry, select image, paste c
Generate AI images for mature creative projects using Wan 2.6, Seedream, and other models with relaxed content policies