メディアタスク向けの 3,621 件のツールとスキル
Search 699pic enterprise photo/video assets, check whether an asset was already downloaded, inspect download records, an
AI图片编辑工具,支持自然语言驱动的换装、换背景、换脸、风格转换(动漫/粘土/油画等)、美颜修图。当用户需要AI图片处理、人像编辑、背景替换、风格迁移、服装更换、脸部融合时使用此skill。支持用户通过描述性prompt(如"把衣
Call the coze-js-api Douyin transcription endpoint and return transcript-ready results from Douyin URLs or share-text. U
使用火山引擎豆包模型生成图片。通过火山引擎豆包图片生成 API 创建图片。支持自定义提示词、尺寸、模型等参数。使用方式:生图:一只可爱的小猫。
Create product demo videos with voiceover, text overlays, and real browser interactions. Fully automated, zero cost. Use
当用户需要查询基金、策略、公告、财经资讯,做资产配置、组合诊断、风险回测、现金流分析,或生成图表、PDF 时,优先使用本 Skill 获取真实数据与可执行能力。
Fix garbled text in PDF/SVG vector graphics for final editing in AI. Detect, replace and repair garbled text in vector g
AI background removal service - Remove background from images using verging.ai AI technology. Supports local images and
AI图片生成与编辑工具,使用Sih.AI API进行自然语言驱动的图片处理。支持换装、换背景、换脸、风格转换(动漫/粘土/油画等)、美颜修图等功能。当用户需要通过自然语言描述来编辑图片(如"把衣服换成bikini"、&q
newtranx CLI for translate MP4 videos, Used for directly translating video files on the terminal. When you want to trans
AI short drama generation - account management, script writing, video production. Integrated X2C billing for commercial
Use Lux3D to generate 3D models from 2D images. Trigger conditions: when user asks to generate 3D model from image, imag
识别图片中的K12算式(加减乘除、竖式计算、分数、方程等),返回结构化文本结果。 支持手写体和印刷体,可拒绝非算式图片。 触发条件:用户要求识别算式、数学题、计算题图片,或上传数学题图片时调用。 关键词:算式识别、数学题、OCR、竖式计算、
MOSI Studio 指令式音色生成(moss-voice-generator): 用自然语言描述想要的音色风格,无需指定预设 voice_id, 模型根据描述实时生成对应的声音。 触发词:指令式语音、按描述生成声音、自定义音色、描述一个
全自动教学视频制作技能。根据课程主题自动生成教学视频,包含文案编写、TTS配音、画面设计、Remotion代码开发、视频导出。触发场景:用户要求制作教学视频、课程视频、讲解视频、教育内容时使用。支持竖屏(1080x1920)和横屏(1920
多平台视频/图文内容发布技能集合。支持账号管理、登录状态维护、一键多平台发布。 当用户要求发布内容到抖音、小红书、微信视频号、Threads、Instagram,或管理发布账号时触发。
使用慧穗云发票识别 API,通过上传发票影像文件(图片、PDF、OFD、ZIP)自动识别发票信息。
This skill provides audio sleep aid recommendations and guidance for users experiencing insomnia or sleep-related issues
Turn app screenshots into structured UX, copywriting, and conversion audits with issue severity and recommended fixes.
支持查询、绑定及切换火山引擎 TTS 机器人音色,设置默认音色并生成测试音频,配置自动保存生效。
Use this skill whenever a user uploads a large image and wants to see interesting details, highlights, or close-ups crop
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Generate photorealistic rendering scripts for PyMOL and UCSF ChimeraX to create publication-quality molecular visualizat
Use the official MinerU (mineru.net) parsing API to convert a URL (HTML pages like WeChat articles, or direct PDF/Office
Conduct FTO patent searches and infringement risk analyses by querying granted and pending patents based on technical de
Virtual gene knockout simulation using foundation models to predict transcriptional changes
Record experimental procedures and observations via voice commands during lab work. Real-time transcription for structur
AI-powered invoice scanning and data extraction from images and PDFs. Use when: (1) user sends an invoice image/PDF to s
使用极速数据 VIN 识别 API,对车辆挡风玻璃或行驶证上的车架号图片进行识别,返回 VIN 及品牌、厂家信息。
Automatically convert X (Twitter) posts into highly engaging viral videos using Gemini scriptwriting and HeyGen AI Avata
Generate music via Suno with the local browser-backed flow. Use when the user wants Suno songs, instrumental tracks, lyr
fal.ai API integration with managed API key authentication. Run AI models for image generation, video generation, audio
CLI for the Seer media request management API. Search movies and TV shows, create and manage media requests, manage user
文章配图推荐。根据文章主题、内容关键词,推荐合适的配图来源和搜索关键词,帮助用户找到符合文章意境的图片。当用户提到「配图」「找图」「文章图片」「封面图」「插图」时激活。
Analyze audio quality, detect noise types, and provide improvement recommendations. Use when users need to check audio q
语音录音转录并保存到 Notion 数据库。使用 faster-whisper 转录,自动提取关键信息并写入数据库。
Control Ezviz PTZ cameras via the open platform, supporting device listing, status, PTZ control, presets, and cruise pla
Generate audiobooks from novels and long-form text with chapter management and character voices. Use when users mention
Transcribe meetings with speaker identification and generate summaries with action items. Use when users need meeting tr
Create language learning audio with adjustable speed, pronunciation examples, and bilingual content. Use when users need
Diagnose why short-video retention drops and suggest practical fixes. Use when views start but audience leaves early.
Build and troubleshoot SenseAudio speech recognition integrations, including HTTP transcription (`/v1/audio/transcriptio
Create videos using ShortVideo API. Supports product-to-video, image-to-ad-video, and replicate-video. Use this skill wh
Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap
Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjo
Direct high-fidelity cinematic video with AI — translates creative intent into technical cinematographic directives for
离线使用 OpenAI Whisper 免费转录本地视频音频,支持多格式多语言,生成时间戳字幕及AI内容摘要。
Generates dual-disease transcriptomic and ML research designs for shared biomarkers, hub genes, and mechanisms, outputti
Generates complete Mendelian Randomization + single-cell transcriptomics (scRNA-seq) research designs from a user-provid
远程配置萤石摄像机参数,支持布防状态、镜头遮蔽、全天录像和移动侦测灵敏度等9种设备设置。