3,619 个媒体任务的工具和技能
Compare two face images and return similarity score using iFlytek Face Recognition API.
Create stunning designs with AI. Social media graphics, presentations, and marketing materials without design skills. No
Generate images, music, and videos from text prompts using Pollinations AI with models like flux, zimage, and suno-4 via
AI生成图片,支持Kolors/FLUX/Qwen-Image等模型(需SiliconFlow API)
OpenClaw adaptation of @mvanhorn's last30days skill. Research any topic from the last 30 days across Reddit, X, YouTube,
Turn messy recordings, transcripts, voice notes, or brain dumps into clean, team-ready Standard Operating Procedures (SO
Zeelin Social Watch: monitor social media sentiment, trending events, platform rankings, and account data via GSData ope
Story generation pipeline skill. Supports multi-episode continuous generation, graph management, AI quality check + huma
Generate professional PDFs from HTML/CSS using flow layouts and selective break controls to avoid whitespace gaps and la
Optimize and generate text-to-image prompts for AI art platforms. Use when a user wants to: (1) Optimize prompts for Mid
Generate speech audio with 阿里云百炼 TTS via the `bailian-cli` npm package. Use when users ask to convert text to voice, cho
PDF文字水印工具 / PDF text watermark tool. 智能检测页面方向,自动调整角度和大小,支持中文,居中显示。Auto-detects page orientation, adjusts angle & siz
Trigger this skill for ANY of these situations — writing OR conversation: WRITING: blog posts, articles, social media ca
调用 Nano Banana API 生成或编辑图片,支持文生图和图生图,需提供API Key和提示词,支持自定义尺寸比例。
Binary classification-based human portrait segmentation for complete body contour recognition and image matting.
Recognize songs by singing or audio file using iFlytek's Query By ACRCloud technology.
Generate educational comic-style Xiaohongshu posts using AI-generated comic images. Includes topic research, storyboard
Helps choose the right fal.ai model before API calls. Provides quick decision matrix for video generation (text-to-video
Provide real-time traffic camera footage and livestreams for specified roads or highways to check current traffic condit
Discover, research, script, fact-check, and generate podcast episodes automatically. Multi-source topic discovery, LLM s
Feishu Document Exporter - Batch export Feishu docs to markdown/PDF
腾讯云语音合成(TTS)服务技能包。当用户需要将文本转换为语音文件时使用此技能,支持多种音频格式输出和灵活的配置选项。当用户提到语音合成、文本转语音、TTS服务、音频文件生成时,都应该考虑使用此技能。
Skill for Tencent Cloud HunYuan Text-to-Image Generation (混元生图). Provides AI image generation from text prompts using th
Plan, launch, and optimize digital marketing with growth marketing systems, short-form video, funnel operations, and rev
Generate images with FLUX models (Black Forest Labs) via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA with
Full AI image creation workflow — intent classification, prompt enhancement, multi-direction generation via fal.ai, and
Automatically publish notes to Xiaohongshu (小红书) creator center. Generates cover images (PIL poster, multiple styles), w
Give your AI agent eyes to see the entire internet. Install and configure upstream tools for Twitter/X, Reddit, YouTube,
Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio file
图片人脸融合(专业版)为同步接口,支持自定义美颜、人脸增强、牙齿增强、拉脸等参数,最高支持8K分辨率,有多个模型类型供选择。
通过上传图片和选择特效模板,生成一段特效视频,将静态图像转化为充满活力、动感、有趣的视频画面。
腾讯云混元生图 3.0,文生图 / 图生图,智能生成贴合描述的图片。Tencent Cloud Hunyuan Image Generation 3.0, text-to-image / image-to-image, intelligen
Generate music with ACE-Step via HuggingFace private Space. Supports text-to-music, lyrics, style tags, and reference-au
Monitor live streams (YouTube, Bilibili) and get notified when specific keywords are mentioned. Uses browser SpeechRecog
生成专业级 A股早报/晚报,包含大盘指数行情、市场情绪、K线走势图、 行业/概念板块排行、个股涨跌榜、主题新闻追踪、综合分析, 输出 Markdown + PNG 图表 + PDF。数据源为东方财富公开 API。 Use when aske
Fetch and analyze YouTube video content using transcripts when available, or fall back to video descriptions with source
AI security toolkit — deepfake and AI-generated media detection. Use when verifying if an image, video, or audio is a de
将多张图片自动旋转合并为单个PDF,支持根据Excel清单重命名及扫描PDF的OCR文字提取。
腾讯云试题批改Agent(SubmitQuestionMarkAgentJob/DescribeQuestionMarkAgentJob)接口调用技能。当用户需要对试卷图片或试题图片中的K12试卷或试题进行自动批改、手写答案识别、知识点分析
腾讯云行驶证识别(VehicleLicenseOCR)接口调用技能。当用户需要识别行驶证图片主页(车牌号码、车辆类型、所有人、住址、使用性质、品牌型号、识别代码、发动机号、注册日期、发证日期)或副页(号牌号码、档案编号、核定载人数、总质量、
腾讯云护照识别(多国多地区)(MLIDPassportOCR)接口调用技能。当用户需要识别护照图片中中国大陆、港澳台地区或其他国家/地区的护照信息(护照ID、姓名、出生日期、性别、有效期、发行国、国籍、国家地区代码、MRZ码等)时,应使用此
腾讯云表格识别v3(RecognizeTableAccurateOCR)接口调用技能。当用户需要从表格图片或PDF中识别常规表格、无线表格、多表格的内容,提取每个单元格的文字信息,或将表格图片识别结果导出为Excel文件时,应使用此技能。支
Automatically generate social media posts from articles. Supports Twitter, LinkedIn, and more. Perfect for content repur
飞书语音消息发送技能(Windows 版)。使用 Edge TTS(微软,免费)生成语音并以飞书语音气泡发送。
Extract and break down content from web documents, PDFs, images, and URLs into structured markdown notes stored locally
Generate stunning images with Flux Dev. Best quality open image model. No API keys needed. $2 FREE credits to start. Pay
多片段短视频自动拼接工具,支持按文件名排序、统一音视频参数、淡入淡出转场、分块/完整拼接,适合短剧、分镜头视频批量拼接
腾讯云广告文字识别(AdvertiseOCR)接口调用技能。当用户需要从图片中识别文字内容时,应使用此技能。支持中英文、横排、竖排及倾斜场景的图片文字识别,支持90度、180度、270度翻转场景的图片识别,返回文本框位置与文字内容。支持图片
腾讯云实时文档抽取Agent(ExtractDocAgent)接口调用技能。当用户需要从图片或PDF中按自定义字段名称进行结构化信息抽取时,应使用此技能。支持自定义字段名称、字段类型(KV对或表格字段)和字段提示词,实现灵活的文档信息提取。
腾讯云营业执照识别(BizLicenseOCR)接口调用技能。当用户需要识别营业执照图片上的字段信息(统一社会信用代码、公司名称、主体类型、法定代表人、注册资本、组成形式、成立日期、营业期限、经营范围等)时,应使用此技能。支持图片Base6