3,619 个媒体任务的工具和技能
Use the internet: search, read, and interact with 13+ platforms including Twitter/X, Reddit, YouTube, GitHub, Bilibili,
腾讯云车牌识别(LicensePlateOCR)接口调用技能。当用户需要对中国大陆机动车车牌进行自动定位和识别时,应使用此技能。支持返回车牌号码、车牌颜色、置信度和像素坐标信息,支持多车牌场景识别,支持图片Base64和URL两种输入方式。
腾讯云身份证识别(IDCardOCR)接口调用技能。当用户需要识别身份证图片中中国大陆居民二代身份证正反面信息(姓名、性别、民族、出生日期、住址、身份证号、签发机关、有效期限等)时,应使用此技能。支持图片Base64和URL两种输入方式,同
Transcribe audio with free credits. Whisper-powered, 99 languages. No API keys needed. $2 FREE credits to start. Pay-as-
Generate images with free credits. Flux, DALL-E, and more. No credit card to start. No API keys needed. $2 FREE credits
Generate images in ~1 second with Flux Schnell. Fastest high-quality image model. No API keys needed. $2 FREE credits to
Generate images with Black Forest Labs Flux. Models: flux-schnell (fast), flux-dev (quality). Best open-source image mod
Image and video analysis powered by Isaac vision models. Capabilities include visual Q&A, object detection, OCR, cap
Remove visible Gemini AI watermarks from images via reverse alpha blending. Use for cleaning Gemini-generated images, re
Bidirectional LAN file sharing for AI agents. Provides a static file server (port 18801) for serving files to users, and
AI image generation with OpenAI, Google, DashScope and Replicate APIs. Supports text-to-image, reference images, aspect
Find guitar tabs/sheet sources for a song from a title or link (especially YouTube), rank the best matches, and produce
Automate web browsing tasks like navigation, data extraction, form filling, clicking, and screenshots using the agent-br
Generate and iterate on images using Image Sprout projects. Creates consistent outputs from reference images, style guid
Generate fixed-template daily AI news posters from five news items. Use when the user asks to create a poster, social ca
Extract text from PDFs using Google Gemini OCR. Use when extracting text from PDFs, performing OCR on scanned documents,
HTML-first PDF production skill for reports, papers, and structured documents. Must be applied before generating PDF del
YouTube video search, download & subtitle extraction. 40 Stars! Supports video/audio/subtitles. Each call charges 0.
支持PDF、Word、Markdown智能摘要和格式转换,提供批量处理与进度报告,提升文档处理效率。
Production-grade OCR with intelligent engine selection. Tesseract (lightweight, fast) and PaddleOCR (high accuracy, Chin
Vision-driven HarmonyOS NEXT device automation using Midscene. Operates entirely from screenshots — no DOM or accessibil
Vision-driven browser automation using Midscene. Operates entirely from screenshots — no DOM or accessibility labels req
Create mobile-friendly newspaper-style long images from raw text or summaries by extracting key points and rendering str
Generate Xiaohongshu (RedNote) infographic images. 7.2K Stars! 9 styles × 6 layouts. Each call charges 0.001 USDT via Sk
Convert content from sources like YouTube, PDFs, and WeChat into podcasts, PPTs, mind maps, or quizzes with Google Noteb
监控视频平台官方频道更新,快速获取指定频道在过去一周内发布的新视频(排除 Shorts 短视频)。支持 YouTube、Vimeo 等视频平台。用于: (1) 获取竞品或行业标杆的品牌内容更新,(2) 追踪多个频道的视频发布动态,(3) 生
Give your AI agent eyes to see the entire internet - read Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu. 6.5K
从 frameset.app 搜索视频参考片段,找到合集页面和原视频链接。用于: (1) 根据关键词搜索广告/电影片段参考,(2) 获取原视频 YouTube/Vimeo 链接,(3) 下载视频到本地。
AI diary service - push diary entries, query diaries, get AI analysis and cover images via HTTP API.
Read, analyze, convert, trim, merge, adjust volume, and transcribe audio files in multiple formats including MP3, WAV, F
Read, analyze metadata, convert formats, resize, rotate, crop, compress, and batch process PNG, JPG, GIF, WebP, TIFF, BM
Automatically generate social media posts from articles. Supports Twitter, LinkedIn, and more. Perfect for content repur
Read, extract text and metadata, and convert documents in formats like PDF, DOCX, XLSX, PPTX, EPUB, RTF, and OpenDocumen
Automatically monitor RSS feeds and post to social media. Schedule content, generate posts with AI, and publish to Twitt
Daily social media routine for indie developers and founders. Runs a structured 15-20 minute engagement routine across R
使用万兴天幕(Tomoviee)AI大模型从文字描述生成视频。当用户需要:(1) 通过文字生成AI视频,(2) 创建短视频内容,(3) 将文案或创意转化为动态视频画面时使用此skill。即使用户只说'帮我生成一个视频'、'做一个XX的短视频
3 images, one prompt, instant A/B/C. Nano Banana Pro's natural randomness gives you three distinct takes on any image id
Automate social media posting across Twitter, LinkedIn, Facebook, Instagram. Schedule posts, track engagement, auto-repl
Research a topic from the last 30 days. Also triggered by 'last30'. Sources: Reddit, X, YouTube, TikTok, Instagram, Hack
Fetch YouTube video subtitles/captions using Felo YouTube Subtitling API. Use when users ask to get YouTube subtitles, e
Fetch, transcribe, and summarize YouTube or Bilibili videos using subtitles, cloud STT, local Whisper, or description fa
Reverse-engineer any LinkedIn profile's content strategy — pillars, hooks, CTAs, and PDF report
Translate files (PDF, DOCX, PPTX) to any language using the Bluente Translation API. Asks for API key, source files, tar
Generate background music from text description. Use when users request text_to_music operations or related tasks.
Continue/extend existing 5-second video by generating next segment. Use when users request video_continuation operations
Generate videos from image + text prompt. Animates static images with motion and camera movements. Use when users reques
专业AI视频生成器,支持文本转高质量短视频,批量处理、多模板和高级自定义语音功能,适合创作者和企业。
Track global research, essays, discourse, and media discussion about "mommy-type" characters, mommy issues, da
Voice note transcription and archival for OpenClaw agents. Powered by Deepgram Nova-3. Transcribes audio messages, saves
Build and manage Voice AI agents using Vapi, Bland.ai, or Retell. Create agents, configure voices, set prompts, make out