[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"project-80046":3},{"id":4,"name":5,"fullName":6,"owner":7,"repo":5,"description":8,"homepage":9,"htmlUrl":10,"language":11,"languages":10,"totalLinesOfCode":10,"stars":12,"forks":13,"watchers":14,"openIssues":15,"contributorsCount":15,"subscribersCount":15,"size":15,"stars1d":15,"stars7d":15,"stars30d":16,"stars90d":15,"forks30d":15,"starsTrendScore":15,"compositeScore":17,"rankGlobal":10,"rankLanguage":10,"license":10,"archived":18,"fork":18,"defaultBranch":19,"hasWiki":20,"hasPages":18,"topics":21,"createdAt":10,"pushedAt":10,"updatedAt":22,"readmeContent":23,"aiSummary":24,"trendingCount":15,"starSnapshotCount":15,"syncStatus":16,"lastSyncTime":25,"discoverSource":26},80046,"claude-vision","mikefutia\u002Fclaude-vision","mikefutia","Claude Vision Skill (Mike Futia | SCALE AI)","https:\u002F\u002Fwww.skool.com\u002Fscale-ai\u002Fabout",null,"Python",65,17,63,0,2,3.77,false,"main",true,[],"2026-06-12 02:03:57","# Claude Vision — Video Analyzer Skill\n\nJoin 550+ Performance Marketing Teams Inside SCALE AI: https:\u002F\u002Fwww.skool.com\u002Fscale-ai\u002Fabout\n\nGive Claude Code the ability to \"watch\" videos.\n\nThis is a Claude Code skill that routes any video file through Google's Gemini API (which has native video understanding) and returns a structured markdown report — top-level summary, scene-by-scene breakdown with timestamps, audio transcript, visual details, and key moments. Works on screen recordings, UGC ads, tutorials, demos, meeting recordings — anything Gemini can ingest.\n\nIt has strong anti-hallucination guardrails: it will not invent narrators, voiceovers, or speaker names that aren't actually in the video.\n\n## Install\n\n### 1. Clone this repo\n\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fmikefutia\u002Fclaude-vision.git\n```\n\n### 2. Move it into your Claude Code skills folder\n\n```bash\nmv claude-vision ~\u002F.claude\u002Fskills\u002Fvideo-analyzer\n```\n\nThe folder name **must** be `video-analyzer` — that's how Claude Code finds the skill.\n\n### 3. Get a free Gemini API key\n\nGo to [Google AI Studio](https:\u002F\u002Faistudio.google.com\u002Fapikey) and create a key. The free tier is generous and fine for personal use.\n\n### 4. Set the API key\n\nThe easiest way: open Claude Code in any project and ask it to set up the key for you. Something like:\n\n> \"Set my GEMINI_API_KEY to `your_key_here` so it's available in every new shell.\"\n\nClaude Code will add the export to your shell profile and confirm it works. You won't need to touch `.zshrc` yourself.\n\n### 5. Install the Python dependency\n\nThe skill uses Google's official Gemini SDK:\n\n```bash\npip install google-genai\n```\n\nIf pip complains about an externally-managed environment, use:\n\n```bash\npip install google-genai --break-system-packages\n```\n\n### 6. Use the skill\n\nIn Claude Code, just point it at a video:\n\n> \"Use the video-analyzer skill on \u002Fpath\u002Fto\u002Fmy-video.mp4\"\n\nOr invoke it directly:\n\n> \"\u002Fvideo-analyzer ~\u002FDownloads\u002Fdemo.mp4\"\n\nClaude will run the analysis and present the structured report.\n\n## What you can do with it\n\n- **Ad teardowns** — drop in a competitor's UGC ad, get a beat-by-beat breakdown\n- **Tutorial → SOP** — turn a Loom recording into a written step-by-step guide\n- **Meeting recaps** — extract decisions and action items from a call\n- **Demo notes** — summarize what happened in a screen recording\n- **General \"what's in this video?\"** — any video, any question\n\n## Supported formats\n\nmp4, mov, webm, avi, mpeg, mpg, flv, wmv, 3gpp, 3gp\n\n## Optional flags\n\n```\n\u002Fvideo-analyzer \u003Cpath> [--prompt \"custom prompt\"] [--fps N] [--model gemini-2.5-flash]\n```\n\n- `--prompt` — override the default structured-report prompt with anything you want\n- `--fps` — change the frame sampling rate (default 1 fps; raise it for fast-cut content)\n- `--model` — pick a different Gemini model (default `gemini-3-flash-preview`)\n\n## Troubleshooting\n\n- **\"GEMINI_API_KEY environment variable is not set\"** — your key isn't visible to the shell Claude Code is running in. Open a new terminal and try again, or ask Claude Code to fix it.\n- **\"google-genai is not installed\"** — run `pip install google-genai` (see step 5).\n- **Upload timeout on big files** — Gemini's Files API can take 30–60 seconds to process longer videos. The script polls for up to 5 minutes before giving up.\n- **Model 404** — try `--model gemini-2.5-flash` if the default preview model isn't available in your region.\n\n## License\n\nMIT — do whatever you want with it.\n","Claude Vision 是一个视频分析工具，能够通过Google的Gemini API对视频内容进行深度解析，并生成结构化的Markdown报告。其核心功能包括提供视频的总体概述、逐场景分解（附带时间戳）、音频转录、视觉细节及关键时刻点。该技能具有强大的防幻觉机制，确保不会生成视频中不存在的信息。适用于广告拆解、教程转SOP文档、会议记录整理、演示笔记以及任何需要了解视频内容的场景。支持多种视频格式，如mp4、mov等，同时允许用户自定义分析参数以满足特定需求。","2026-06-11 03:59:02","CREATED_QUERY"]