Autoimprove is a Claude Code skill. Review any finished creative (text, graphic, video, audio, presentation, AI prompt, writing) and autonomously regenerate an improved version. It includes 1 file and works well with dan-kennedy-copywriter, conversion-copywriting, autolearn.
---
name: autoimprove
version: 1.1.0
description: "Review any finished creative (text, graphic, video, audio, presentation, AI prompt, writing) and autonomously regenerate an improved version. Use after generating any creative output. Also use when the user mentions 'autoimprove,' 'improve this,' 'critique and redo,' 'make it better,' 'review and regenerate,' 'polish this creative,' 'improve this prompt,' 'better prompt,' 'rewrite this,' or 'tighten this up.'"
---
# Autoimprove
Autonomous critique-and-regenerate loop for any creative output. Finds the most recent creative in the conversation, grades it against domain-specific quality criteria, and regenerates an improved version. Supports text, graphics, video, presentation, audio, AI prompts, and general writing domains.
---
## Before Starting
1. **Find the creative.** Scan the conversation for the most recent creative output (copy, image, code, script, prompt).
2. **Detect the type.** Use the `{{type}}` argument if provided. Otherwise, auto-detect from the table below.
3. **Extract the original brief.** Find the user's request that triggered the creative. You need this for regeneration.
4. **If no creative found:** Say "No recent creative found. Generate something first, then run /autoimprove." and stop.
---
## Creative Type Detection
| Type | Detection Signals |
|------|-------------------|
| `carousel` | Numbered slides, "Slide 1/2/3", carousel copy, slide-by-slide breakdown |
| `email` | Subject line, email body, "Hi {{name}}", nurture sequence, drip campaign |
| `ad-copy` | Primary text + headline + description, Meta ad format, ad variations |
| `graphic-ad` | `nano-banana` command output, image file path (.png/.jpg), fal.ai prompt, ad image |
| `video-script` | Beat sheet, timing marks `[0:00]`, kinetic text beats, Remotion composition |
| `caption` | Social media caption, hashtags, platform-specific formatting |
| `script` | YouTube script, retention hooks, timestamps, video outline |
| `presentation` | Reveal.js HTML, slide deck, speaker notes, `<section>` tags |
| `audio-script` | Voiceover script, ElevenLabs directions, podcast outline, spoken delivery notes |
| `ai-prompt` | System prompt, user prompt, SKILL.md, agent instructions, prompt template, few-shot examples |
| `writing` | Blog post, article, thread, long-form copy, documentation, guide, tutorial, newsletter |
| `landing-page` | Hero section, feature blocks, testimonial section, HTML/JSX page code |
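The detection table can be approximated with simple pattern matching. The sketch below is illustrative only: `detectType`, the `SIGNALS` list, and the regex patterns are hypothetical names and assumptions, they cover only a few rows of the table, and the `writing` fallback is an assumed default.

```typescript
// Hypothetical helper: maps a few detection signals from the table to a type.
// The patterns are illustrative assumptions, not an exhaustive rule set.
type CreativeType = "carousel" | "email" | "video-script" | "presentation" | "writing";

const SIGNALS: Array<[CreativeType, RegExp]> = [
  ["carousel", /\bslide\s*\d+/i],       // "Slide 1", "Slide 2", ...
  ["email", /\bsubject( line)?:/i],     // "Subject:" or "Subject line:"
  ["video-script", /\[\d+:\d{2}\]/],    // timing marks such as [0:00]
  ["presentation", /<section[\s>]/i],   // Reveal.js <section> tags
];

function detectType(creative: string, explicitType?: CreativeType): CreativeType {
  // An explicit type argument always wins over auto-detection.
  if (explicitType) return explicitType;
  for (const [type, pattern] of SIGNALS) {
    if (pattern.test(creative)) return type;
  }
  // Assumed fallback when no signal matches.
  return "writing";
}
```

The first matching signal wins, so the order of `SIGNALS` matters when a creative carries signals from more than one row.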
---
## Step 1: Run the Critique Pass
Evaluate the creative against **every applicable criterion** below. For each check, record: **pass** or **fail**, plus specific **evidence** (quote the exact text, describe the visual issue, or cite the missing element).
### Universal Checks (ALL creative types)
Apply these to every creative regardless of type.
**Deterministic (scan the text):**
1. **Banned AI words** — Scan for ALL of these. Any match = fail:
delve, tapestry, realm, landscape, ever-evolving, cutting-edge, robust, transformative, pivotal, vibrant, crucial, compelling, seamless, groundbreaking, leverage, harness, embark, navigate (metaphorical use), unveil, facilitate, synergy, game-changer, unlock, unleash, elevate, utilize, endeavour, multifaceted
2. **Banned AI phrases** — Any match = fail:
"In today's ever-evolving...", "Unlock the power of", "Master the art of", "Let's delve into", "Harness the power of", "Push the boundaries of"
3. **Em dash / en dash** — Scan for U+2014 (em dash) and U+2013 (en dash). Any match = fail. Use regular hyphens only.
**LLM judgment:**
4. **Brand voice** — Professional yet approachable, educational and value-driven. Sounds like a knowledgeable peer, not a textbook or corporate memo.
5. **Specificity** — Concrete numbers, examples, timeframes, or results. Vague claims ("grow your business", "get more leads") = fail.
6. **CTA strength** — Clear call-to-action with an action verb and obvious benefit. Missing or weak CTA = fail.
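The three deterministic checks (1-3) lend themselves to a plain text scan. A minimal sketch, assuming a hypothetical `scanDeterministic` helper and using only a subset of the banned lists; matching is exact, so inflected forms such as "leveraging" would need extra patterns.

```typescript
// Subset of the banned lists above; extend with the remaining entries as needed.
const BANNED_WORDS = ["delve", "tapestry", "seamless", "leverage", "utilize", "game-changer"];
const BANNED_PHRASES = ["unlock the power of", "master the art of", "let's delve into"];

interface ScanResult {
  pass: boolean;
  evidence: string[]; // one entry per match, usable as critique evidence
}

// Hypothetical helper covering checks 1-3.
function scanDeterministic(text: string): ScanResult {
  const evidence: string[] = [];
  const lower = text.toLowerCase();

  for (const word of BANNED_WORDS) {
    // Word boundaries avoid matching inside longer words.
    const pattern = new RegExp(`\\b${word}\\b`, "i");
    if (pattern.test(text)) evidence.push(`banned word: ${word}`);
  }
  for (const phrase of BANNED_PHRASES) {
    if (lower.includes(phrase)) evidence.push(`banned phrase: ${phrase}`);
  }
  // U+2014 is the em dash, U+2013 is the en dash.
  if (/[\u2014\u2013]/.test(text)) evidence.push("em or en dash");

  return { pass: evidence.length === 0, evidence };
}
```

The `evidence` array doubles as the quoted proof that the critique pass asks for.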
---
### Carousel Checks
7. **Slide count** — 8-10 slides optimal. Under 6 or over 12 = fail.
8. **Words per slide** — Maximum 15 words per slide. Any slide over 15 = fail.
9. **Hook (slide 1)** — Bold claim, provocative question, or surprising stat. Generic opening = fail.
10. **CTA (final slide)** — Clear call-to-action (follow, DM, link, save, share).
11. **Flow** — Each slide builds on the previous. Logical progression, not a random list.
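Checks 7 and 8 are countable, so they can be verified mechanically before the judgment-based checks run. A rough sketch, assuming the carousel has already been split into one string per slide; `checkCarousel` is a hypothetical name.

```typescript
// Hypothetical helper for checks 7 and 8: one string per slide.
function checkCarousel(slides: string[]): { slideCountPass: boolean; overLimit: number[] } {
  const countWords = (s: string): number => s.trim().split(/\s+/).filter(Boolean).length;
  // Under 6 or over 12 slides is a fail; 8-10 is the stated optimum.
  const slideCountPass = slides.length >= 6 && slides.length <= 12;
  // 1-based slide numbers whose copy exceeds 15 words.
  const overLimit = slides
    .map((text, i) => (countWords(text) > 15 ? i + 1 : 0))
    .filter((n) => n > 0);
  return { slideCountPass, overLimit };
}
```

Returning the offending slide numbers, not just a boolean, gives the diagnosis step something specific to cite.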
---
### Email Checks
12. **Subject line** — Creates curiosity, states a benefit, or drives urgency. Generic = fail.
13. **No em dashes** — Emails must NEVER contain em dashes or en dashes. Regular hyphens only.
14. **Opening hook** — First sentence grabs attention. Filler openings ("I hope this finds you well") = fail.
15. **Single CTA** — One primary call-to-action per email, not 3 competing links.
16. **Conversational tone** — Reads like a message from a knowledgeable friend, not a marketing blast.
17. **P.S. line** — Bonus points if present with a secondary hook or urgency element.
---
### Ad Copy Checks
18. **Hook (first line)** — Stops the scroll with a stat, question, bold claim, or pattern interrupt.
19. **Framework** — PAS (Problem-Agitation-Solution) or AIDA structure present. Unstructured rambling = fail.
20. **Social proof** — Testimonials, stats, case studies, or credibility markers included.
21. **Urgency** — If used, must be genuine and specific ("5 spots left this month"), not fake ("Act now!!!").
22. **Benefit over features** — Leads with what the reader gets, not what the product does.
---
### Script Checks (YouTube, long-form video)
23. **Retention hooks** — Present at 0s, 30s, 60s, and every 2-3 minutes.
24. **Pattern interrupts** — Varies pacing with story, stat, question, list, analogy.
25. **Open loops** — Teases upcoming content to keep viewers watching.
26. **Cold open** — Starts with the most compelling moment, not a slow intro.
---
### Graphic Ad Checks (NEW)
For outputs from `nano-banana`, `fal.ai`, or any generated image.
**If an image file was generated (PNG/JPG):**
Use the `Read` tool to view the image, then grade visually:
27. **Brand colors** — #ED0D51 (hot pink) accent visible? Background appropriate (#0D0D0D dark or #FFFFFF white)?
28. **Text legibility** — All text readable at mobile size? Nothing cut off at edges?
29. **CTA button** — Visible CTA button present? Hot pink pill shape with white text?
30. **Layout balance** — Content well-distributed? No awkward empty space or crowding?
31. **Brand mark** — "Lead Gen Jay" or "@leadgenjay" visible somewhere?
32. **Aspect ratio** — Appears to match the target format (4:5 portrait, 1:1 square, 9:16 story)?
33. **No visual artifacts** — No garbled text, distorted faces, or AI generation artifacts?
**If a nano-banana prompt was generated:**
34. **Background color** — Explicitly specified (#FFFFFF or #0D0D0D)?
35. **Typography fonts** — Big Shoulders Black and Manrope mentioned?
36. **Accent color** — #ED0D51 specified for highlight elements?
37. **CTA button described** — Hot pink pill-shaped button with white text?
38. **Anti-slop directive** — "No gradients, no stock photos, no decorative filler" present?
39. **Placeholders filled** — No leftover `[HEADLINE]`, `[CTA_TEXT]`, `[ITEM1]` placeholders?
40. **Prompt length** — Between 100-400 words? Too short = underspecified, too long = conflicting instructions.
**Improvement method for graphics:**
- Rewrite the nano-banana prompt with all fixes applied
- Re-run the `nano-banana` CLI command to generate the improved image
- Show both the original and improved images to the user
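Several of the prompt checks (34, 36, 39, 40) can be pre-screened with string tests before rewriting. The sketch below assumes the prompt is available as plain text; `checkPrompt` is a hypothetical helper, and the placeholder pattern assumes placeholders are uppercase tokens in square brackets, as in the examples given for check 39.

```typescript
// Hypothetical pre-screen for a nano-banana prompt.
function checkPrompt(prompt: string) {
  // Check 39: leftover placeholders look like [HEADLINE], [CTA_TEXT], [ITEM1].
  const placeholders: string[] = prompt.match(/\[[A-Z][A-Z0-9_]*\]/g) ?? [];
  // Check 40: prompt length between 100 and 400 words.
  const words = prompt.trim().split(/\s+/).filter(Boolean).length;
  // Checks 34 and 36: brand hex codes explicitly present.
  const hasBackground = /#FFFFFF|#0D0D0D/i.test(prompt);
  const hasAccent = /#ED0D51/i.test(prompt);
  return { placeholders, lengthPass: words >= 100 && words <= 400, hasBackground, hasAccent };
}
```

The remaining prompt checks (fonts, CTA description, anti-slop directive) depend on wording and are better left to judgment.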
---
### Video Script / Kinetic Text Ad Checks (NEW)
For beat sheets, kinetic typography scripts, and Remotion compositions.
41. **Beat count** — Matches target duration: 60s = 18-22 beats, 45s = 14-16 beats, 30s = 9-11 beats.
42. **Words per beat** — Maximum 8 words per beat. Any beat over 8 = fail.
43. **Hook beat** — First beat stops the scroll (stat, question, bold claim). Weak opener = fail.
44. **CTA beat** — Final beat has clear CTA with action verb and keyword/DM trigger.
45. **Pacing variation** — Mix of short punchy beats (2-3 words) and longer beats (6-8 words). All same length = fail.
46. **Accent words marked** — Each beat identifies which words display in hot pink (#ED0D51).
47. **Framework applied** — PAS (ToF), AIDA (MoF), or Proof Stack (BoF) structure matches funnel stage.
48. **Timing annotations** — Beats annotated with approximate timestamps `[0:00]`, `[0:03]`, etc.
49. **Sound-off friendly** — All key information conveyed in text. Script works without audio.
50. **Brand elements** — Background #0D0D0D, text #FFFFFF, accent #ED0D51, font Big Shoulders Black uppercase.
**Improvement method for video:**
- Rewrite the entire beat sheet with all fixes applied
- Recalculate timing annotations
- Present in the same table/format as the original
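The numeric beat checks (41, 42, 45) can be sketched as follows. This assumes the beat sheet has been reduced to one string per beat; `checkBeats` and `BEAT_RANGES` are hypothetical names, and durations other than the three listed targets are left ungraded, which is an assumption.

```typescript
// Beat-count ranges from check 41, keyed by target duration in seconds.
const BEAT_RANGES: Record<number, [number, number]> = {
  60: [18, 22],
  45: [14, 16],
  30: [9, 11],
};

// Hypothetical helper: beats is one string per beat.
function checkBeats(beats: string[], durationSeconds: number) {
  const wordCount = (s: string): number => s.trim().split(/\s+/).filter(Boolean).length;
  const range: [number, number] | undefined = BEAT_RANGES[durationSeconds];
  // null means "no target range known for this duration" (assumption).
  const beatCountPass = range ? beats.length >= range[0] && beats.length <= range[1] : null;
  // Check 42: no beat may exceed 8 words.
  const wordsPerBeatPass = beats.every((b) => wordCount(b) <= 8);
  // Check 45: identical word counts across all beats means no pacing variation.
  const pacingPass = new Set(beats.map(wordCount)).size > 1;
  return { beatCountPass, wordsPerBeatPass, pacingPass };
}
```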
---
### Presentation Checks (NEW)
For Reveal.js slide decks and presentation HTML.
51. **Slide count** — Appropriate for talk length (~1 content slide per 2 minutes + title/closing).
52. **Typography compliance** — h1/h2 use Big Shoulders Display (uppercase, 900/800 weight)? Body uses Manrope?
53. **Color palette** — Using brand tokens: accent #ED0D51, text #111111, body #555555, dark-bg #0D0D0D?
54. **Content density** — No wall-of-text slides? One key point per slide? Max ~30 words per slide.
55. **Speaker notes** — Present for every content slide?
56. **Visual variety** — Mix of text-only, screenshot, and dark/light background slides?
57. **Narrative arc** — Opening hook, building tension/value, payoff, closing CTA?
58. **Spacing** — Standard `padding: 40px 60px`? Content fills the slide without wasted whitespace?
**If HTML file exists on disk:** Render key slides with Playwright, capture screenshots, vision-grade for layout and readability.
**Improvement method for presentations:**
- Rewrite the HTML file with all fixes applied
- Re-render and verify visually if possible
---
### Audio / Voiceover Script Checks (NEW)
For ElevenLabs voiceover scripts, podcast outlines, and spoken-word content.
59. **Sentence length** — Sentences under 20 words for natural spoken delivery. Long sentences = fail.
60. **Breath marks** — Natural pause points (commas, periods, ellipses) every 10-15 words.
61. **Tongue twisters** — No awkward consonant clusters or hard-to-pronounce sequences.
62. **Pace variation** — Mix of short punchy lines and flowing sentences. Monotone pacing = fail.
63. **Emphasis markers** — Key words/phrases marked for vocal emphasis (bold, CAPS, or explicit `[emphasis]` notes).
64. **Opening energy** — First line sets the right tone and energy level for the piece.
65. **Conversational flow** — Sounds like natural speech when read aloud, not written prose.
66. **Filler-free** — No "um", "uh", "you know", "like" unless intentionally placed for authenticity.
67. **Audio direction** — Includes pace/tone notes for the voice artist: `[pause]`, `[slower]`, `[excited]`.
**Improvement method for audio:**
- Rewrite the script with spoken-word optimizations
- Add missing audio direction markers
- Ensure every sentence reads naturally when spoken aloud
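Check 59 can be approximated by splitting the script into sentences and counting words. A naive sketch, assuming a hypothetical `longSentences` helper; it splits on terminal punctuation only, so abbreviations such as "e.g." will over-split, and bracketed audio directions are removed before counting.

```typescript
// Check 59 sketch: return sentences of 20 words or more.
function longSentences(script: string, maxWords = 19): string[] {
  return script
    // Drop bracketed audio directions such as [pause] so they are not counted as words.
    .replace(/\[[^\]]*\]/g, " ")
    // Naive split on terminal punctuation.
    .split(/[.!?]+/)
    .map((s) => s.trim())
    .filter((s) => s.split(/\s+/).filter(Boolean).length > maxWords);
}
```

An empty result means every sentence is under 20 words; anything returned can be quoted as evidence for the failure.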
---
### AI Prompt Checks (NEW)
For system prompts, SKILL.md files, agent instructions, prompt templates, and few-shot examples.
68. **Clear role definition** — Does the prompt establish WHO the AI is and WHAT it does in the first 2-3 sentences? Vague or missing role = fail.
69. **Specificity over vagueness** — Are instructions concrete and actionable? "Write good copy" = fail. "Write 3 headline variations under 8 words using PAS framework" = pass.
70. **Output format defined** — Does the prompt specify the exact output structure (JSON, markdown, table, numbered list)? Ambiguous output expectations = fail.
71. **Constraints explicit** — Are boundaries clearly stated? (word limits, banned patterns, required elements, tone). Constraints buried in paragraphs = fail -- should be in lists or tables.
72. **Examples included** — For complex tasks, are good/bad examples provided? Tasks with subjective quality standards need at least one example.
73. **No conflicting instructions** — Do any two rules contradict each other? ("Be concise" + "Be comprehensive" = fail). Scan for tension between directives.
74. **Escape hatches** — Does the prompt handle edge cases? What if input is missing, malformed, or out of scope? No guidance = fail for production prompts.
75. **Instruction hierarchy** — Are the most important rules positioned first or visually emphasized (bold, headers, lists)? Critical rules buried at the bottom = fail.
76. **No prompt injection vulnerability** — Does the prompt guard against user input overriding system instructions? Unprotected user input interpolation = fail.
77. **Testability** — Could you write binary pass/fail assertions against this prompt's output? If the success criteria are too vague to test, the prompt is too vague.
**Improvement method for AI prompts:**
- Rewrite the prompt with all fixes applied
- Add missing role definition, output format, constraints, or examples
- Restructure for clarity: role -> context -> instructions -> constraints -> output format -> examples
---
### Writing Checks (NEW)
For blog posts, articles, threads, newsletters, guides, tutorials, and long-form content.
78. **Opening hook** — First paragraph grabs attention with a bold claim, surprising stat, story, or question. Slow warmup intros = fail.
79. **Clear thesis** — Is there a single, identifiable main argument or takeaway? Meandering without a point = fail.
80. **Structure** — Logical flow with clear sections/headers? Reader can skim and get the gist? Wall of text = fail.
81. **Sentence variety** — Mix of short punchy sentences and longer flowing ones. All same length = monotone = fail.
82. **Active voice** — Predominantly active voice. Excessive passive constructions ("was done by", "has been shown to") = fail.
83. **Show don't tell** — Concrete examples, stories, or data instead of abstract claims. "It's really important" = fail. "47% of cold emails never get opened" = pass.
84. **Reader value** — Does every section give the reader something useful (insight, tactic, framework)? Filler paragraphs = fail.
85. **Transitions** — Smooth flow between sections. Abrupt topic changes without connective tissue = fail.
86. **Closing strength** — Ends with a clear takeaway, CTA, or memorable line. Just stopping = fail.
87. **Appropriate length** — Content density matches the format. Blog posts: 800-2000 words. Threads: 5-15 posts. Newsletters: 500-1000 words. Severely over/under = fail.
**Improvement method for writing:**
- Rewrite the full piece with all fixes applied
- Tighten prose, sharpen the hook, add missing examples or data
- Maintain the original voice and argument
---
## Step 2: Score and Diagnose
Calculate results:
```typescript
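// results: one boolean per applicable check for this creative type (true = pass)
function scoreChecks(results: boolean[]): number {
  const passed = results.filter(Boolean).length; // applicable checks that passed
  const total = results.length;                  // applicable checks for this type
  return total === 0 ? 0 : Math.round((passed / total) * 100); // score as a percentage
}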
```
For each failure, create a diagnosis:
- **What failed:** The specific check number and name
- **Evidence:** Quote the exact text, describe the visual issue, or cite the missing element
- **Fix:** The specific replacement text, rewritten prompt section, or structural change needed. Not vague advice -- write the actual fix.
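One way to keep the three diagnosis fields together is a small record per check. The shapes and the `summarize` helper below are hypothetical, sketched only to show how check results could feed both the score and the diagnosis list; rounding to a whole percent is an assumption.

```typescript
// Hypothetical shape for one check result and its diagnosis.
interface CheckResult {
  id: number;       // check number, e.g. 8
  name: string;     // check name, e.g. "Words per slide"
  pass: boolean;
  evidence: string; // quoted text or description of the issue
  fix?: string;     // concrete replacement, expected whenever pass is false
}

function summarize(results: CheckResult[]): { score: number; diagnoses: string[] } {
  const passed = results.filter((r) => r.pass).length;
  const score = results.length === 0 ? 0 : Math.round((passed / results.length) * 100);
  // One line per failure: what failed, the evidence, and the actual fix.
  const diagnoses = results
    .filter((r) => !r.pass)
    .map((r) => `Check ${r.id} (${r.name}): ${r.evidence} -> ${r.fix ?? "FIX MISSING"}`);
  return { score, diagnoses };
}
```

The `FIX MISSING` marker makes a failure without a concrete fix easy to spot before regeneration starts.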
---
## Step 3: Regenerate
Produce the improved creative with all fixes applied:
1. Start from the original user brief
2. Apply every fix from the diagnosis
3. Maintain the original intent, topic, and key messages
4. Use the correct output format for the domain:
- **Text domains** (carousel, email, ad-copy, caption, script): Rewrite the copy
- **Graphic-ad**: Rewrite the nano-banana prompt, then re-run the CLI command
- **Video-script**: Rewrite the full beat sheet with timing
- **Presentation**: Rewrite the HTML file
- **Audio-script**: Rewrite the spoken script with audio directions
- **AI-prompt**: Restructure and rewrite the prompt (role -> context -> instructions -> constraints -> output format -> examples)
- **Writing**: Rewrite the full piece with tighter prose, stronger hook, and better structure
- **Landing-page**: Rewrite the page code/copy
5. Do NOT mention the critique process in the output -- produce the improved creative as if it were the first draft
The regenerated output must be **complete and ready to use** -- not a diff or patch.
---
## Step 4: Present Results
Show the user:
### Improvements Made
- 3-5 bullet points summarizing key changes (e.g., "Replaced vague hook with specific stat", "Fixed 2 beats over 8-word limit", "Added missing CTA button to prompt")
### Quality Score
- Before: X% (N/M checks passed)
- After: Y% (should be higher)
### Improved Creative
- The full regenerated output, ready to use
- For graphics: show the improved image (and the original for comparison if both exist)
---
## Step 5: Loop Option
After presenting the improved version, ask:
**"Run another improvement pass? (y/n)"**
- If yes: repeat Steps 1-4 on the NEW version (not the original)
- Maximum **3 total passes**. After pass 3, present the final version regardless.
- If the score hits **100%**, stop early -- no further passes needed.
- Each pass should show the cumulative score progression: "Pass 1: 65% -> Pass 2: 88% -> Pass 3: 96%"
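The loop rules above can be sketched as a bounded loop. `runLoop`, `Pass`, and `wantsAnother` are hypothetical names; the sketch assumes each pass reports the score of the version it produced, and it leaves critique and regeneration to the caller.

```typescript
// Hypothetical pass function: takes the current creative, returns the improved
// version and its score (0-100).
type Pass = (creative: string) => { improved: string; score: number };

function runLoop(original: string, runPass: Pass, wantsAnother: () => boolean) {
  const MAX_PASSES = 3;
  const scores: number[] = [];
  let current = original;

  for (let pass = 1; pass <= MAX_PASSES; pass++) {
    // Each pass works on the newest version, never the original.
    const result = runPass(current);
    current = result.improved;
    scores.push(result.score);
    // Stop early at 100%, at the pass cap, or when the user declines.
    if (result.score === 100 || pass === MAX_PASSES || !wantsAnother()) break;
  }

  // Cumulative progression, e.g. "Pass 1: 65% -> Pass 2: 88%".
  const progression = scores.map((s, i) => `Pass ${i + 1}: ${s}%`).join(" -> ");
  return { final: current, progression };
}
```

Checking the pass cap before calling `wantsAnother` avoids prompting the user for a fourth pass that can never run.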
---
## Autonomy Rules
- Do NOT ask for permission between the critique and regeneration steps. Run the full pipeline automatically.
- Do NOT show the detailed critique table to the user. They want the improved result, not a grading report. The summary bullets are sufficient.
- Do NOT skip the regeneration step. The whole point is producing a better version, not just pointing out problems.
- For graphic regeneration: run the `nano-banana` command automatically. Do not ask "should I regenerate?"
- If a check is not applicable to the detected type, skip it silently. Do not report "N/A" checks.
- If you cannot find a recent creative in the conversation, say so and stop. Do not fabricate a creative to critique.