Browser Automation in Node.js: Record Narrated Videos Without Managing a Browser

Most browser automation tutorials start with npm install puppeteer and end with you debugging why your CI container ran out of memory at 2am.

There's a different model. Send the steps you want the browser to execute — navigate, click, fill, scroll — to an API. Get the result back: a video, a screenshot, a PDF. No browser process to manage. No Chromium binary. No memory leaks.

Here's what that looks like in Node.js.

Record a narrated browser automation video

The highest-value thing you can do with browser automation isn't screenshots — it's video. A narrated recording that shows your app working, in a real browser, with a voice explaining each step.

import fs from 'fs';

const response = await fetch('https://pagebolt.dev/api/v1/video', {
  method: 'POST',
  headers: {
    'x-api-key': process.env.PAGEBOLT_API_KEY,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    steps: [
      { action: 'navigate', url: 'https://yourapp.com', note: 'Opening the app' },
      { action: 'click', selector: '#sign-in', note: 'Signing in' },
      { action: 'fill', selector: '#email', value: 'demo@example.com' },
      { action: 'fill', selector: '#password', value: 'password123' },
      { action: 'click', selector: '#submit', note: 'Submitting' },
      { action: 'wait', ms: 1500 },
      { action: 'screenshot', name: 'dashboard' }
    ],
    audioGuide: {
      enabled: true,
      voice: 'nova',
      provider: 'openai',
      script: 'Welcome to the app. {{1}} Click Sign In to open the form. {{3}} Enter your credentials and submit. {{5}} The dashboard loads in under two seconds.'
    },
    pace: 'slow',
    frame: { enabled: true, style: 'macos' }
  })
});

fs.writeFileSync('demo.mp4', Buffer.from(await response.arrayBuffer()));

The {{N}} markers in the script sync the narration to specific steps. Step 3 is the email fill — the voice says that line as the browser types. The result is an MP4 with a macOS browser chrome frame, AI narration, and natural pacing.

Voices: nova, alloy, echo, fable, onyx, shimmer (OpenAI) or emma, ava, brian, aria (Azure).

Multi-step automation sequences

For flows that span multiple pages — onboarding, checkout, form submission — use the sequence endpoint. It captures screenshots at each step and returns them as a JSON response:

const response = await fetch('https://pagebolt.dev/api/v1/sequence', {
  method: 'POST',
  headers: {
    'x-api-key': process.env.PAGEBOLT_API_KEY,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    steps: [
      { action: 'navigate', url: 'https://yourapp.com/checkout' },
      { action: 'fill', selector: '#card-number', value: '4242 4242 4242 4242' },
      { action: 'fill', selector: '#expiry', value: '12/27' },
      { action: 'click', selector: '#pay-btn' },
      { action: 'wait', ms: 2000 },
      { action: 'screenshot', name: 'confirmation' }
    ]
  })
});

const { screenshots } = await response.json();
// screenshots: [{ name: 'confirmation', data: 'base64...' }]

Use this for end-to-end smoke tests, automated QA screenshots, or generating before/after comparisons.

Take a screenshot

The simplest case — a single page capture:

const response = await fetch('https://pagebolt.dev/api/v1/screenshot', {
  method: 'POST',
  headers: {
    'x-api-key': process.env.PAGEBOLT_API_KEY,
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    url: 'https://yourapp.com',
    fullPage: true,
    fullPageScroll: true,  // forces lazy-loaded images to render
    blockBanners: true     // removes cookie consent popups
  })
});

fs.writeFileSync('page.png', Buffer.from(await response.arrayBuffer()));

fullPageScroll is the one that catches people — without it, any content below the viewport that lazy-loads will appear blank.

Run through bot detection

Some pages return blank pages or redirect to a CAPTCHA for headless browsers. Pass stealth: true to mask browser fingerprints:

body: JSON.stringify({
  url: 'https://protected-site.com',
  stealth: true,
  fullPage: true
})

Works across screenshots, sequences, and video. If your target site uses IP-based blocking, add a proxy parameter with your own proxy URL.

What you don't need

No npm install puppeteer. No npm install playwright. No:

Chromium binary downloaded on install
PUPPETEER_SKIP_CHROMIUM_DOWNLOAD environment variable hacks
--no-sandbox flags for Docker
Browser pool management
Memory leak monitoring
pm2 restart loops

The browser runs on PageBolt's infrastructure. You send steps, you get output.

Try it free

Free tier includes 100 requests/month — enough to build and test a full automation workflow before you pay anything.

Get your free API key