Not a transcription tool.
An intelligent content platform.

Turn massive volumes of audio and video
into structured professional deliverables.
8 tabs. One pipeline. AI at every step.

An AI Guide helps you plan the perfect workflow. A Custom Prompt Engine designs the perfect extraction. AI Curation scores and sorts everything. You approve.

Researchers, Lawyers, Medical Professionals, Course Creators, Authors, Filmmakers, Journalists, Students

Extract

Paste URLs from 8 platforms. Import entire channels and playlists, or paste 100+ URLs at once. XTC downloads the audio and queues everything for processing.

YouTube, Instagram, TikTok, X, Vimeo, Spotify, Podcasts, Local Files
Full channel and playlist import with range bounds
Bulk URL paste: one URL per line, process hundreds at once
Background downloading while you keep working
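
As a rough illustration of the one-URL-per-line bulk paste, the sketch below splits pasted text, drops blank lines and exact duplicates, and tags each URL with a platform. The `PLATFORM_HOSTS` table, the `parse_bulk_urls` name, and the fallback to "Podcasts" for unrecognized hosts are assumptions made for this example, not XTC's documented behavior.

```python
from urllib.parse import urlparse

# Hypothetical host-to-platform table; XTC's real detection rules are not documented here.
PLATFORM_HOSTS = {
    "youtube.com": "YouTube",
    "youtu.be": "YouTube",
    "instagram.com": "Instagram",
    "tiktok.com": "TikTok",
    "x.com": "X",
    "twitter.com": "X",
    "vimeo.com": "Vimeo",
    "open.spotify.com": "Spotify",
}


def parse_bulk_urls(pasted: str) -> list[tuple[str, str]]:
    """Return (platform, url) pairs from text with one URL per line."""
    seen: set[str] = set()
    queue: list[tuple[str, str]] = []
    for line in pasted.splitlines():
        url = line.strip()
        if not url or url in seen:
            continue  # skip blank lines and exact duplicates
        seen.add(url)
        host = (urlparse(url).hostname or "").lower().removeprefix("www.")
        # Assumption: anything unrecognized is treated as a podcast feed or episode URL.
        queue.append((PLATFORM_HOSTS.get(host, "Podcasts"), url))
    return queue


pasted = """
https://www.youtube.com/watch?v=abc123
https://vimeo.com/12345

https://www.youtube.com/watch?v=abc123
https://example.com/feed.xml
"""
for platform, url in parse_bulk_urls(pasted):
    print(f"{platform:10} {url}")
```

The returned list preserves paste order, so it can be handed to a download queue as is.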

YouTube

Videos, channels, playlists. Set range bounds.

Instagram

Reels, stories, IGTV.

TikTok

Any public TikTok URL.

X (Twitter)

Spaces, video, voice tweets.

Vimeo

Public and unlisted videos.

Spotify

Podcast episodes.

Podcasts

RSS feeds, direct episode URLs.

Local Files

MP3, MP4, WAV, M4A, more.

Extract tab with bulk URL import

Transcribe

Four engines from fast drafts to studio-grade accuracy. AI-powered speaker diarization separates voices automatically. 90+ languages supported.

4 transcription engines: Fast Draft, Studio Quality, Studio Pro, AssemblyAI Cloud
Speaker diarization with one-click renaming
90+ languages via Whisper
Local processing: nothing leaves your machine (except AssemblyAI cloud option)

Fast Draft

Whisper tiny/base. Fastest turnaround.

Speed: Fastest
Quality: Good
Cost: Free (local)

Studio Quality

Whisper medium/large. Balanced.

Speed: Moderate
Quality: High
Cost: Free (local)

Studio Pro

Whisper large-v3 with diarization.

Speed: Slower
Quality: Highest (local)
Cost: Free (local)

AssemblyAI

Cloud engine. Native diarization.

Speed: Fast
Quality: Highest
Cost: Pay-per-use

Transcribe tab with speaker diarization

Correct

AI-powered transcript cleanup. Fix transcription errors, rename speakers, remove filler words, and merge segments. Integrated waveform editor for precise fixes when you need to verify against the audio.

AI cleanup via Claude: fix errors, clean up filler words, improve readability
One-click speaker renaming across the entire transcript
Integrated waveform + text editor for side-by-side correction
Merge or split transcript segments
Batch cleanup: run corrections across multiple transcripts at once

Correct tab with waveform editor and AI cleanup

Subtract

Remove the noise before you extract the signal. Strip intros, ads, outros, filler segments, and anything else that dilutes your source material.

Remove unwanted segments: intros, ads, outros, sponsor reads
Filter by speaker: keep only the guest, or only the host
Filter by keyword or topic
Strip timecodes and speaker labels when not needed
Clean transcript ready for extraction or export
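
The filters above could look roughly like this in code. This is a minimal sketch: the `Segment` fields and the `subtract` and `render` helpers are hypothetical names chosen for illustration, and keyword matching is a plain case-insensitive substring check rather than whatever topic detection the app uses.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float   # seconds from the start of the recording
    speaker: str
    text: str


def subtract(segments, keep_speakers=None, drop_keywords=()):
    """Keep segments from the chosen speakers that mention none of the keywords."""
    kept = []
    for seg in segments:
        if keep_speakers is not None and seg.speaker not in keep_speakers:
            continue  # filter by speaker
        lowered = seg.text.lower()
        if any(word.lower() in lowered for word in drop_keywords):
            continue  # filter by keyword
        kept.append(seg)
    return kept


def render(segments, labels=True):
    """Join segments into text, with or without timecodes and speaker labels."""
    lines = []
    for seg in segments:
        prefix = f"[{int(seg.start)}s] {seg.speaker}: " if labels else ""
        lines.append(prefix + seg.text)
    return "\n".join(lines)


transcript = [
    Segment(0.0, "Host", "This episode is sponsored by Acme."),
    Segment(12.0, "Guest", "The key result surprised us."),
    Segment(30.0, "Host", "Tell me more."),
]
clean = subtract(transcript, keep_speakers={"Guest"}, drop_keywords=["sponsored"])
print(render(clean, labels=False))
```

Passing `labels=True` keeps the timecode and speaker prefix for cases where the labels are still needed downstream.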

Subtract tab filtering out ads and intros

Organize

Semantic clustering groups hundreds of extractions into topics automatically. No manual sorting. Related quotes, insights, and dialogues land together.

Embedding-based semantic clustering by theme
Browse by topic instead of scrolling through a wall of text
Rename clusters, merge topics, split groups
Works across multiple sources: cluster quotes from 50 different videos into coherent themes
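
To give a feel for embedding-based clustering, here is a small sketch that groups items by cosine similarity to the first member of each cluster. It is a simple greedy scheme chosen for brevity, not necessarily the algorithm XTC runs, and the three-dimensional vectors are toy stand-ins for real sentence embeddings.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cluster(items, threshold=0.8):
    """Greedy clustering: join the first cluster whose seed is similar enough."""
    clusters = []  # each cluster is a list of (text, vector); item 0 is the seed
    for text, vec in items:
        for members in clusters:
            if cosine(vec, members[0][1]) >= threshold:
                members.append((text, vec))
                break
        else:
            clusters.append([(text, vec)])  # no match: start a new topic
    return [[text for text, _ in members] for members in clusters]


# Toy vectors standing in for real embeddings (an assumption for this example).
extractions = [
    ("Sleep drives memory consolidation.", (0.9, 0.1, 0.0)),
    ("Pricing should follow value.", (0.0, 0.2, 0.9)),
    ("Deep sleep locks in what you learned.", (0.8, 0.2, 0.1)),
    ("Charge for outcomes, not hours.", (0.1, 0.1, 0.9)),
]
for topic in cluster(extractions):
    print(topic)
```

Raising `threshold` yields more, tighter topics; lowering it merges loosely related extractions.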

Choose your LLM

Three providers. Your API keys. Pick the model that fits your budget and quality needs.

Claude (Anthropic)

Best for nuanced extraction and long-form content. Excellent at preserving voice and tone.

Haiku 3 ($), Sonnet ($$), Opus ($$$)

OpenAI

Reliable extraction with strong instruction following. Great all-rounder.

GPT-4o-mini ($), GPT-4o ($$)

Grok (xAI)

Fast, capable models from xAI. Growing model lineup.

Grok models

Organize tab with semantic clusters

Select

Keyboard-driven curation built for speed. Review extractions, keep the diamonds, discard the rest. Process hundreds of results in minutes, not hours.

Arrow keys to navigate, space to toggle, enter to confirm
Star, flag, or discard individual extractions
Batch operations: select all in a cluster, invert selection
Preview extractions in context before deciding
Your curated selections flow directly into Compose or Publish

Select tab with keyboard curation interface

Compose

This is where XTC goes from extraction tool to book-writing pipeline. AI drafts full book chapters from your curated selections. Hierarchical outline editor, side-by-side preview, and real-time generation.

AI drafts structured prose from your curated extractions
Hierarchical outline editor: chapters, sections, subsections
Import curated extracts from Select tab or add your own text files
Toggle which inputs to include per section
Claude generates structured prose in real time
Side-by-side preview pane: source material on the left, generated chapter on the right
Iterate: refine, regenerate sections, adjust tone

Compose tab with outline editor and side-by-side preview

Publish

Export your work in six formats, whether you need raw data for a pipeline, polished chapters for a publisher, or structured notes for your CMS.

Markdown (.md) for notes, blogs, and static sites
JSON (.json) for developer pipelines and CMS integration
Plain text (.txt) for universal compatibility
PDF (.pdf) for sharing and archival
Word (.docx) for publishers and collaborators
CSV (.csv) for spreadsheets and data analysis
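
For the developer-facing formats, the sketch below serializes the same curated blocks to JSON, CSV, and Markdown using only the Python standard library. The block schema (`speaker`, `start`, `text`) is a hypothetical example; the actual field names in XTC exports may differ.

```python
import csv
import io
import json

# Hypothetical block schema; field names are illustrative only.
blocks = [
    {"speaker": "Guest", "start": "00:12:05", "text": "Ship small, learn fast."},
    {"speaker": "Host", "start": "00:31:40", "text": 'She called it "the long game".'},
]


def to_json(rows):
    """Pretty-printed JSON for pipelines and CMS imports."""
    return json.dumps(rows, indent=2, ensure_ascii=False)


def to_csv(rows):
    """CSV with a header row; quotes in text are escaped by the csv module."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["speaker", "start", "text"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_markdown(rows):
    """One blockquote per block, followed by its attribution."""
    return "\n\n".join(f"> {r['text']}\n> ({r['speaker']}, {r['start']})" for r in rows)


print(to_csv(blocks))
```

Keeping one in-memory representation and one function per format means a new format only needs one more function.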

Prompt Library

25+ built-in extraction prompts across 5 categories. Write your own. Refine until it is right. Save as presets.

Quotes

Theme Quotes, Punchy/Short, Quote+Context, Actionable, Contrarian/Insight

Dialogues

Best Exchanges, Q&A Highlights, Conflict/Tension

Book English

Rewrite spoken content as polished, publishable prose

Remove Labels

Strip speaker labels and timecodes for clean output

User Presets

Your saved custom prompts, reusable across sessions

Custom

Write your own extraction logic. Iterate until output matches your needs.

Prompt refining

Run an extraction, review the output, adjust your prompt, run again. Iterate until the results are exactly what you need. No guessing.

Cost preview

See the estimated LLM cost before running any extraction. Strict fidelity toggle keeps extractions close to the original wording when accuracy matters most.
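
A cost preview can be approximated before any API call is made. The sketch below uses the common rule of thumb of roughly four characters per token; the model names and per-million-token rates in `RATES` are placeholders, not real provider pricing, and `estimate_cost` is a hypothetical helper rather than part of XTC.

```python
# Placeholder rates in USD per million tokens (input, output).
# These numbers are made up for illustration; check your provider's pricing.
RATES = {
    "small-model": (0.25, 1.25),
    "large-model": (3.00, 15.00),
}


def estimate_cost(transcript: str, prompt: str, model: str,
                  expected_output_tokens: int = 1000) -> float:
    """Rough estimate using the ~4 characters per token heuristic."""
    input_tokens = (len(transcript) + len(prompt)) / 4
    in_rate, out_rate = RATES[model]
    return (input_tokens * in_rate + expected_output_tokens * out_rate) / 1_000_000


transcript = "word " * 80_000  # about 400,000 characters of transcript text
for model in RATES:
    print(f"{model}: ${estimate_cost(transcript, 'Extract key quotes.', model):.4f}")
```

Real token counts depend on the provider's tokenizer, so an estimate like this is a ballpark figure only.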

AI Guide

Describe what you need in plain language. The AI Guide interviews you, understands your goal, and plans the entire pipeline: which sources to pull, which transcription engine to use, what extraction prompts to run, and how to organize the output.

Conversational workflow planner: "I'm writing a book from 40 podcast interviews"
AI asks clarifying questions, then builds a step-by-step pipeline
Recommends transcription engine, extraction prompts, and export format
One click to execute the entire plan, or adjust any step before running
Perfect for first-time users and complex multi-source projects

For beginners

No need to learn the 8-tab pipeline. Tell the Guide what you want, and it configures everything for you.

For power users

Skip the manual setup on complex projects. Describe the deliverable, review the plan, hit go.

Dual Extraction Modes

Two fundamentally different ways to process content. Preserve mode keeps exact words for books and legal work. Distill mode extracts ideas, themes, and insights for research and course creation.

Preserve Mode

Exact words. Faithful to the source. Every quote, every phrase, every nuance stays intact. Built for authors compiling books from spoken material, lawyers needing verbatim testimony, and journalists who need the real words.

Verbatim extraction with speaker attribution
Context markers so you know where each block came from
Strict fidelity scoring to flag any AI paraphrasing

Distill Mode

Extract the signal, not the noise. AI identifies core ideas, synthesizes themes across sources, and produces clean, structured insights. Built for researchers, course creators, and anyone turning hours of content into knowledge.

Idea extraction across multiple sources
Theme synthesis and pattern detection
Structured output: frameworks, key takeaways, action items

AI Curation

AI scores every extracted block from 1 to 5 and recommends Keep, Star, or Trash. It explains its reasoning so you stay in control. Style-aware: it curates differently for a legal brief than for a research paper.

Quality scoring: 1 (noise) to 5 (gold) on every block
Action suggestions: Keep, Star, or Trash with one-line reasoning
Style-aware curation profiles: legal, medical, research, creative, journalistic
Override any suggestion. The AI learns from your corrections.
Process hundreds of blocks in minutes instead of hours of manual review
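
The relationship between scores, suggestions, and overrides can be pictured with a small data model. The score cutoffs below are guesses for illustration (the text above does not say how a score maps to an action), and `Block`, `suggested_action`, and `final_action` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Block:
    text: str
    score: int                      # 1 (noise) to 5 (gold), assigned by the AI
    reason: str                     # one-line explanation from the AI
    override: Optional[str] = None  # the user's decision, if any


def suggested_action(score: int) -> str:
    """Map a score to an action. Assumed cutoffs: 5 = Star, 3-4 = Keep, 1-2 = Trash."""
    if not 1 <= score <= 5:
        raise ValueError("score must be between 1 and 5")
    if score == 5:
        return "Star"
    return "Keep" if score >= 3 else "Trash"


def final_action(block: Block) -> str:
    """The user's override always wins over the AI suggestion."""
    return block.override or suggested_action(block.score)


blocks = [
    Block("Defendant stated the date on record.", 5, "Verbatim, dated statement"),
    Block("Um, so, anyway.", 1, "Filler"),
    Block("A funny aside about lunch.", 2, "Off topic", override="Keep"),
]
for block in blocks:
    print(final_action(block), "-", block.text)
```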

Legal

Prioritizes verbatim statements, dates, names, and legally significant language. Flags hearsay and speculation.

Medical

Surfaces clinical findings, dosages, patient outcomes, and diagnostic reasoning. Filters small talk.

Research

Weights novel insights, data points, methodology, and citations. Deprioritizes repetition and filler.

Custom Prompt Engine

Tell the AI what you're building. It scans your content, understands the structure, and designs a precision extraction prompt tailored to your exact deliverable. No prompt engineering required.

Describe your end goal: "a 12-module online course" or "a legal brief with cited testimony"
AI analyzes your source material and generates a custom extraction prompt
Preview the prompt, adjust if needed, then run it across all your content
Save generated prompts as reusable presets for future projects
Works with all three LLM providers (Claude, OpenAI, Grok)

From goal to prompt in seconds

You describe the output. The engine reverse-engineers the extraction logic. No trial and error, no wasted API calls.

Content-aware

The engine reads a sample of your transcripts before generating the prompt. It adapts to your specific content, not generic templates.

Duplicate Detection

When you process 50+ sources, the same idea shows up in different words across different files. Duplicate Detection finds blocks saying the same thing, clusters them, and lets you pick the best version. Runs locally. Free.

Semantic similarity matching across all your sources
Groups near-duplicate blocks so you can pick the strongest version
Works across speakers, episodes, and file types
Runs on local embeddings: no API cost, no data leaving your machine
Saves hours of manual deduplication on large projects
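
One way to group near-duplicates is to link every pair of blocks whose embeddings exceed a similarity threshold and then collect the connected groups. The sketch below does this with a small union-find structure. The vectors are toy values standing in for local embeddings, and the 0.95 threshold is an arbitrary choice for the example.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def duplicate_groups(blocks, threshold=0.95):
    """Group blocks with near-identical vectors, linking matches transitively."""
    parent = list(range(len(blocks)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving keeps lookups short
            i = parent[i]
        return i

    for i, j in combinations(range(len(blocks)), 2):
        if cosine(blocks[i][1], blocks[j][1]) >= threshold:
            parent[find(j)] = find(i)  # merge the two groups

    groups = {}
    for i, (text, _) in enumerate(blocks):
        groups.setdefault(find(i), []).append(text)
    return [group for group in groups.values() if len(group) > 1]


# Toy vectors standing in for locally computed embeddings (an assumption).
blocks = [
    ("Consistency beats intensity.", (0.70, 0.70, 0.10)),
    ("Showing up daily matters more than going hard once.", (0.72, 0.68, 0.12)),
    ("Taxes are due in April.", (0.10, 0.00, 0.99)),
]
for group in duplicate_groups(blocks):
    print(group)
```

Only groups with two or more members are returned, so unique blocks never show up for review.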

Obsidian Export

One click sends your curated content straight to your Obsidian vault. Wikilinks, frontmatter, tags, and folder structure included. Your extractions become a connected knowledge graph, not a pile of files.

Direct export to any Obsidian vault folder
Auto-generated [[wikilinks]] between related blocks and sources
YAML frontmatter: source URL, speaker, date, topic tags
Tag generation based on content themes
Folder mapping: organize exports by topic, source, or project
Works with Obsidian's graph view for visual knowledge exploration
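
For readers unfamiliar with the Obsidian note format, the sketch below builds a Markdown note with YAML frontmatter and wikilinks. The frontmatter keys and the `obsidian_note` helper are illustrative assumptions; the exact fields XTC writes are not specified beyond the list above.

```python
def obsidian_note(text, source_url, speaker, date, tags, related):
    """Build a Markdown note with YAML frontmatter and wikilinks to related notes."""
    lines = [
        "---",
        f'source: "{source_url}"',
        f'speaker: "{speaker}"',  # assumes the name contains no double quotes
        f"date: {date}",
        "tags:",
        *[f"  - {tag}" for tag in tags],
        "---",
        "",
        text,
    ]
    if related:
        links = ", ".join(f"[[{name}]]" for name in related)
        lines += ["", "Related: " + links]
    return "\n".join(lines) + "\n"


note = obsidian_note(
    text="Ship small, learn fast.",
    source_url="https://example.com/episode-1",
    speaker="Guest",
    date="2024-01-15",
    tags=["shipping", "product"],
    related=["Episode 1", "Iteration"],
)
print(note)
```

Saving the returned string as a `.md` file inside a vault folder is enough for Obsidian to pick up the properties and links.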

Timecode Pointers

Toggle on source timestamps for every extracted block. Filmmakers and editors can jump straight to the exact moment in the original recording. Researchers can verify quotes against source material in seconds.

One toggle to show or hide timecodes on all extractions
Precise start and end timestamps per block
Click a timecode to jump to that moment in the waveform editor
Export with or without timecodes depending on your deliverable
Essential for filmmakers, podcast editors, legal transcription, and fact-checking
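
The timecode toggle amounts to formatting a block's start and end offsets and showing or hiding them on output. A minimal sketch, assuming offsets are stored in seconds; `timecode` and `render_block` are hypothetical helper names.

```python
def timecode(seconds: float) -> str:
    """Format a non-negative offset in seconds as HH:MM:SS."""
    total = int(seconds)  # fractional seconds are truncated
    hours, rest = divmod(total, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def render_block(text, start, end, show_timecodes=True):
    """Return the block text, optionally prefixed with its start and end timecodes."""
    if not show_timecodes:
        return text
    return f"[{timecode(start)} - {timecode(end)}] {text}"


print(render_block("That was the turning point.", 3725.4, 3731.0))
print(render_block("That was the turning point.", 3725.4, 3731.0, show_timecodes=False))
```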

Filmmakers

Find the exact take. Click the timecode, hear the moment, mark it for your edit.

Editors

Verify every quote against the original audio. No more scrubbing through hours of footage.

Researchers

Cite with precision. Every extracted insight links back to the exact moment it was spoken.

System requirements

macOS 13 (Ventura) or later.

Dependencies: Python 3.10+, Homebrew, conda. All auto-installed on first launch.
