How to use
Overview
URL Extractor pulls links out of unstructured text—emails, newsletters, copied web pages, HTML source, or log snippets. You do not need one URL per line.
Processing includes automatic deduplication, trailing punctuation cleanup, and optional removal of common tracking parameters. Filter by keywords or hostname, view domain counts, then copy or export. Everything runs locally in your browser with no server upload.
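The extraction, deduplication, and trailing punctuation cleanup described above can be sketched as follows. This is only an illustration, not the tool's actual code: the pattern `URL_RE`, the `TRAILING` character set, and the function name `extract_urls` are all assumptions.

```python
import re

# Assumed pattern: an http(s) URL runs until whitespace, a quote, or an angle bracket.
URL_RE = re.compile(r"https?://[^\s<>\"']+", re.IGNORECASE)

# Assumed set of punctuation that commonly trails a URL in running text.
TRAILING = ".,;:!?)]}"


def extract_urls(text):
    """Return unique URLs found in free-form text, in first-seen order."""
    seen = set()
    result = []
    for match in URL_RE.finditer(text):
        # Drop punctuation that belongs to the sentence, not the link.
        url = match.group(0).rstrip(TRAILING)
        if url and url not in seen:
            seen.add(url)
            result.append(url)
    return result


sample = 'See https://example.com/a, and <a href="https://example.com/b">b</a>. Again: https://example.com/a.'
print(extract_urls(sample))
```

Stripping a closing parenthesis is a simplification: it would also trim URLs that legitimately end in `)`, so a real extractor may need a more careful rule.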
Good for
- Collecting links from email threads or marketing messages
- Extracting URLs from pasted HTML or page source
- Cleaning shared links before analysis or bookmarking
- Getting a hostname list for SEO or competitor review
- Exporting to a spreadsheet or another tool
For a list that already has one URL per line, use Text Deduplicator.
Steps
- Paste into Input or Upload a .txt / .csv file (try Sample for a demo)
- Review Extracted URLs and the stats below the panels
- Use Domain breakdown (top 50 domains by count) and click a domain to filter by it
- Adjust output mode, sort, cleaning options, and filters
- Copy the results, or Download them as TXT or CSV. Clear text removes only the input; Reset also restores the default options
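The Domain breakdown in the steps above amounts to counting hostnames and keeping the most frequent ones. A minimal sketch, assuming a hypothetical helper named `domain_breakdown` (the tool's own implementation is not shown in this guide):

```python
from collections import Counter
from urllib.parse import urlsplit


def domain_breakdown(urls, top=50):
    """Count URLs per hostname and return the `top` most frequent hosts."""
    counts = Counter()
    for url in urls:
        host = urlsplit(url).hostname  # lowercased by urlsplit; None if absent
        if host:
            counts[host] += 1
    return counts.most_common(top)


urls = ["https://a.example/1", "https://b.example/", "https://a.example/2"]
print(domain_breakdown(urls))
```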
Options
| Option | What it does |
|---|---|
| Output: Full URL / Domain only | One full link per line, or hostname only |
| Sort: first seen / A–Z / Z–A | Order of the result list |
| Remove tracking params | Strip utm_*, fbclid, gclid, and similar query keys (default on) |
| Remove protocol | Drop https:// from displayed/exported URLs |
| Remove trailing slash | Remove the `/` at the end of a path (e.g. `/blog/` → `/blog`). Bare-domain or root URLs such as `https://example.com/` usually look the same with or without this option. |
| Include / Exclude keywords | Keep or drop URLs containing any listed substring (OR) |
| Domain filter | Keep matching hostnames only (OR; includes subdomains) |
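To make the options in the table concrete, here is a rough sketch of how the cleaning options and filters could behave. The names `clean_url`, `matches_filters`, and `TRACKING_KEYS` are hypothetical, the tracking key list is only the subset named in the table, and case-insensitive matching is an assumption.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed subset of tracking keys; the tool's full list is not documented here.
TRACKING_KEYS = {"fbclid", "gclid"}


def is_tracking(key):
    key = key.lower()
    return key.startswith("utm_") or key in TRACKING_KEYS


def clean_url(url, strip_tracking=True, strip_protocol=False, strip_trailing_slash=False):
    """Apply the cleaning options to a single absolute URL."""
    parts = urlsplit(url)
    query = parts.query
    if strip_tracking:
        pairs = parse_qsl(query, keep_blank_values=True)
        query = urlencode([(k, v) for k, v in pairs if not is_tracking(k)])
    path = parts.path
    # Assumption: a root path "/" is left alone; only longer paths lose the slash.
    if strip_trailing_slash and len(path) > 1 and path.endswith("/"):
        path = path[:-1]
    cleaned = urlunsplit((parts.scheme, parts.netloc, path, query, parts.fragment))
    if strip_protocol and parts.scheme:
        cleaned = cleaned[len(parts.scheme) + 3:]  # drop "scheme://"
    return cleaned


def matches_filters(url, include=(), exclude=(), domains=()):
    """Keyword filters use OR semantics; the domain filter also accepts subdomains."""
    lowered = url.lower()
    if include and not any(k.lower() in lowered for k in include):
        return False
    if any(k.lower() in lowered for k in exclude):
        return False
    if domains:
        host = urlsplit(url).hostname or ""
        wanted = [d.lower() for d in domains]
        if not any(host == d or host.endswith("." + d) for d in wanted):
            return False
    return True


print(clean_url("https://example.com/blog/?utm_source=news&ref=home&fbclid=abc",
                strip_trailing_slash=True))
print(matches_filters("https://blog.example.com/post", domains=["example.com"]))
```

In this sketch the filters run on the full URL before the protocol is removed, because `urlsplit` needs the scheme to find the hostname. Rebuilding the query with `urlencode` may also re-encode some characters, which a real implementation might want to avoid.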
Export
| Method | Contents |
|---|---|
| Copy / TXT | One result per line (URL or domain, per output mode) |
| CSV | url, hostname, protocol columns with header (UTF-8) |
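The CSV layout in the table can be reproduced with a few lines. This is a sketch under assumptions: the function name `to_csv` is hypothetical, and the exact form of the protocol value (`https` here, without a trailing colon) is a guess, not something this guide specifies.

```python
import csv
import io
from urllib.parse import urlsplit


def to_csv(urls):
    """Build CSV text with a header row and url, hostname, protocol columns."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["url", "hostname", "protocol"])
    for url in urls:
        parts = urlsplit(url)
        writer.writerow([url, parts.hostname or "", parts.scheme])
    return buffer.getvalue()


csv_text = to_csv(["https://example.com/a"])
csv_bytes = csv_text.encode("utf-8")  # UTF-8, as stated in the table
print(csv_text)
```

The `csv` module quotes any field containing a comma or quote, so URLs with commas in the query string stay in a single column.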
Limits and tips
- Up to 500,000 characters per run; uploaded files must be 2 MB or smaller and must decode to text within the character limit
- Pasted HTML is scanned as text—not rendered in a browser
- PDF and Word files are not supported; copy the text and paste instead
- Path-only links such as `/api/user` are not extracted
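The size limits above can be pictured as two checks, one on the raw file and one on the decoded text. The sketch below is illustrative only: `load_upload` is a hypothetical name, treating 2 MB as 2 × 1024 × 1024 bytes is an assumption, and so is decoding as UTF-8.

```python
MAX_CHARS = 500_000
MAX_UPLOAD_BYTES = 2 * 1024 * 1024  # assumption: "2 MB" read as 2 MiB


def load_upload(data: bytes) -> str:
    """Validate an uploaded file against the size limits and return its text."""
    if len(data) > MAX_UPLOAD_BYTES:
        raise ValueError("file is larger than 2 MB")
    # Assumption: files are decoded as UTF-8, replacing undecodable bytes.
    text = data.decode("utf-8", errors="replace")
    if len(text) > MAX_CHARS:
        raise ValueError("decoded text exceeds 500,000 characters")
    return text


print(len(load_upload(b"https://example.com/page")))
```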
Extraction is format-level only: it does not guarantee that every link is live or safe to open.