A blazing-fast, TypeScript-native syntax highlighter with comprehensive grammar support for modern web languages. Built for performance with both synchronous and asynchronous tokenization modes.
- ⚡ Blazing Fast - Highly optimized tokenization with async and sync modes for maximum performance
- 🎨 48 Languages - Comprehensive support for web, system, and specialized languages
- 🔥 Modern Syntax - Full support for ES2024+, BigInt, numeric separators, optional chaining, and more
- ⚛️ JSX/TSX Support - Complete React and TypeScript JSX highlighting
- 🎯 CSS4 Features - Modern color functions (hwb, lab, lch, oklab, oklch), container queries, CSS layers
- 💪 TypeScript-First - Fully typed APIs with comprehensive type definitions
- 📦 Zero Dependencies - Lightweight with no external runtime dependencies
npm install ts-syntax-highlighter
# or
bun add ts-syntax-highlighter
# or
pnpm add ts-syntax-highlighterimport { Tokenizer } from 'ts-syntax-highlighter'
// Create tokenizer instance
const tokenizer = new Tokenizer('javascript')
// Tokenize code (async mode - faster)
const tokens = await tokenizer.tokenizeAsync(`
const greeting = 'Hello World'
console.log(greeting)
`)
// Or use sync mode
const syncTokens = tokenizer.tokenize(`
function add(a: number, b: number): number {
  return a + b
}
`)ts-syntax-highlighter supports 48 languages across web development, systems programming, data formats, and specialized domains:
- JavaScript/JSX - ES2024+, async/await, template literals, JSX
- TypeScript/TSX - Type annotations, generics, interfaces, TSX
- HTML - HTML5, data/aria attributes, event handlers
- CSS - Modern features, custom properties, CSS4 functions
- SCSS/Sass - Variables, nesting, mixins, functions
- Vue - Single-file components, directives, templates
- GraphQL - Queries, mutations, fragments, directives
- Markdown - Headings, emphasis, code blocks, tables, links
- XML - Elements, attributes, CDATA, namespaces
- LaTeX - Commands, environments, math mode
- C - Preprocessor, pointers, functions
- C++ - Classes, templates, namespaces, modern C++
- Rust - Ownership, traits, macros, lifetimes
- Go - Goroutines, interfaces, defer
- C# - LINQ, async/await, properties, attributes
- Java - Classes, generics, annotations
- Swift - Optionals, protocols, closures
- Kotlin - Null safety, coroutines, data classes
- Dart - Async, mixins, null safety
- Python - Functions, classes, decorators, f-strings
- Ruby - Blocks, symbols, string interpolation
- PHP - Variables, functions, classes, HTML embedding
- Lua - Tables, functions, metatables
- R - Vectors, data frames, statistical functions
- JSON - Objects, arrays, proper escaping
- JSONC - JSON with comments
- JSON5 - Unquoted keys, trailing commas, comments
- YAML - Keys, nested structures, anchors
- TOML - Tables, arrays, data types
- CSV - Comma-separated values
- IDL - Interface definitions
- Protobuf - Messages, services, enums
- Bash/Shell - Variables, pipes, functions, control flow
- PowerShell - Cmdlets, pipelines, parameters
- Dockerfile - Instructions, multi-stage builds
- Makefile - Targets, variables, directives
- Terraform/HCL - Resources, variables, interpolation
- Nginx - Directives, server blocks, locations
- CMD/Batch - Commands, variables, control flow
- SQL - Queries, DDL, DML, functions
- Diff/Patch - File headers, hunks, added/removed lines
- RegExp - Character classes, groups, quantifiers
- BNF - Grammar notation
- ABNF - Augmented BNF
- Solidity - Smart contracts, events, modifiers
- Log - Timestamps, log levels, stack traces
- Text/Plain - Plain text (no highlighting)
- Blade-like templating syntax
- 50+ directives
- Components, layouts, includes
- Control flow, loops
- Authentication, authorization
ts-syntax-highlighter is built for speed with highly optimized tokenization algorithms:
| Operation | Fast Mode (Async) | Sync Mode | 
|---|---|---|
| JavaScript tokenization | ~0.05ms | ~0.08ms | 
| TypeScript tokenization | ~0.08ms | ~0.12ms | 
| HTML tokenization | ~0.04ms | ~0.06ms | 
| CSS tokenization | ~0.03ms | ~0.05ms | 
- Fast Mode: Async tokenization with worker-like performance characteristics
- Sync Mode: Synchronous processing for simpler integration
- Optimized Patterns: Pattern matching ordered by frequency
- Pre-compiled Regex: All patterns compiled and cached
- Minimal Backtracking: Patterns designed for efficiency
- Memory Efficient: ~3x source code size in memory
When compared to popular syntax highlighters:
| Library | JavaScript | TypeScript | HTML | CSS | 
|---|---|---|---|---|
| ts-syntax-highlighter (Fast) | 0.05ms | 0.08ms | 0.04ms | 0.03ms | 
| highlight.js | 3.8ms | 1.0ms | 1.2ms | 0.9ms | 
| Prism.js | 2.1ms | 0.6ms | 0.8ms | 0.5ms | 
Run benchmarks yourself:
bun run benchimport { Tokenizer } from 'ts-syntax-highlighter'
// Create tokenizer for a specific language (48 languages supported)
const tokenizer = new Tokenizer('javascript') // or any of the 48 supported languages
// Async tokenization (faster, recommended)
const tokens = await tokenizer.tokenizeAsync(code: string)
// Sync tokenization
const tokens = tokenizer.tokenize(code: string)
// Supported languages:
// 'javascript', 'typescript', 'html', 'css', 'json', 'stx',
// 'bash', 'markdown', 'yaml', 'jsonc', 'diff', 'python',
// 'php', 'java', 'c', 'cpp', 'rust', 'csharp', 'dockerfile',
// 'ruby', 'go', 'sql', 'idl', 'text', 'json5', 'vue', 'toml',
// 'scss', 'kotlin', 'swift', 'dart', 'r', 'graphql', 'powershell',
// 'makefile', 'terraform', 'bnf', 'regexp', 'lua', 'cmd', 'abnf',
// 'csv', 'log', 'nginx', 'xml', 'protobuf', 'solidity', 'latex'import { getLanguage, getLanguageByExtension } from 'ts-syntax-highlighter'
// Get language by ID or alias
const lang = getLanguage('js') // Returns JavaScript language
const langTs = getLanguage('tsx') // Returns TypeScript language
// Get language by file extension
const langFromExt = getLanguageByExtension('.jsx') // Returns JavaScript languageinterface Token {
  type: string // Token scope (e.g., 'keyword.control.js', 'string.quoted.double.ts')
  content: string // The actual text content
  line: number // Line number (0-indexed)
  startIndex: number // Character position in the line
}
interface LineTokens {
  line: number
  tokens: Token[]
}# Install dependencies
bun install
# Build the library
bun run build
# Run tests
bun test
# Run benchmarks
bun run bench
# Type checking
bun run typecheck
# Linting
bun run lintPlease see our releases page for more information on what has changed recently.
Please see CONTRIBUTING for details.
For help, discussion about best practices, or any other conversation that would benefit from being searchable:
For casual chit-chat with others using this package:
Join the Stacks Discord Server
“Software that is free, but hopes for a postcard.” We love receiving postcards from around the world showing where Stacks is being used! We showcase them on our website too.
Our address: Stacks.js, 12665 Village Ln #2306, Playa Vista, CA 90094, United States 🌎
We would like to extend our thanks to the following sponsors for funding Stacks development. If you are interested in becoming a sponsor, please reach out to us.
The MIT License (MIT). Please see LICENSE for more information.
Made with 💙
