Textalyzer

Analyze key metrics such as word count, readability, and complexity for any kind of text.

CLI	Web

Usage

# Word frequency histogram
textalyzer histogram <filepath>

# Find duplicated code blocks (default: minimum 3 non-empty lines)
textalyzer duplication <path> [<additional paths...>]

# Find duplications with at least 5 non-empty lines
textalyzer duplication --min-lines=5 <path> [<additional paths...>]

# Include single-line duplications
textalyzer duplication --min-lines=1 <path> [<additional paths...>]
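
To illustrate what a word frequency histogram involves, here is a minimal sketch in Rust. It is not Textalyzer's actual implementation: the function name `word_frequencies`, the lowercasing, and the rule that a word is a run of alphanumeric characters are all assumptions made for this example.

```rust
use std::collections::HashMap;

/// Counts how often each word occurs, ignoring case.
/// Assumption for this sketch: a "word" is a maximal run of
/// alphanumeric characters; everything else is a separator.
fn word_frequencies(text: &str) -> Vec<(String, usize)> {
    let mut counts: HashMap<String, usize> = HashMap::new();

    for word in text
        .split(|c: char| !c.is_alphanumeric())
        .filter(|w| !w.is_empty())
    {
        *counts.entry(word.to_lowercase()).or_insert(0) += 1;
    }

    let mut sorted: Vec<(String, usize)> = counts.into_iter().collect();
    // Most frequent first; ties are broken alphabetically for stable output.
    sorted.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    sorted
}

fn main() {
    let text = "War is peace. Freedom is slavery. Ignorance is strength.";
    // Print one bar of '#' characters per word.
    for (word, count) in word_frequencies(text) {
        println!("{:>10} {}", word, "#".repeat(count));
    }
}
```

Sorting by descending count with an alphabetical tie-break keeps the output deterministic, which a plain `HashMap` iteration would not.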

The duplication command analyzes files for duplicated text blocks. It can:

Analyze multiple files or recursively scan directories
Filter duplications based on minimum number of non-empty lines with --min-lines=N (default: 2)
Detect single-line duplications when using --min-lines=1
Rank duplications by number of consecutive lines
Show all occurrences with file and line references
Use multithreaded processing across all available CPU cores
Use memory mapping for efficient processing of large files with minimal memory overhead
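
The core idea behind the capabilities listed above can be sketched as a sliding window over the non-empty lines of each file. The Rust example below is a simplified illustration under stated assumptions, not the tool's real code: `find_duplicates` and `Occurrence` are hypothetical names, lines are compared after trimming whitespace, only windows of exactly `min_lines` lines are reported, and multithreading and memory mapping are left out.

```rust
use std::collections::HashMap;

/// One place where a duplicated block starts (hypothetical type).
#[derive(Debug, Clone, PartialEq, Eq)]
struct Occurrence {
    file: String,
    line: usize, // 1-based line number of the block's first line
}

/// Finds every window of `min_lines` consecutive non-empty lines that
/// appears more than once across `files` (pairs of name and content).
fn find_duplicates(
    files: &[(&str, &str)],
    min_lines: usize,
) -> Vec<(Vec<String>, Vec<Occurrence>)> {
    // `windows(0)` would panic, so treat 0 as "nothing to report".
    if min_lines == 0 {
        return Vec::new();
    }

    let mut seen: HashMap<Vec<String>, Vec<Occurrence>> = HashMap::new();

    for &(name, content) in files {
        // Keep only non-empty lines, remembering their original numbers.
        let lines: Vec<(usize, String)> = content
            .lines()
            .enumerate()
            .map(|(i, l)| (i + 1, l.trim().to_string()))
            .filter(|(_, l)| !l.is_empty())
            .collect();

        for window in lines.windows(min_lines) {
            let key: Vec<String> = window.iter().map(|(_, l)| l.clone()).collect();
            seen.entry(key).or_default().push(Occurrence {
                file: name.to_string(),
                line: window[0].0,
            });
        }
    }

    // A block is a duplication only if it occurs at least twice.
    let mut dups: Vec<_> = seen.into_iter().filter(|(_, o)| o.len() > 1).collect();
    // Order by first occurrence so the result is deterministic.
    dups.sort_by_key(|d| (d.1[0].file.clone(), d.1[0].line));
    dups
}

fn main() {
    let a = "fn one() {}\n\nlet x = 1;\nlet y = 2;\n";
    let b = "let x = 1;\n\nlet y = 2;\nfn two() {}\n";

    for (block, occurrences) in find_duplicates(&[("a.txt", a), ("b.txt", b)], 2) {
        println!("{} duplicated lines:", block.len());
        for o in &occurrences {
            println!("  {}:{}", o.file, o.line);
        }
    }
}
```

Because empty lines are dropped before windowing, the two `let` lines match even though a blank line separates them in the second input. Ranking by the number of consecutive lines would additionally require merging overlapping windows into longer blocks, which this sketch does not attempt.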

Rewrite in Rust

This CLI tool was originally written in JavaScript and was later rewritten in Rust to improve performance.

Before:

hyperfine --warmup 3 'time ./cli/index.js examples/1984.txt'
Benchmark #1: time ./cli/index.js examples/1984.txt
  Time (mean ± σ):     390.3 ms ±  15.6 ms    [User: 402.6 ms, System: 63.5 ms]
  Range (min … max):   366.7 ms … 425.7 ms

After:

hyperfine --warmup 3 'textalyzer histogram examples/1984.txt'
Benchmark #1: textalyzer histogram examples/1984.txt
  Time (mean ± σ):      40.4 ms ±   2.5 ms    [User: 36.0 ms, System: 2.7 ms]
  Range (min … max):    36.9 ms …  48.7 ms

Pretty impressive: roughly a 10x performance improvement (mean runtime down from 390.3 ms to 40.4 ms)! 😁
