String Similarity Calculator

Compute Levenshtein edit distance and similarity percentage between two strings. Find the minimum edits (insertions, deletions, substitutions) needed to transform one string into another.

String A

6 chars

String B

7 chars

Similarity Result

Edit Distance

operations

Similarity

57.14%

Partly similar

Character Diff

insertdeletesubstituteunchanged

sitting

1 insertions2 substitutions

How to Use String Similarity Calculator

1Enter two strings to compare in the input fields.
2View the Levenshtein edit distance and similarity percentage.
3See the character-level diff and list of operations (insert, delete, substitute).

ZenovayAnalytics

Analytics built for founders.

Real-time visitor tracking
Privacy-first, no cookie banner
Set up in two minutes

Explore Zenovay

Related Tools

Word Counter

Count words, characters, sentences, and paragraphs. Estimate reading and speaking time.

Character Counter

Count characters with and without spaces. Track limits for Twitter, meta descriptions, and more.

Case Converter

Convert text between UPPERCASE, lowercase, Title Case, camelCase, snake_case, and kebab-case.

Lorem Ipsum Generator

Generate placeholder text in paragraphs, sentences, or words. Copy with one click.

Frequently Asked Questions

What is Levenshtein distance?▾

Levenshtein distance (edit distance) is the minimum number of single-character edits needed to transform one string into another. The three allowed operations are: insertion (add a character), deletion (remove a character), substitution (replace one character with another). Example: "kitten" → "sitting" requires 3 operations: k→s, e→i, +g. Named after Vladimir Levenshtein who described this algorithm in 1965. Applications: spell checking, DNA sequence alignment, natural language processing, fuzzy string matching.

How is similarity percentage calculated?▾

Similarity % = (1 − editDistance / max(len1, len2)) × 100. This normalizes the distance relative to the longest string. Example: distance = 2, max length = 8 → similarity = (1 − 2/8) × 100 = 75%. A distance of 0 means identical strings (100% similar). A distance equal to max length means completely different strings (0% similar). Other similarity metrics include Jaro-Winkler (weights initial character agreement more highly) and cosine similarity (for token-based comparison).

What is the Damerau-Levenshtein distance?▾

Damerau-Levenshtein extends Levenshtein by adding a fourth operation: transposition (swap two adjacent characters). Example: "ab" → "ba" has Levenshtein distance 2 (delete + insert) but Damerau-Levenshtein distance 1 (one transposition). This is important for spell-checking since many typos are transpositions ("teh" → "the"). There are two variants: restricted (adjacent transpositions only) and unrestricted (any transpositions). This tool uses standard Levenshtein (no transpositions).

What are common applications of edit distance?▾

Spell checkers: suggest words with low edit distance from the misspelled word. DNA sequencing: align nucleotide sequences to find mutations and insertions. Version control: git diff uses edit distance algorithms to show line-level changes. Natural language processing: named entity disambiguation, coreference resolution. Fraud detection: detect name variations (typosquatting, identity fraud). Record linkage: match records across databases when names/addresses vary slightly. OCR correction: fix optical character recognition errors.

How does the Wagner-Fischer algorithm work?▾

The Wagner-Fischer algorithm computes Levenshtein distance using dynamic programming with time O(m×n) and space O(m×n) where m,n are string lengths. It builds a matrix where cell [i][j] = edit distance between the first i characters of string 1 and first j characters of string 2. Base cases: row 0 = 0..n (delete all chars of s1), col 0 = 0..m (insert all chars of s2). Recurrence: if s1[i] == s2[j], cell = previous diagonal; otherwise cell = 1 + min(left, above, diagonal). Space can be optimized to O(min(m,n)) using two rows.