Break Ciphers with Statistical Analysis
Frequency analysis is one of the most powerful techniques in cryptanalysis. By examining how often letters, pairs (bigrams), and triplets (trigrams) appear in encrypted text, you can break substitution ciphers and understand text patterns.
In English, the letter E appears most frequently (~12.7%), followed by T (~9.1%), A (~8.2%), and so on. Common bigrams include TH, HE, IN, ER, while common trigrams include THE, AND, ING, HER.
Every language has predictable patterns in how often letters appear. In English, some letters are far more common than others. By counting letter frequencies in encrypted text, we can make educated guesses about substitutions.
Looking at pairs and triplets of letters provides even more powerful analysis:
To break a substitution cipher with frequency analysis:
Frequency analysis works best on longer texts (100+ characters). Short messages may not have representative frequencies. It's also ineffective against polyalphabetic ciphers (like Vigenรจre) that use multiple substitution alphabets.
Generally, at least 100 characters are needed for basic patterns, but 500+ characters provide much more reliable results. The longer the text, the more closely it will match expected frequency distributions.
Frequency analysis is effective against monoalphabetic substitution ciphers (where each letter always maps to the same letter). It doesn't work well on:
Bigrams are two-letter combinations (TH, HE, IN), and trigrams are three-letter combinations (THE, AND, ING). These patterns are even more distinctive than single letters and can help confirm substitution guesses.
Yes! Every language has its own frequency distribution. However, this tool compares results to English frequencies. For other languages, you'd need to compare against that language's expected patterns.
No. All analysis happens in your browser. Your text never leaves your computer, ensuring complete privacy. This tool works entirely client-side using JavaScript.