Skip to main content

Check If Text Is Fake

Parse text for Cyrillic mimics and full-width characters. Sanitize datasets to prevent IDN homograph attacks while preserving layout. Validate integrity.

1
?
2

Please configure parameters and execute the action.

About Check If Text Is Fake


This tool scans your text for known homoglyph characters (e.g., Cyrillic/Greek lookalikes) and full-width characters commonly used to forge text.

Features


This tool provides the following features:

  • Homoglyph Detection - Detects common Cyrillic/Greek lookalike characters.
  • Full-width Detection - Detects full-width Latin and punctuation characters.
  • Detailed Report - Shows counts and the suspicious characters found.

Examples


  • Likely fake (homoglyphs)
    Input:
    Pаsswоrd rеsеt nоw
    
    Output (example):
    Suspicious: YES
    Homoglyph chars: 4
    Full-width chars: 0
  • Likely fake (full-width)
    Input:
    Hello, world!
    
    Output (example):
    Suspicious: YES
    Homoglyph chars: 0
    Full-width chars: 13
  • Looks normal
    Input:
    Hello, world!
    
    Output (example):
    Suspicious: NO
    Homoglyph chars: 0
    Full-width chars: 0

Real-World Usage Scenarios


  • Phishing-Link Verification - Identify deceptive links where attackers swap Latin characters for Cyrillic or Greek homoglyphs (e.g., using a Cyrillic 'а' instead of a Latin 'a') to mimic legitimate banking or login portals.
  • Business Email Compromise - BEC - Prevention - Scan sender display names and email body text in incoming invoices to detect subtle character substitutions often used in wire transfer fraud and executive impersonation.
  • Database Integrity - Data Sanitization - Filter out full-width Unicode characters from user inputs that can bypass legacy security filters or cause errors in SQL databases and form processing scripts.
  • Brand Reputation - Typosquatting Analysis - Check potential domain names or social media handles for look-alike characters to prevent attackers from registering spoofed versions of your official brand assets.

Frequently Asked Questions


What is a homoglyph attack?

It is a spoofing technique where an attacker uses visually identical characters from different alphabets (like Greek, Cyrillic, or Latin) to deceive users into visiting fake websites or trusting fraudulent emails.

Why are full-width characters flagged as suspicious?

Full-width characters are often used to bypass Web Application Firewalls (WAFs) or security filters that only look for standard ASCII characters. They can also break text formatting in professional software.

Can this tool detect all types of forged text?

This tool specifically targets character-level forgery involving homoglyphs and width-variant characters. It does not analyze AI-generated syntax or semantic misinformation.

Is the analyzed text stored on your servers?

No. The scanning process is performed locally in your browser. Your input text and the resulting detection report are never saved or transmitted to external servers.

Text Tools
Other tools you might like
Write Text in Cursive
Map Latin characters to Unicode cursive glyphs. The logic handles Mathematical Alphanumeric exceptions to ensure cross-platform compatibility and parsing.
Visualize Text Structure
Parse string architecture into vector graphics. Map tokens, whitespace, and punctuation to distinct hex layers. Export precise SVG schematics for analysis.
Unwrap Text Lines
Parse and sanitize string buffers by mapping hard breaks to custom separators. Employs paragraph-aware logic to maintain semantic data integrity.
Undo Zalgo Text Effect
Parse corrupted strings to strip non-spacing marks. Normalize Unicode input by removing recursive combining characters. Restore data integrity now.
Sort Symbols in Text
Parse and normalize character sequences via Unicode point values. Sanitize strings using skip lists, case logic, and duplicate removal for clean datasets.
Rotate Text
Shift characters cyclically across strings. Map offsets to reformat multiline structures with line-by-line logic. Normalize text for data schemas.
ROT47 Text
Shift printable ASCII characters by 47 positions to obfuscate sensitive strings. Implement symmetric mapping for range 33-126 to ensure data integrity.
ROT13 Text
Parse and shift alphabetic characters 13 positions. Maintain case sensitivity and non-letter integrity for spoiler protection or data obfuscation.
Rewrite Text
Sanitize datasets with custom mapping and whole-word logic. Apply recursive double-pass processing to clean whitespace. Normalize your data structure.
Replace Words with Digits
Normalize datasets by mapping verbal numbers to digits. Sanitize text with case-sensitive matching and whole-word logic for secure data ingestion.
Replace Text Vowels
Map specific vowel patterns using custom substitution logic. Supports case-sensitive matching and secondary passes to sanitize or obfuscate string data.
Replace Text Spaces
Normalize datasets by converting tabs, newlines, and spaces into custom symbols. Collapse whitespace clusters to ensure strict character counts.
Replace Text Letters
Normalize strings using custom character rules. Execute case-sensitive matching and recursive replacement passes to ensure data integrity. Export clean results.
Replace Text Consonants
Map consonants to custom characters using iterative substitution rules. Sanitize strings with case-sensitive precision for technical datasets and linguistics.
Replace Line Breaks in Text
Sanitize raw data by mapping CRLF sequences to custom delimiters. Collapse repeated breaks and trim whitespace to ensure valid dataset parsing.
Replace Digits with Words
Map numeric sequences to cardinal words. Parse standalone digits or specific patterns. Optimized for TTS data prep and document sanitization logic.
Replace Commas in Text
Parse and reformat datasets by mapping commas to custom symbols. Logic-aware processing preserves numeric separators while collapsing redundant clusters.
Remove Text Letters
Parse raw strings to eliminate specific character sets. This utility handles case-sensitive matching and collapses redundant whitespace for clean datasets.
Remove Text Font
Sanitize stylized Unicode glyphs into standard Latin script. Parse decorative fonts for screen reader accessibility and database safety [UTF-8].
Remove Quotes from Words
Strip leading and trailing quotation marks from individual words. Recursive logic handles nested delimiters in SQL, JSON, and CSV datasets efficiently.