Unicode Support & Internationalization

Modern applications serve global audiences with diverse languages, scripts, and cultural conventions. Geode provides comprehensive Unicode support and internationalization (i18n) capabilities, enabling you to build truly global graph applications that handle multilingual content, complex scripts, and locale-specific operations with ease and correctness. As an ISO/IEC 39075:2024 GQL-compliant graph database, Geode uses UTF-8 encoding throughout, supporting the full Unicode 15.0 character set including emoji, mathematical symbols, ancient scripts, and all modern languages. From right-to-left text to combining diacritical marks, Geode handles the complexities of international text processing transparently and efficiently.
<h3 id="unicode-fundamentals-in-geode" class="position-relative d-flex align-items-center group"> Unicode Fundamentals in Geode </h3>
UTF-8 Encoding: Geode stores all text using UTF-8, the universal character encoding that represents every character in the Unicode standard. UTF-8 is backward-compatible with ASCII, efficient for most languages, and widely supported across platforms and programming languages.
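To make the byte-level behaviour of UTF-8 concrete, here is a small client-side sketch in Python. It is not Geode code and does not call any Geode API; the helper name `utf8_profile` is made up for this illustration, and it relies only on Python's built-in `str.encode`. It reports how many code points and how many UTF-8 bytes a string occupies.

```python
def utf8_profile(text: str) -> dict:
    """Return code-point count, UTF-8 byte count, and per-character byte widths.

    Hypothetical helper for illustration only; not part of any Geode client.
    """
    widths = [len(ch.encode("utf-8")) for ch in text]
    return {
        "code_points": len(text),                # Python str length counts code points
        "utf8_bytes": len(text.encode("utf-8")), # size of the encoded form
        "widths": widths,                        # bytes used by each code point
    }


if __name__ == "__main__":
    # Sample values taken from the multilingual user example on this page
    for sample in ["Alice", "Алиса", "アリス", "爱丽丝", "👋🌍🎉"]:
        print(sample, utf8_profile(sample))
```

In this sample, plain ASCII letters take one byte each, Cyrillic letters two, katakana and the Han characters three, and the emoji four, which is why code-point length and byte length should not be treated as interchangeable.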
<div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- All these characters work seamlessly
CREATE (u:User {
  name_en: 'Alice',
  name_ja: 'アリス',
  name_ar: 'أليس',
  name_ru: 'Алиса',
  name_zh: '爱丽丝',
  greeting_emoji: '👋🌍🎉'
});

-- Unicode escapes
CREATE (t:Text {
  content: '\u0048\u0065\u006C\u006C\u006F'  -- "Hello"
});
</code></pre></div>Character Properties: Geode correctly handles Unicode character properties including: <ul>
<li>Case Mapping: Upper, lower, and title case transformations</li>
<li>Character Categories: Letters, numbers, punctuation, symbols, separators</li>
<li>Script Detection: Latin, Cyrillic, Arabic, Han, etc.</li>
<li>Directionality: Left-to-right (LTR) and right-to-left (RTL) text</li>
<li>Combining Characters: Diacritics, accents, and modifiers</li>
</ul>
<h3 id="multilingual-text-storage" class="position-relative d-flex align-items-center group"> Multilingual Text Storage </h3>Storing Multiple Languages: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Product with multilingual descriptions
CREATE (p:Product {
  sku: 'LAPTOP-001',
  name_en: 'Professional Laptop',
  name_es: 'Portátil Profesional',
  name_fr: 'Ordinateur Portable Professionnel',
  name_de: 'Professioneller Laptop',
  name_ja: 'プロフェッショナル ラップトップ',
  name_zh: '专业笔记本电脑'
});

-- Content with mixed scripts
CREATE (doc:Document {
  title: 'International Meeting Notes',
  content: 'Discussion about データベース (database) and قاعدة البيانات'
});
</code></pre></div>Language-Specific Properties: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Query by language
MATCH (p:Product)
RETURN
  p.name_en, p.name_es, p.name_fr;

-- Dynamic language selection
MATCH (p:Product)
WITH p, 'es' AS lang
RETURN p.sku,
  CASE lang
    WHEN 'en' THEN p.name_en
    WHEN 'es' THEN p.name_es
    WHEN 'fr' THEN p.name_fr
    ELSE p.name_en
  END AS localized_name;
</code></pre></div>
<h3 id="character-normalization" class="position-relative d-flex align-items-center group"> Character Normalization </h3>Unicode defines multiple ways to represent the same character (e.g., “é” can be the single code point U+00E9 or “e” followed by the combining acute accent U+0301). The two forms look identical but compare as different strings unless they are normalized first. Normalization ensures a consistent representation. Normalization Forms: <ul>
<li>NFC (Canonical Composition): Decomposes canonically, then recombines base characters and combining marks into precomposed characters where they exist</li>
<li>NFD (Canonical Decomposition): Splits precomposed characters into a base character followed by its combining marks</li>
<li>NFKC (Compatibility Composition): Like NFC, but also applies compatibility mappings (for example, the ligature “ﬁ” becomes “fi”), which can discard formatting distinctions</li>
<li>NFKD (Compatibility Decomposition): Like NFD, but also applies compatibility mappings</li>
</ul> <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Normalize text to NFC (recommended for most use cases)
MATCH (u:User)
SET u.name = NORMALIZE(u.name, 'NFC');

-- Compare normalized text
MATCH (u:User)
WHERE NORMALIZE(u.name, 'NFC') = NORMALIZE('José', 'NFC')
RETURN u;

-- Search with normalization
MATCH (u:User)
WHERE NORMALIZE(LOWER(u.name), 'NFC') CONTAINS NORMALIZE(LOWER('josé'), 'NFC')
RETURN u.name;
</code></pre></div>When to Normalize: <ul>
<li>Data Ingestion: Normalize on insert to ensure consistency</li>
<li>Comparison: Normalize before comparing user input with stored data</li>
<li>Indexing: Create indexes on normalized values for consistent matching</li>
<li>Search: Normalize search terms and content
for accurate matching</li> </ul>
<h3 id="collation-and-sorting" class="position-relative d-flex align-items-center group"> Collation and Sorting </h3>Collation determines how text is sorted and compared, respecting language-specific rules: Default Collation: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Default UTF-8 binary collation (byte-order sorting, so 'Z' sorts before 'a')
MATCH (u:User)
RETURN u.name
ORDER BY u.name;
</code></pre></div>Locale-Specific Collation: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- English collation
MATCH (u:User)
RETURN u.name
ORDER BY u.name COLLATE 'en_US';

-- Spanish collation (ñ sorted after n)
MATCH (u:User)
RETURN u.name
ORDER BY u.name COLLATE 'es_ES';

-- German collation (ä, ö, ü sort alongside a, o, u)
MATCH (u:User)
RETURN u.name
ORDER BY u.name COLLATE 'de_DE';

-- Case-insensitive collation (CI = Case Insensitive)
MATCH (u:User)
RETURN u.name
ORDER BY u.name COLLATE 'en_US_CI';
</code></pre></div>Supported Locales: Geode supports 100+ locales including: <ul>
<li>Western European: en_US, es_ES, fr_FR, de_DE, it_IT, pt_BR</li>
<li>Nordic: sv_SE, no_NO, da_DK, fi_FI</li>
<li>Eastern European: pl_PL, cs_CZ, ru_RU, uk_UA</li>
<li>Asian: zh_CN, ja_JP, ko_KR, th_TH, vi_VN</li>
<li>Middle Eastern: ar_SA, he_IL, fa_IR, tr_TR</li>
</ul>
<h3 id="case-conversion" class="position-relative d-flex align-items-center group"> Case Conversion </h3>Unicode-aware case conversion respects language-specific rules: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Standard case conversion
MATCH (u:User)
RETURN UPPER(u.name), LOWER(u.name);

-- Locale-specific case conversion
MATCH (u:User)
RETURN UPPER(u.name COLLATE 'tr_TR') AS turkish_upper;

-- Turkish 'i' has special case rules:
--   'i' → 'İ' (dotted capital I)
--   'ı' → 'I' (dotless capital I)
RETURN UPPER('istanbul' COLLATE 'tr_TR');  -- 'İSTANBUL'
</code></pre></div>Title Case: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Capitalize the first letter of each word (title case)
MATCH (u:User)
RETURN INITCAP(u.name) AS title_case;
-- Example: 'alice johnson' → 'Alice Johnson'
</code></pre></div>
<h3 id="emoji-and-special-characters" class="position-relative d-flex align-items-center group"> Emoji and Special Characters </h3>Geode fully supports emoji and special Unicode characters: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Store emoji
CREATE (p:Post {
  content: 'Loving the new features! 🎉🚀💯',
  reactions: ['❤️', '👍', '😂']
});

-- Search for emoji
MATCH (p:Post)
WHERE p.content CONTAINS '🎉'
RETURN p.content;

-- Count emoticons. This range covers only the Emoticons block
-- (U+1F600 to U+1F64F); emoji such as 🎉, 🚀, and 💯 lie outside it,
-- so widen the character class to count them.
MATCH (p:Post)
RETURN LENGTH(REGEXP_EXTRACT_ALL(p.content, '[\u{1F600}-\u{1F64F}]')) AS emoji_count;

-- Mathematical symbols
CREATE (eq:Equation {
  formula: '∫₀^∞ e^(-x²) dx = √π / 2',
  symbols: ['∫', '∞', '√', 'π']
});
</code></pre></div>Surrogate Pairs: Geode correctly handles characters outside the Basic Multilingual Plane (BMP), including emoji that require surrogate pairs in UTF-16: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- These emoji use 4-byte UTF-8 sequences
CREATE (p:Post {
  content: '🌈🦄🎨'  -- Rainbow, unicorn, palette
});

-- Character length is correct: 3 code points,
-- not 6 (UTF-16 code units) or 12 (UTF-8 bytes)
MATCH (p:Post)
RETURN CHAR_LENGTH(p.content);  -- Returns 3
</code></pre></div>
<h3 id="right-to-left-rtl-text" class="position-relative d-flex align-items-center group"> Right-to-Left (RTL) Text </h3>Geode stores and retrieves RTL text (Arabic, Hebrew, etc.) correctly: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Arabic and Hebrew text (RTL)
CREATE (p:Post {
  content_ar: 'مرحبا بك في قاعدة البيانات',
  content_he: 'ברוכים הבאים למסד הנתונים'
});

-- Mixed LTR/RTL (bidirectional text)
CREATE (p:Post {
  content: 'Welcome مرحبا שלום to our database!'
});

-- Query RTL text
MATCH (p:Post)
WHERE p.content_ar CONTAINS 'قاعدة البيانات'
RETURN p.content_ar;
</code></pre></div>
<h3 id="internationalization-patterns" class="position-relative d-flex align-items-center group"> Internationalization Patterns </h3>Language Metadata: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Store language metadata
CREATE (doc:Document {
  content: 'This is an English document',
  language: 'en',
  detected_script: 'Latin'
});

-- Query by language
MATCH (doc:Document {language: 'en'})
RETURN doc.content;
</code></pre></div>Locale-Specific Formatting: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Store locale preferences
CREATE (u:User {
  name: 'Alice',
  locale: 'en_US',
  timezone: 'America/New_York',
  date_format: 'MM/DD/YYYY',
  number_format: '#,##0.00'
});

-- Query with locale (assumes created_at and balance are also stored on the user)
MATCH (u:User)
RETURN u.name,
  FORMAT_DATE(u.created_at, u.date_format) AS formatted_date,
  FORMAT_NUMBER(u.balance, u.number_format) AS formatted_balance;
</code></pre></div>
<h3 id="full-text-search-with-multiple-languages" class="position-relative d-flex align-items-center group"> Full-Text Search with Multiple Languages </h3>Create language-specific full-text indexes: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- English full-text index
CREATE FULLTEXT INDEX content_en ON :Document(content_en)
WITH (language: 'english');

-- Spanish full-text index
CREATE FULLTEXT INDEX content_es ON :Document(content_es)
WITH (language: 'spanish');

-- Chinese full-text index (requires CJK tokenization)
CREATE FULLTEXT INDEX content_zh ON :Document(content_zh)
WITH (language: 'chinese');

-- Multi-language search
MATCH (d:Document)
WHERE d.content_en MATCHES 'database'
   OR d.content_es MATCHES 'base de datos'
RETURN d;
</code></pre></div>
<h3 id="character-analysis-functions" class="position-relative d-flex align-items-center group"> Character Analysis Functions </h3>Character Categories: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Check character type
RETURN IS_ALPHA('A');          -- true
RETURN IS_ALPHA('5');          -- false
RETURN IS_DIGIT('5');          -- true
RETURN IS_ALPHANUMERIC('A5');  -- true

-- Unicode category
RETURN UNICODE_CATEGORY('A');  -- 'Lu' (Letter, uppercase)
RETURN UNICODE_CATEGORY('π');  -- 'Ll' (Letter, lowercase)
RETURN UNICODE_CATEGORY('5');  -- 'Nd' (Number, decimal)
RETURN UNICODE_CATEGORY('!');  -- 'Po' (Punctuation, other)
</code></pre></div>Script Detection: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Detect script
RETURN DETECT_SCRIPT('Hello');       -- 'Latin'
RETURN DETECT_SCRIPT('こんにちは');   -- 'Hiragana'
RETURN DETECT_SCRIPT('你好');         -- 'Han'
RETURN DETECT_SCRIPT('مرحبا');       -- 'Arabic'
RETURN DETECT_SCRIPT('Привет');      -- 'Cyrillic'
</code></pre></div>
<h3 id="performance-considerations" class="position-relative d-flex align-items-center group"> Performance Considerations </h3>Indexing Unicode Text: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Create index on normalized text
CREATE INDEX user_name_normalized ON :User(NORMALIZE(name, 'NFC'));

-- Efficient search with normalization
MATCH (u:User)
WHERE NORMALIZE(u.name, 'NFC') = NORMALIZE($search_term, 'NFC')
RETURN u;
</code></pre></div>Storage Efficiency: <ul>
<li>UTF-8 is most compact for ASCII text (1 byte per character); most accented Latin letters and Greek, Cyrillic, Hebrew, and Arabic letters take 2 bytes</li>
<li>Most CJK characters, kana, and other Asian scripts in the Basic Multilingual Plane take 3 bytes</li>
<li>Most emoji and rare characters outside the BMP take 4 bytes</li>
<li>NFC normalization can reduce storage size by combining base characters and combining marks into precomposed characters</li>
</ul> Query Optimization: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Inefficient: case conversion on every row
MATCH (u:User)
WHERE LOWER(u.name) = 'alice'
RETURN u;

-- More efficient, option 1: index the lowercased expression
CREATE INDEX user_name_lower ON :User(LOWER(name));

-- More efficient, option 2: store a lowercase copy,
-- computed once on insert/update
MATCH (u:User)
SET u.name_lower = LOWER(u.name);
</code></pre></div>
<h3 id="best-practices" class="position-relative d-flex align-items-center group"> Best Practices </h3><ol>
<li> Always Normalize: Normalize text to NFC on insertion for consistent storage and comparison. </li>
<li> Choose Appropriate Collation: Use locale-specific collation for sorting user-visible lists. </li>
<li> Index Normalized Values: Create indexes on normalized text for efficient searches.
</li>
<li> Validate Input: Use Unicode-aware validation for emails, URLs, and other constrained fields. </li>
<li> Store Language Metadata: Track the language/locale of multilingual content for proper processing. </li>
<li> Test with Real Data: Use realistic multilingual test data including RTL text, emoji, and complex scripts. </li>
<li> Consider Locale in Application Logic: Make locale selection user-configurable for formatting and sorting. </li>
</ol>
<h3 id="common-use-cases" class="position-relative d-flex align-items-center group"> Common Use Cases </h3>Multilingual E-Commerce: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">MATCH (p:Product)
WHERE p.category_en = 'Electronics'
RETURN p.name_en, p.name_es, p.name_zh, p.price
ORDER BY p.name_en COLLATE 'en_US';
</code></pre></div>Global User Profiles: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">CREATE (u:User {
  username: 'alice',
  display_name: 'Alice Johnson',
  display_name_ja: 'アリス・ジョンソン',
  bio: 'Software engineer 👨‍💻 who loves databases 💾',
  preferred_locale: 'en_US'
});
</code></pre></div>International Search: <div class="highlight"><pre tabindex="0" class="chroma"><code class="language-gql" data-lang="gql">-- Search across languages with normalization
MATCH (doc:Document)
WHERE NORMALIZE(LOWER(doc.content), 'NFC') CONTAINS NORMALIZE(LOWER($search_query), 'NFC')
RETURN doc.title, doc.language;
</code></pre></div>
<h3 id="related-topics" class="position-relative d-flex align-items-center group"> Related Topics </h3><ul>
<li>Text Processing and String Operations</li>
<li>Full-Text Search and Indexing</li>
<li>Collation Configuration</li>
<li>Data Validation and Constraints</li>
<li>JSON and Semi-Structured Data</li>
<li>Regular Expressions</li>
<li>Client Library Character Encoding</li>
</ul>
<h3 id="further-reading" class="position-relative d-flex align-items-center group"> Further Reading </h3><ul>
<li>Unicode Standard Documentation</li>
<li>UTF-8 Encoding Specification</li>
<li>Unicode Normalization Forms (TR15)</li>
<li>Collation Algorithm (UCA)</li>
<li>Locale Data and Cultural Conventions</li>
<li>Emoji and Symbol Support</li>
<li>Internationalization Best Practices</li>
</ul>
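For readers who want to experiment with the normalization forms described under Character Normalization, the following client-side Python sketch may help. It uses the standard library's `unicodedata.normalize`, not Geode's `NORMALIZE` function, and the helper name `same_text` is made up for this illustration. It shows why two visually identical spellings of "José" compare as different until both are normalized to the same form.

```python
import unicodedata


def same_text(a: str, b: str, form: str = "NFC") -> bool:
    """Compare two strings after normalizing both to the same Unicode form.

    Hypothetical helper for illustration only; not part of any Geode client.
    """
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)


composed = "Jos\u00e9"     # 'é' as one precomposed code point (U+00E9)
decomposed = "Jose\u0301"  # 'e' followed by U+0301 COMBINING ACUTE ACCENT

if __name__ == "__main__":
    # The raw strings differ even though they render identically
    print(composed == decomposed)
    # After normalizing both sides to NFC they compare equal
    print(same_text(composed, decomposed))
    # NFC yields the shorter (composed) form, NFD the longer (decomposed) one
    print(len(unicodedata.normalize("NFC", decomposed)),
          len(unicodedata.normalize("NFD", composed)))
    # Compatibility forms also fold characters such as the "fi" ligature
    print(unicodedata.normalize("NFKC", "\ufb01"))
```

The same idea applies on the database side: normalize both the stored value and the search term to one form before comparing, and reserve the compatibility forms (NFKC, NFKD) for cases where losing distinctions such as ligatures is acceptable.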
