Constraint 4: Syntactic Linearity
Complex Noun Phrases & Modification Scope
English technical nomenclature is often Head-Final (Modifiers + Noun), whereas French is Head-Initial (Noun + Modifiers). Generic NMT models suffer from "Linearity Bias," processing text sequentially without performing the necessary syntactic inversion. This disconnects the noun from its modifiers, frequently producing lists of unrelated items rather than a single compound term, leading to claims that are legally indefinite.
Corpus status:
4 cases
— Active. New cases added as documented from ongoing HITL review.
| ID | Error Type | Case Summary | |
|---|---|---|---|
| C4-001 | Linearity Bias | Syntactic Linearity Bias — "Electrocardiogram Lead". The source phrase "Electrocardiogram, ECG, Lead" defined a single component (a lead of an ECG). The generic NMT model processed the tokens linearly (1-2-3), outputting Électrocardiogramme, ECG, Dérivation (Head-Final structure in French), structurally creating a list of three separate items — a machine, an acronym, and a wire. The alignment protocol enforces "Head-Initial Inversion," restructuring the phrase to Dérivation d'électrocardiogramme (ECG) to preserve the singular legal object. | View PDF |
| C4-002 | Contextual Contamination | Mathematical Integrity — The "Triple Prime Contagion". In a semiconductor patent, the triple prime symbol (''') appeared frequently in reference numerals. The generic NMT model, exhibiting "Contextual Contagion," hallucinated this symbol into a mathematical formula, translating 0.3·Lp2 as 0,3''' lp2. This error deleted the multiplication operator and corrupted the variable case (L vs l), rendering the claimed range mathematically undefined. The alignment protocol implements "Regex Protection Layers" to strictly quarantine formulas from text-based symbol hallucination. | View PDF |
| C4-003 | Determiner Chain | Nested Determiner Chains — The "Double-Two" Tangle. The phrase "a second two of the plurality" uses a recursive [Ordinal + Cardinal] structure to define a data subset. Generic NMT models processed this linearly, mistranslating "Second" (Ordinal) as the noun Secondes (Time Unit), effectively claiming "Two seconds of values." This renders the claim physically impossible. The alignment protocol enforces a "Functional Restructuring" to Deux autres (Two others), avoiding the time-trap while strictly preventing Added Matter (e.g., adding nouns like "pair"). | View PDF |
| C4-004 | Mood Fragmentation | Syntactic Discontinuity — The "Mood Swing" Method. A method claim with five parallel steps used consistent present participles in English. The generic NMT model failed to maintain parallelism in French, oscillating between noun phrases (la mise), infinitives (insérer), and technical nouns (imagerie). This structural fragmentation undermines the "Single-Sentence" requirement of patent claims and creates legal ambiguity. The alignment protocol enforces "Mood Synchronization," ensuring all method steps utilize a consistent infinitive structure. | View PDF |