From 94b4df4e85607dd8f7ed6488dd5988682ad3fb17 Mon Sep 17 00:00:00 2001
From: Akash Mozumdar
Date: Sat, 8 Feb 2020 18:04:32 -0700
Subject: [PATCH] Updated FAQ (markdown)

---
 FAQ.md | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/FAQ.md b/FAQ.md
index 81f41f9..3ec846d 100644
--- a/FAQ.md
+++ b/FAQ.md
@@ -3,11 +3,11 @@
 ## Textractor is extracting text *mostly* correctly, but there's some extra characters as markup/garbage (e.g. a `\n` in place of every line break). Is there a way to clean the text?
 Yup, use the `Regex Filter` or `Replacer` extension. Remember to put the extension near the top of the list so the other extensions see the cleaned text.
 Some useful regex filters:
-`\s` (filters all whitespace)
-`[\u0021-\u00ff]` (filters all european language and most special characters)
-`[\u0100-\uffff]` (filters all non european language characters)
-`[\u0000-\u2fff\ua000-\uffff]` (filters all non Chinese/Japanese/Korean characters)
-`<.+?>` (filters all HTML tags like <br>)
+- `\s` (filters all whitespace)
+- `[\u0021-\u00ff]` (filters all european language and most special characters)
+- `[\u0100-\uffff]` (filters all non european language characters)
+- `[\u0000-\u2fff\ua000-\uffff]` (filters all non Chinese/Japanese/Korean characters)
+- `<.+?>` (filters all HTML tags like <br>)
 
 ## Textractor is extracting text with some characters missing or is unable to extract any text remotely close to what I need. How do I extract the correct text?
 Oof, looks like you found a game with an engine that Textractor doesn't natively support. There's two things you should try: