
SOLVED: How to find: Blank Char / Invisible Character AND Hidden Char

Posted: Fri Mar 15, 2019 12:31 pm
by Debugger
This document contains characters... (Current Encoding: 1250).


How can I find/detect a strange character?
This is strange to me because it is simple Polish text, so the 1250 encoding is 100% correct.

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 2:25 am
by void
To search with a PCRE regex for any character that is not in code page 1250:

Code:

[^\x{0000}\x{0001}\x{0002}\x{0003}\x{0004}\x{0005}\x{0006}\x{0007}\x{0008}\x{0009}\x{000a}\x{000b}\x{000c}\x{000d}\x{000e}\x{000f}\x{0010}\x{0011}\x{0012}\x{0013}\x{0014}\x{0015}\x{0016}\x{0017}\x{0018}\x{0019}\x{001a}\x{001b}\x{001c}\x{001d}\x{001e}\x{001f}\x{0020}\x{0021}\x{0022}\x{0023}\x{0024}\x{0025}\x{0026}\x{0027}\x{0028}\x{0029}\x{002A}\x{002B}\x{002C}\x{002D}\x{002E}\x{002F}\x{0030}\x{0031}\x{0032}\x{0033}\x{0034}\x{0035}\x{0036}\x{0037}\x{0038}\x{0039}\x{003A}\x{003B}\x{003C}\x{003D}\x{003E}\x{003F}\x{0040}\x{0041}\x{0042}\x{0043}\x{0044}\x{0045}\x{0046}\x{0047}\x{0048}\x{0049}\x{004A}\x{004B}\x{004C}\x{004D}\x{004E}\x{004F}\x{0050}\x{0051}\x{0052}\x{0053}\x{0054}\x{0055}\x{0056}\x{0057}\x{0058}\x{0059}\x{005A}\x{005B}\x{005C}\x{005D}\x{005E}\x{005F}\x{0060}\x{0061}\x{0062}\x{0063}\x{0064}\x{0065}\x{0066}\x{0067}\x{0068}\x{0069}\x{006A}\x{006B}\x{006C}\x{006D}\x{006E}\x{006F}\x{0070}\x{0071}\x{0072}\x{0073}\x{0074}\x{0075}\x{0076}\x{0077}\x{0078}\x{0079}\x{007A}\x{007B}\x{007C}\x{007D}\x{007E}\x{007F}\x{20AC}\x{201A}\x{201E}\x{2026}\x{2020}\x{2021}\x{2030}\x{0160}\x{2039}\x{015A}\x{0164}\x{017D}\x{0179}\x{2018}\x{2019}\x{201C}\x{201D}\x{2022}\x{2013}\x{2014}\x{2122}\x{0161}\x{203A}\x{015B}\x{0165}\x{017E}\x{017A}\x{00A0}\x{02C7}\x{02D8}\x{0141}\x{00A4}\x{0104}\x{00A6}\x{00A7}\x{00A8}\x{00A9}\x{015E}\x{00AB}\x{00AC}\x{00AD}\x{00AE}\x{017B}\x{00B0}\x{00B1}\x{02DB}\x{0142}\x{00B4}\x{00B5}\x{00B6}\x{00B7}\x{00B8}\x{0105}\x{015F}\x{00BB}\x{013D}\x{02DD}\x{013E}\x{017C}\x{0154}\x{00C1}\x{00C2}\x{0102}\x{00C4}\x{0139}\x{0106}\x{00C7}\x{010C}\x{00C9}\x{0118}\x{00CB}\x{011A}\x{00CD}\x{00CE}\x{010E}\x{0110}\x{0143}\x{0147}\x{00D3}\x{00D4}\x{0150}\x{00D6}\x{00D7}\x{0158}\x{016E}\x{00DA}\x{0170}\x{00DC}\x{00DD}\x{0162}\x{00DF}\x{0155}\x{00E1}\x{00E2}\x{0103}\x{00E4}\x{013A}\x{0107}\x{00E7}\x{010D}\x{00E9}\x{0119}\x{00EB}\x{011B}\x{00ED}\x{00EE}\x{010F}\x{0111}\x{0144}\x{0148}\x{00F3}\x{00F4}\x{0151}\x{00F6}\x{00F7}\x{0159}\x{016F}\x{00FA}\x{0171}\x{00FC}\x{00FD}\x{0163}\x{02D9}]
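
Outside an editor, the same check can be sketched in Python by letting the codec decide what fits in code page 1250. This is only a minimal sketch, not what the editors do internally. The function name `find_non_cp1250` and the sample string are made up for illustration, and Python's built-in `cp1250` codec is assumed to be a close enough stand-in for the Windows code page.

```python
import unicodedata


def find_non_cp1250(text, encoding="cp1250"):
    """Return (index, code point, Unicode name) for every character
    that cannot be encoded in the given code page."""
    hits = []
    for index, ch in enumerate(text):
        try:
            ch.encode(encoding)
        except UnicodeEncodeError:
            name = unicodedata.name(ch, "<unnamed>")
            hits.append((index, f"U+{ord(ch):04X}", name))
    return hits


# Hypothetical sample: Polish text with a zero width space hidden after
# the first word, plus a no-break space that cp1250 CAN represent.
sample = "Zażółć\u200b gęślą\u00a0jaźń"

for hit in find_non_cp1250(sample):
    print(hit)
```

Reporting the index and the Unicode name makes an invisible character identifiable even when the editor renders nothing at that position.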

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 7:03 am
by Debugger
It finds nothing.
I tested in Notepad++ and EmEditor.

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 10:50 am
by tuska
Debugger wrote:
Sat Mar 16, 2019 7:03 am
It finds nothing.
I tested in ... EmEditor.
Any character with PCRE regex not in code page 1250.png

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 11:18 am
by Debugger
Your screenshot shows a differently configured EmEditor interface, or a version other than the latest version of EmEditor (Pro version).

A strange symbol can also be something that cannot be seen.

I have no idea why I cannot save the text with the default encoding, or which symbol, Unicode character, or special character makes it impossible.
If you save without UTF-8, for example, you get the name:

Code:

??

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 11:32 am
by tuska
Debugger wrote: Your screenshot shows a differently configured EmEditor interface, or a version other than the latest version of EmEditor (Pro version).
That's just the find toolbar (menu "View" - "Toolbars"...) - I use EmEditor Pro Version 18.6.91 x64.

If I want to save a file with special characters (as shown in the picture) as a .txt file,
then "Save as Unicode (UTF-16LE with signature)" is automatically suggested to me...
Saving in this format lets me open the text file without any problems and save it again later
without any further prompt.

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 11:54 am
by Debugger
I understand. Filter Toolbar.
But it still does not detect the strange character.
I want to save as 1250, for the Polish text. I do not need any UTF-16LE.
I also tried online tools for detecting strange characters, but they cannot detect the strange character either.


This detects the Polish text, but does not detect strange characters:
[a-żA-Ż ?,:]
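
One possible reason this class does not single out anything strange: in a Unicode-aware regex engine, a range such as `a-ż` is evaluated by code point, so it covers everything from U+0061 to U+017C, not only Polish letters. That span includes the no-break space (U+00A0) and the soft hyphen (U+00AD). The sketch below shows this with Python's `re` module; Notepad++ and EmEditor use different engines, so their behavior is assumed rather than verified here, and the variable name `polish_guess` is made up.

```python
import re

# The class from the post, unchanged.
polish_guess = re.compile(r"[a-żA-Ż ?,:]")

# A Polish letter, two characters that are usually invisible but lie
# inside the a-ż code point range, and one that lies outside it.
candidates = ["ą", "\u00a0", "\u00ad", "\u200b"]

for ch in candidates:
    matched = polish_guess.fullmatch(ch) is not None
    print(f"U+{ord(ch):04X} matched={matched}")
```

Listing the Polish letters explicitly (for example `ąćęłńóśźż` and their capitals) instead of using a range avoids sweeping in these extra characters.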

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 12:49 pm
by tuska
Debugger wrote: I understand. Filter Toolbar
No, I meant the "Find Toolbar"! -> Menu "View" - "Toolbars" - "Find Toolbar"

Did you paste the code into the "Find" field of the "Find Toolbar",
- click the "Use Regular Expressions" button in the "Find Toolbar", and
- then click "Find Next" in the "Find Toolbar"?
When I do that, the special characters are marked as shown.

Re: How to find a strange character?

Posted: Sat Mar 16, 2019 1:14 pm
by Debugger
But my text contains none of the special characters you mentioned.

I have checked thousands of different Unicode characters and still cannot detect a strange or illegal symbol.

[^\x00-\x7F]+
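
A pattern like this matches every non-ASCII character, including the ordinary Polish letters, so the hits it produces do not separate legitimate text from hidden characters. A narrower check is to look at each character's Unicode general category. The sketch below is one hedged way to do that in Python; the function name `find_invisible` and the choice of categories (format, control, and separator characters other than the usual space, tab, and line breaks) are assumptions for illustration, not a complete definition of "invisible".

```python
import unicodedata

# Categories that usually render as nothing or as blank space.
INVISIBLE_CATEGORIES = {"Cf", "Cc", "Zs", "Zl", "Zp"}
# Everyday whitespace that should not be reported.
ORDINARY = {" ", "\t", "\n", "\r"}


def find_invisible(text):
    """Return (index, code point, Unicode name) for characters that are
    likely invisible, based on their Unicode general category."""
    hits = []
    for index, ch in enumerate(text):
        if ch in ORDINARY:
            continue
        if unicodedata.category(ch) in INVISIBLE_CATEGORIES:
            name = unicodedata.name(ch, "<unnamed>")
            hits.append((index, f"U+{ord(ch):04X}", name))
    return hits


# Hypothetical sample with three hidden characters.
for hit in find_invisible("a\u200bb\u00a0c\ufeff"):
    print(hit)
```

Unlike the cp1250 check, this also reports characters such as the no-break space that the code page can store but that are easy to overlook on screen.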