deleteDiacriticalMarks missing some characters
Rick Nieuweveen
Some diacritics are not replaced by the deleteDiacriticalMarks() function.
Is it possible to add those to the function?
So far I have had issues with the characters ĉ, ĕ and Č.
I ran the function on many diacritics, and the following characters were also not replaced:
ÆÝÞæþĀāĂăĄąĈĉĊċČčĎďĐđĒēĔĕĖėĘęĚěĜĝĠġĢģĤĥĦħĨĩĪīĬĭĮįİıIJĴĵĶķĸĹĺĻļĽľĿŀŁłŃńŅņŇňʼnŊŋŒœŔŕŖŗŘřŚśŜŝŢţŤťŦŧŨũŪūŬŭŮůŲųŴŵŶŷŹźŻżſ
Mike van Eck
Also, when updating the function deleteDiacriticalMarks, maybe mark it as deprecated and add a new function replaceSpecialCharacters (the characters aren't deleted but replaced ;-)
Mike van Eck
This would be nice indeed. We have written an extra function to replace Polish characters. It is not as complete as the sample shown above, so it would be nice to have an extra standard function for this. Yes, they may not all be diacritics, but these characters aren't allowed in, for example, UPNs.
Rick Jongbloed
I think an issue here is that some characters are defined as separate letters in, for example, the Polish alphabet. You can't just strip the mark; you have to replace the character with something else. See https://en.wikipedia.org/wiki/Diacritic
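To illustrate the difference, here is a minimal Python sketch. It assumes the usual approach of decomposing the text (Unicode NFD) and dropping the combining marks; the thread doesn't confirm that deleteDiacriticalMarks works this way. The function name replace_special_characters and the FALLBACK table are hypothetical, and the replacements in the table are just one possible choice.

```python
import unicodedata

# Hypothetical fallback table for letters that have no canonical
# decomposition, so stripping combining marks leaves them unchanged.
# The chosen ASCII replacements are assumptions, not an official mapping.
FALLBACK = {
    "Æ": "AE", "æ": "ae",
    "Đ": "D", "đ": "d",
    "Ł": "L", "ł": "l",
    "Œ": "OE", "œ": "oe",
    "Þ": "Th", "þ": "th",
    "Ħ": "H", "ħ": "h",
    "Ŧ": "T", "ŧ": "t",
    "ı": "i",
}


def replace_special_characters(text: str) -> str:
    """Strip combining marks, then map the remaining special letters."""
    # NFD splits e.g. "ĉ" into "c" + U+0302 (combining circumflex).
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks (Unicode category Mn).
    stripped = "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
    # Replace letters that NFD could not decompose, such as "ł".
    return "".join(FALLBACK.get(ch, ch) for ch in stripped)
```

With this sketch, "ĉĕČ" becomes "ceC" through decomposition alone, while "Łódź" only becomes "Lodz" because Ł is in the fallback table. Any letter that is neither decomposable nor in the table is passed through unchanged.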
Onno van Hooff
I had an issue with the character ł while working with a Polish company. I created a temporary workaround with a single replace for that one character. Not the best solution. So please adjust deleteDiacriticalMarks.
Rick van den Dijssel
Onno van Hooff: Is this one of the above characters or another one? If it's another character, could you please let us know which one?
Onno van Hooff
Rick van den Dijssel: Hi Rick. This character [ł] is already mentioned by Rick Nieuweveen. I only wanted to let you know that I also ran into this problem.
Rick van den Dijssel
Onno van Hooff: OK, thanks. I didn't want to miss your character in case we adjust the function.