From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on ip-172-31-65-14.ec2.internal X-Spam-Level: * X-Spam-Status: No, score=1.1 required=3.0 tests=AC_FROM_MANY_DOTS,BAYES_00, T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 Path: eternal-september.org!feeder.eternal-september.org!news.gegeweb.eu!gegeweb.org!usenet-fr.net!proxad.net!feeder1-2.proxad.net!cleanfeed3-b.proxad.net!nnrp1-1.free.fr!not-for-mail From: Thomas Newsgroups: comp.lang.ada Mail-Copies-To: nobody Subject: Re: Ada and Unicode References: <607b5b20$0$27442$426a74cc@news.free.fr> <660e25a5-506b-43c0-b4ac-e7738e5500e5n@googlegroups.com> <48309745-aa2a-47bd-a4f9-6daa843e0771n@googlegroups.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit User-Agent: MT-NewsWatcher/3.5.3b3 (Intel Mac OS X) Date: Fri, 31 Mar 2023 01:35:48 +0200 Message-ID: <64261cd4$0$7661$426a74cc@news.free.fr> Organization: Guest of ProXad - France NNTP-Posting-Date: 31 Mar 2023 01:35:48 CEST NNTP-Posting-Host: 91.175.52.121 X-Trace: 1680219348 news-2.free.fr 7661 91.175.52.121:7873 X-Complaints-To: abuse@proxad.net Xref: feeder.eternal-september.org comp.lang.ada:65028 List-Id: sorry for the delay. In article <48309745-aa2a-47bd-a4f9-6daa843e0771n@googlegroups.com>, Vadim Godunko wrote: > On Sunday, April 3, 2022 at 10:20:21 PM UTC+3, Thomas wrote: > > > > > But don't use unit names containing international characters, at any > > > rate if you're (interested in compiling on) Windows or macOS: > > > > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81114 > > > > and this kind of problems would be easier to avoid if string types were > > stronger ... > > > > Your suggestion is unable to resolve this issue on Mac OS X. i said "easier" not "easy". don't forget that Unicode has 2 levels : - octets <-> code points - code points <-> characters/glyphs and you can't expect the upper to work if the lower doesn't. > Like case > sensitivity, binary compare of two strings can't compare strings in different > normalization forms. Right solution is to use right type to represent any > paths, what would be the "right type", according to you? In fact, here the first question to ask is: what's the expected encoding for Ada.Text_IO.Open.Name? - is it Latin-1 because the type is String not UTF_8_String? - is it undefined because it depends on the underling FS? -- RAPID maintainer http://savannah.nongnu.org/projects/rapid/