From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on ip-172-31-65-14.ec2.internal X-Spam-Level: X-Spam-Status: No, score=-1.9 required=3.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.6 Path: eternal-september.org!reader02.eternal-september.org!aioe.org!vNObJwB5W4WN632vBkQn9g.user.46.165.242.75.POSTED!not-for-mail From: Simon Wright Newsgroups: comp.lang.ada Subject: Re: Ada and Unicode Date: Fri, 08 Apr 2022 20:19:08 +0100 Organization: Aioe.org NNTP Server Message-ID: References: <607b5b20$0$27442$426a74cc@news.free.fr> <86mttuk5f0.fsf@stephe-leake.org> Mime-Version: 1.0 Content-Type: text/plain Injection-Info: gioia.aioe.org; logging-data="55470"; posting-host="vNObJwB5W4WN632vBkQn9g.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org"; User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (darwin) X-Notice: Filtered by postfilter v. 0.9.2 Cancel-Lock: sha1:R18A7vRIOGoE1z5Qk0FnLIExUhc= Xref: reader02.eternal-september.org comp.lang.ada:63716 List-Id: "Dmitry A. Kazakov" writes: > On 2022-04-08 10:56, Simon Wright wrote: >> "Randy Brukardt" writes: >> >>> If you had an Ada-like language that used a universal UTF-8 string >>> internally, you then would have a lot of old and mostly useless >>> operations supported for array types (since things like slices are >>> mainly useful for string operations). >> >> Just off the top of my head, wouldn't it be better to use >> UTF32-encoded Wide_Wide_Character internally? > > Yep, that is the exactly the problem, a confusion between interface > and implementation. Don't understand. My point was that *when you are implementing this* it mught be easier to deal with 32-bit charactrs/code points/whatever the proper jargon is than with UTF8. > Encoding /= interface, e.g. an interface of a string viewed as an > array of characters. That interface just same for ASCII, Latin-1, > EBCDIC, RADIX50, UTF-8 etc strings. Why do you care what is inside? With a user's hat on, I don't. Implementers might have a different point of view.