comp.lang.ada
 help / color / mirror / Atom feed
From: Simon Wright <simon@pushface.org>
Subject: Re: How to read in a (long) UTF-8 file, incrementally?
Date: Fri, 05 Nov 2021 19:55:33 +0000	[thread overview]
Message-ID: <lymtmixtqi.fsf@pushface.org> (raw)
In-Reply-To: 1c6b151b-f017-496d-b381-ba08bef1bbb7n@googlegroups.com

Marius Amado-Alves <amado.alves@gmail.com> writes:

>> Characters no longer exist as a thing as one can even be represented as 
>> multiple utf-32 code points.
>
> You're alluding to combining characters?

Fun & games on macOS[1]:

> $ GNAT_FILE_NAME_CASE_SENSITIVE=1 gnatmake -c p*.ads
> gcc -c páck3.ads
> páck3.ads:1:10: warning: file name does not match unit name, should be "páck3.ads"
> 
> The reason for this apparently-bizarre message is that macOS takes the
> composed form (lowercase a acute) and converts it under the hood to
> what HFS+ insists on, the fully decomposed form (lowercase a,
> combining acute); thus the names are actually different even though
> they _look_ the same.

[1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81114#c1

  reply	other threads:[~2021-11-05 19:55 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-11-02 17:42 How to read in a (long) UTF-8 file, incrementally? Marius Amado-Alves
2021-11-02 18:17 ` Dmitry A. Kazakov
2021-11-03  7:43 ` Vadim Godunko
2021-11-03  8:48 ` Luke A. Guest
2021-11-04 11:43   ` Marius Amado-Alves
2021-11-04 12:13     ` Dmitry A. Kazakov
2021-11-04 14:30     ` Luke A. Guest
2021-11-05 10:56       ` Marius Amado-Alves
2021-11-05 19:55         ` Simon Wright [this message]
2021-11-16 11:55           ` Marius Amado-Alves
2021-11-16 12:36             ` Dmitry A. Kazakov
2021-11-16 13:52               ` Marius Amado-Alves
2021-11-16 20:23               ` Randy Brukardt
2021-11-16 15:25             ` Luke A. Guest
2021-11-16 17:38             ` Vadim Godunko
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox