From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on ip-172-31-74-118.ec2.internal X-Spam-Level: X-Spam-Status: No, score=0.8 required=3.0 tests=BAYES_50,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.6 X-Received: by 2002:a05:620a:2950:: with SMTP id n16mr32753651qkp.405.1636109803270; Fri, 05 Nov 2021 03:56:43 -0700 (PDT) X-Received: by 2002:a5b:846:: with SMTP id v6mr58658703ybq.457.1636109803118; Fri, 05 Nov 2021 03:56:43 -0700 (PDT) Path: eternal-september.org!reader02.eternal-september.org!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail Newsgroups: comp.lang.ada Date: Fri, 5 Nov 2021 03:56:42 -0700 (PDT) In-Reply-To: Injection-Info: google-groups.googlegroups.com; posting-host=193.137.201.145; posting-account=3cDqWgoAAAAZXc8D3pDqwa77IryJ2nnY NNTP-Posting-Host: 193.137.201.145 References: User-Agent: G2/1.0 MIME-Version: 1.0 Message-ID: <1c6b151b-f017-496d-b381-ba08bef1bbb7n@googlegroups.com> Subject: Re: How to read in a (long) UTF-8 file, incrementally? From: Marius Amado-Alves Injection-Date: Fri, 05 Nov 2021 10:56:43 +0000 Content-Type: text/plain; charset="UTF-8" Xref: reader02.eternal-september.org comp.lang.ada:63099 List-Id: > Characters no longer exist as a thing as one can even be represented as > multiple utf-32 code points. You're alluding to combining characters?