`long double` support for ImageIO, MeshIO pixels?

Niels_Dekker · June 24, 2025, 4:49pm

Should ITK’s ImageIO and MeshIO both support long double as a possible type for pixel components? The IOComponent enum class has an enumerator for long double, named LDOUBLE, at ITK/Modules/Core/Common/include/itkCommonEnums.h at 45b4e6cfb8cda8f156ed9c67a38b7a3a71dae8ee · InsightSoftwareConsortium/ITK · GitHub

It appears that long double is now only supported by MeshIO, not by ImageIO. Honestly I don’t really have a use case for long double, and I realize that its size is very platform dependent: typically either 64 bits, 80 bits, or 128 bits. Is long double still important to other users?

I’m asking, just because I’m considering some refactoring, to share code between ImageIO and MeshIO. Specifically, MeshIOBase::MapComponentType and ImageIOBase::MapPixelType are very similar, but the first one supports long double, and the second one does not!

dzenanz · June 24, 2025, 6:06pm

long double was interesting with IA32 and older architectures in x64 series, because 80-bit float was a (the?) native type. With AMD64 that disappeared, which has 64-bit native float.

As removals are easier, I have a mild preference for adding LDOUBLE to ImageIO.

blowekamp · June 24, 2025, 6:23pm

The concept of having the ImageIO express pixel component types as non-portable types is problematic. Files are potable between systems, these descriptive types are not. IMHO describing the component type as “long double” is not descriptive and will not be well defined.

For comparison of a portable specification, consider the ZARR dtype spec:

Niels_Dekker · June 26, 2025, 10:16am

Thanks to you both @dzenanz and @blowekamp So basically, there are three possible options:

Let ImageIO and MeshIO both support long double
Drop long double for both ImageIO and MeshIO
Let MeshIO support long double, while ImageIO does not (current situation)

Could it be that long double is more relevant for MeshIO than it is for ImageIO, for fundamental reasons?

By the way, we could of course also add fixed width floating point types, FLOAT80 and FLOAT128 to the enum class IOComponent, but then they should be processed in a platform/compiler dependent way. (Possibly throw an exception when the platform does not have a floating point type of that size.)

For the record

It looks like ImageIO never supported LDOUBLE, looking at revision COMP: Restore GetComponentTypeInfo Method to itk::ImageIOBase · InsightSoftwareConsortium/ITK@aa87186 · GitHub (Feb 2011):

github.com/InsightSoftwareConsortium/ITK

Code/IO/itkImageIOBase.h

aa87186ca


      
          IMAGEIOBASE_TYPEMAP(char, CHAR);
          IMAGEIOBASE_TYPEMAP(unsigned char, UCHAR);
          IMAGEIOBASE_TYPEMAP(short, SHORT);
          IMAGEIOBASE_TYPEMAP(unsigned short, USHORT);
          IMAGEIOBASE_TYPEMAP(int, INT);
          IMAGEIOBASE_TYPEMAP(unsigned int, UINT);
          IMAGEIOBASE_TYPEMAP(long, LONG);
          IMAGEIOBASE_TYPEMAP(unsigned long, ULONG);
          IMAGEIOBASE_TYPEMAP(float, FLOAT);
          IMAGEIOBASE_TYPEMAP(double, DOUBLE);
          #undef IMAGIOBASE_TYPEMAP

On the other hand, it looks like MeshIO always supported LDOUBLE, looking at revision ENH: Add mesh IO · InsightSoftwareConsortium/ITK@9403e3b · GitHub (Aug 2011):

github.com/InsightSoftwareConsortium/ITK

Modules/IO/Mesh/include/itkMeshIOBase.h

9403e3b37


      
          MESHIOBASE_TYPEMAP(unsigned char, UCHAR);
          MESHIOBASE_TYPEMAP(char, CHAR);
          MESHIOBASE_TYPEMAP(unsigned short, USHORT);
          MESHIOBASE_TYPEMAP(short, SHORT);
          MESHIOBASE_TYPEMAP(unsigned int, UINT);
          MESHIOBASE_TYPEMAP(int, INT);
          MESHIOBASE_TYPEMAP(unsigned long, ULONG);
          MESHIOBASE_TYPEMAP(long, LONG);
          MESHIOBASE_TYPEMAP(unsigned long long, ULONGLONG);
          MESHIOBASE_TYPEMAP(long long, LONGLONG);
          MESHIOBASE_TYPEMAP(float, FLOAT);
          MESHIOBASE_TYPEMAP(double, DOUBLE);
          MESHIOBASE_TYPEMAP(long double, LDOUBLE);
          #undef MESHIOBASE_TYPEMAP

blowekamp · June 26, 2025, 12:04pm

Does “long double” for ImageIO provide anything meaningful? Does any Image file format support “long double”?

Also how would FLOAT80 be stores as in a buffer? 3-bytes with poor alignment? 4-bytes?

I don’t see a need to for this and you state that you don’t have a use case either. There are details here that could be done incorrectly that would cause more problems if an implementation with a well defined use case was required.

dzenanz · June 26, 2025, 1:56pm

Brad, do you prefer leaving things as is, or removing LDOUBLE from MeshIO?

blowekamp · June 26, 2025, 2:14pm

I do not know if the MeshIO is usable or functional with long double. A quick grep though the IO code seems to indicate that there are a large number of occurrences of LDOUBLE in many of the MeshIO implementation so this would indicate good support for it at one point.

The ImageIOBase::IOComponentEnum is already itk::IOComponentEnum which contains LDOUBLE in the enum. So it looks like this enum is already valid if ImageIO classes.

In that context, defining the behavior or LDOUBLE for imageIO seems more reasonable. I hope that all the ImageIO implementations have correct handling of the “default” case for this enum.

Niels_Dekker · June 26, 2025, 3:13pm

Ideally I would like to move the enum class { UCHAR, CHAR, ..., LDOUBLE }, as well as the type-to-enum-value map (Map...Type<T>) to “Core/Common”, so that they can be shared between the ImageIO and MeshIO. What do you think?

blowekamp · June 26, 2025, 3:26pm

Isn’t the IOComponent already in Core/Common/itkCommonEnums.h? What change are you proposing?

Have a common ctype-to-enum-value map seems to be a good idea to me.

Niels_Dekker · June 26, 2025, 6:03pm

Yes indeed, IOComponent is already in Core/Common/itkCommonEnums.h. (Sorry I was mistaken!)

So I would then just propose a common ctype-to-enum-value map.

Note that there appears another difference between MeshIOBase and ImageIOBase. MeshIOBase::MapComponentType unconditionally maps char to CHAR, whereas ImageIOBase::MapPixelType maps char to either CHAR or UCHAR, depending on the signedness of char: if char is signed, it is mapped to CHAR, otherwise to UCHAR. This difference should also be resolved, when introducing a common ctype-to-enum-value map.

Currently IOComponent has distinct enum values for two char types: CHAR and UCHAR. I think it would be easier if we would have an extra enum value for SIGNED_CHAR, because in C++ char and signed char are distinct type. And then unconditionally map char to CHAR, and signed char to SIGNED_CHAR, right? Or would that break too much legacy code?

blowekamp · June 26, 2025, 6:38pm

It sounds like you are thinking the CHAR enum is describing the char C type ( which is incorrect). The actual usage of it is to describe a signed char/8-bit type. Changing the definition of the CHAR enum would not be good. Perhaps adding an alias of “SIGNED_CHAR == CHAR” would be descriptive and useful.

Also 8-bit integers are only signed or unsigned. The three states of C’s “char” types is an artifact of C and not data. The ImageIO interface really should be describing what is stored on the disk and not the state of types and sizes at programming language level, IMHO. That is to say I have a long term frustration with this interface not providing fixed width integer types.

Niels_Dekker · June 26, 2025, 6:50pm

It sounds like you are thinking the CHAR enum is describing the char C type

Thanks, but I guess most users would think so as well. And so does MeshIOBase. (Assuming that MeshIOBase can actually think .)

That frustration is over now, right?

ENH: Fixed width type enums (INT8, UINT64, FLOAT64, ...) for IOComponent by N-Dekker · Pull Request #5410 · InsightSoftwareConsortium/ITK · GitHub

blowekamp · June 26, 2025, 7:11pm

But the CHAR enum type ( for ImageIO ) currently must represents signed 8-bit data. There is not other way to represent as a component type, so it much be signed:

github.com/SimpleITK/SimpleITK

Code/IO/src/sitkImageReaderBase.cxx

master


      
          case itk::IOComponentEnum::CHAR:
            return ImageTypeToPixelIDValue<itk::Image<int8_t, UnusedDimension>>::Result;

(Those later cases look like they can be conveniently updated with the new fixed types. )

And yes, 8-bit signed data is real and common with certain file formats. Java does not have an 8-bit unsigned data type. So for Java developer signed 8-bit data/images are very natural.

Niels_Dekker · June 30, 2025, 11:18am

FYI, This pull request of mine is somewhat related:

github.com/InsightSoftwareConsortium/ITK

STYLE: Remove template specializations of `MeshIOBase::MapComponentType`

main ← N-Dekker:Remove-MeshIOBase-MapComponentType-specializations

opened 01:52PM - 27 Jun 25 UTC

N-Dekker

+18 -23

Follow-up to pull request https://github.com/InsightSoftwareConsortium/ITK/pull/…5421 commit b209c1a00f2cc042172996631d8cf5a06658fd1a "STYLE: Remove template specializations of `ImageIOBase::MapPixelType`" Note that `MeshIOBase::MapComponentType` supports `long double` as pixel component type, whereas `ImageIOBase::MapPixelType` does not. Moreover, `MeshIOBase::MapComponentType` unconditionally maps `char` to `CHAR`, whereas `ImageIOBase::MapPixelType` maps `char` to either `CHAR` or `UCHAR`, depending on the signedness of `char`.

Please note this pull request in it current state is only a style PR. It does not address the different behavior between MeshIO and ImageIO regarding the type mapping of CHAR and LDOUBLE (long double). So it may be processed independently.

Niels_Dekker · July 1, 2025, 11:23am

I see, SimpleITK’s sitkImageReaderBase maps IOComponentEnum::CHAR to int8_t (which is a signed integer type, by definition), whereas ITK’s itk::ImageFileReader maps IOComponentEnum::CHAR to char (which might be an unsigned integer type):

github.com/InsightSoftwareConsortium/ITK

Modules/IO/ImageBase/include/itkImageFileReader.hxx

0b79fd0c2


      
            {                                                                                                          \
              ConvertPixelBuffer<type, OutputImagePixelType, ConvertPixelTraits>::Convert(                             \
                static_cast<const type *>(inputData), m_ImageIO->GetNumberOfComponents(), outputData, numberOfPixels); \
            }                                                                                                          \
          }
          
          if (false)
          {
          }
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::UCHAR, unsigned char)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::CHAR, char)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::USHORT, unsigned short)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::SHORT, short)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::UINT, unsigned int)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::INT, int)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::ULONG, unsigned long)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::LONG, long)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::ULONGLONG, unsigned long long)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::LONGLONG, long long)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::FLOAT, float)
          ITK_CONVERT_BUFFER_IF_BLOCK(IOComponentEnum::DOUBLE, double)

blowekamp · July 2, 2025, 1:52pm

Interesting, thanks of sharing.

For the imageIOs which support signed 8-bit data, they are most likely reporting their data as ENUM::CHAR. Perhaps, “CHAR” should be changed to “signed char”? I think there are some portability issue with the current definitions, but I think it was “working” in practice so it was left alone.

A couple examples:

github.com/InsightSoftwareConsortium/ITK

Modules/IO/MRC/src/itkMRCImageIO.cxx

09305ce7c


      
          // There has been some confusion and inconsistency whether this
          // mode is signed or unsigned, but now MRC2014 clearly defines it
          // as signed.
          if (header.amin < 0 && header.amax >= header.amin)
          {
            this->SetComponentType(IOComponentEnum::CHAR);
          }
          else
          {
            this->SetComponentType(IOComponentEnum::UCHAR);
          }

github.com/InsightSoftwareConsortium/ITKIOOMEZarrNGFF

src/itkOMEZarrNGFFImageIO.cxx

4bd6557d2


      
          case tensorstore::DataTypeId::char_t:
          case tensorstore::DataTypeId::int8_t:
            return IOComponentEnum::CHAR;

Also note I am actively using sighed 8-bit images, for the MRC image file format.

The argument for just the “char” type for being useful would be to represent text. Which I don’t think make sense for the ImageIO class. I don’t believe that this enum is used to represent any meta-data? If you did want to represent text where is a whole other set of text encoding that would be relevant, and that is not something that should opened up.

Niels_Dekker · July 2, 2025, 4:17pm

Thanks @blowekamp

Honestly I don’t know if it’s useful to have a pixel type char to represent text. If we agree to only support signed char and unsigned char (but not plain char), I would suggest renaming the enum CHAR to SIGNED_CHAR.

On the other hand, if there is a use case for plain char pixels, let it map unconditionally to the enum CHAR, and add a SIGNED_CHAR enum, unconditionally mapping signed char (not plain char).

But honestly I don’t know what to choose