CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null #1342

dbwiddis · 2021-04-21T07:58:55Z

CFStringRef#stringValue follows the following steps to create a String:

Gets the character length of the String. This does not include a null.

CFIndex length = INSTANCE.CFStringGetLength(this);

Gets the maximum size for UTF8 encoding for this number of characters (3 x the number of characters)

CFIndex maxSize = INSTANCE.CFStringGetMaximumSizeForEncoding(length, kCFStringEncodingUTF8);

Allocates a buffer for exactly those bytes and gets the String

Memory buf = new Memory(maxSize.longValue());
if (0 != INSTANCE.CFStringGetCString(this, buf, maxSize, kCFStringEncodingUTF8)) {
    return buf.getString(0, "UTF8");
}

In the edge case where every character in the String is the max encoding length, there is no space for a null. To reproduce:

public static void main(String[] args) {

    String foo = "ࠀ"; // e0a080
    CFStringRef str = CFStringRef.createCFString(foo);

    try {
        CFIndex length = CoreFoundation.INSTANCE.CFStringGetLength(str);

        CFIndex maxLength = CoreFoundation.INSTANCE.CFStringGetMaximumSizeForEncoding(length,
                CoreFoundation.kCFStringEncodingUTF8);

        Memory buffer = new Memory(maxLength.intValue());
        byte ret = CoreFoundation.INSTANCE.CFStringGetCString(str, buffer, maxLength,
                CoreFoundation.kCFStringEncodingUTF8);
        System.out.print("Getting string with size " + maxLength.intValue() + " --> " + ret);
        if (ret == 0) {
            System.out.println(" (failed)");
        } else {
            System.out.println(" .. " + buffer.getString(0));
        }

        // increase size
        maxLength = new CFIndex(maxLength.intValue() + 1);

        buffer = new Memory(maxLength.intValue());
        ret = CoreFoundation.INSTANCE.CFStringGetCString(str, buffer, maxLength,
                CoreFoundation.kCFStringEncodingUTF8);
        System.out.print("Getting string with size " + maxLength.intValue() + " --> " + ret);
        if (ret == 0) {
            System.out.println(" (failed)");
        } else {
            System.out.println(" .. " + buffer.getString(0));
        }
    } finally {
        str.release();
    }
}

Output:

Getting string with size 3 --> 0 (failed)
Getting string with size 4 --> 1 .. ࠀ

Proposed fix: Simply add 1 to the calculated max number of bytes

Possible improvement: Just multiply the length by 3 and add 1; or multiply the length by 4.

Other option: Map and call CFStringGetBytes passing null to get usedBufLen which will include the space for the null.

The text was updated successfully, but these errors were encountered:

dbwiddis · 2021-04-22T03:35:16Z

Reproduction also possible by altering existing test case to a string of all 3-byte utf8 characters. Adding 1 seems the simplest solution. Tried to implement CFStringGetBytes but couldn't get it to work with 0 for the lossByte parameter.

dbwiddis · 2021-04-25T14:27:33Z

So I thought this was fixed but had a suspicion it wasn't. UTF-8 takes up to 4 bytes, but the CoreFoundation function CFStringGetMaximumSizeForEncoding for the encoding kCFStringEncodingUTF8 only multiplies by 3. By replacing the test String with a four-byte emoji character ({ (byte) 0xF0, (byte) 0x9F, (byte) 0x98, (byte) 0x83 }) the conversion fails with a too small buffer.

I think it's best to just multiply by 4 and add one.

dbwiddis · 2021-04-25T15:07:23Z

Here's the macos Source code with the incorrect calculation. https://github.com/opensource-apple/CF/blob/master/CFString.c#L465

dbwiddis changed the title ~~CoreFoundation's CFString#stringValue doesn't add space for terminating null~~ CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null Apr 21, 2021

This was referenced Apr 21, 2021

CFStringRef#stringValue fails if string consists entirely of 3-byte UTF characters oshi/oshi#1611

Closed

CFStringRef#stringValue buffer needs space for null byte #1343

Merged

dbwiddis closed this as completed in #1343 Apr 22, 2021

dbwiddis reopened this Apr 25, 2021

dbwiddis mentioned this issue Apr 25, 2021

CFStringRef#stringValue buffer needs space for 4 UTF8 bytes #1345

Merged

dbwiddis closed this as completed in #1345 Apr 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null #1342

CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null #1342

dbwiddis commented Apr 21, 2021 •

edited

Loading

dbwiddis commented Apr 22, 2021

dbwiddis commented Apr 25, 2021 •

edited

Loading

dbwiddis commented Apr 25, 2021

CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null #1342

CoreFoundation's CFStringRef#stringValue doesn't add space for terminating null #1342

Comments

dbwiddis commented Apr 21, 2021 • edited Loading

dbwiddis commented Apr 22, 2021

dbwiddis commented Apr 25, 2021 • edited Loading

dbwiddis commented Apr 25, 2021

dbwiddis commented Apr 21, 2021 •

edited

Loading

dbwiddis commented Apr 25, 2021 •

edited

Loading