[strict provenance] Fix System APIs That Are Liars #95496

Gankra · 2022-03-30T19:25:57Z

This issue is part of the Strict Provenance Experiment - #95228

Some system APIs (and C FFI in general) like to lie about whether things are pointers, and this makes strict-provenance very sad.

prctl for PR_SET_NAME (says a pointer is unsigned long):

rust/library/std/src/sys/unix/thread.rs

Lines 120 to 127 in 44628f7

    
           pub fn set_name(name: &CStr) { 
        
               const PR_SET_NAME: libc::c_int = 15; 
        
               // pthread wrapper only appeared in glibc 2.12, so we use syscall 
        
               // directly. 
        
               unsafe { 
        
                   libc::prctl(PR_SET_NAME, name.as_ptr() as libc::c_ulong, 0, 0, 0); 
        
               } 
        
           }

clone3(?) (says a pointer is u64):

rust/library/std/src/sys/unix/process/process_unix.rs

Lines 185 to 198 in 44628f7

    
           if want_clone3_pidfd && HAS_CLONE3.load(Ordering::Relaxed) { 
        
               let mut args = clone_args { 
        
                   flags: CLONE_PIDFD, 
        
                   pidfd: &mut pidfd as *mut pid_t as u64, 
        
                   child_tid: 0, 
        
                   parent_tid: 0, 
        
                   exit_signal: libc::SIGCHLD as u64, 
        
                   stack: 0, 
        
                   stack_size: 0, 
        
                   tls: 0, 
        
                   set_tid: 0, 
        
                   set_tid_size: 0, 
        
                   cgroup: 0, 
        
               };

sigaction (defines sighandler_t (a callback) to be size_t):

rust/library/std/src/sys/unix/stack_overflow.rs

Line 130 in 44628f7

action.sa_sigaction = signal_handler as sighandler_t;

The "level 1" fix for this is to just change our extern decls so that all of these APIs/types actually say "this is a pointer". In general it's ok for integers to pretend to be pointers "for fun", and if anything is ever int | ptr the valid union of these types is ptr.

The "level 2" fix for this is to instead come up with some general mechanism for dealing with these kinds that folks can use in the larger ecosystem, like a "this is lies" annotation or uptr. This kind of thing is especially nasty for people who are "bindgen true believers" and don't want to ever hand-edit generated bindings (so, not us).

The text was updated successfully, but these errors were encountered:

Gankra · 2022-03-30T19:29:11Z

Note: there's probably way more of these but I only audited x64 Windows and x64 gnu-linux. Once we have lints for this stuff we should encourage folks who Do That Platform to run them and find what's up with those platforms (and ideally fix them).

cuviper · 2022-03-30T22:03:41Z

Does provenance matter across the user/kernel boundary? (Although most calls will pass via libc before the real syscall.)

These all seem to be ptr2int, and not int2ptr, which seems less bad, right? Then C can deal with provenance itself. If you then involve cross-lang LTO or something, I don't see how it makes a difference if we lie about the int/ptr API on our end.

Gankra · 2022-03-30T22:18:54Z

provenance matters up until that point. basically, we just need to make sure that the compiler understands that a "real pointer" is coming from FFI or a "real pointer" is leaking into FFI.

That means ptr2int is also concerning because "under" strict-provenance the compiler wouldn't treat this as "exposing" the pointer and might think it's unaliased, resulting in miscompilation if the kernel then reads/writes through the pointer. (nowhere near actually doing such an optimization, but we want as much code as possible to conform to those kinds of semantics because then it's bulletproof.)

workingjubilee · 2022-03-30T22:44:30Z

These all seem to be ptr2int, and not int2ptr, which seems less bad, right? Then C can deal with provenance itself. If you then involve cross-lang LTO or something, I don't see how it makes a difference if we lie about the int/ptr API on our end.

My understanding at the current moment is that if we have our own house in order, according to the Rust model, then all Rust code could be compiled and have LTO done according to the rules of Rust, without looking at the C code, and be correct. Thus all cross-lang link-time optimizations could happen after and as long as they are done according to the understood rules the cross-lang optimizer must abide by, they would be correct. If we do not, then a cross-lang optimizer could not defer the work and thus the ordering of optimizations would matter for not just performance but soundness, and likely have to be pervasive.

RalfJung · 2022-03-31T02:18:53Z

Another Provenance Crime API is the one for setting a thread name: rust-lang/miri#1717.

These all seem to be ptr2int, and not int2ptr, which seems less bad, right? Then C can deal with provenance itself. If you then involve cross-lang LTO or something, I don't see how it makes a difference if we lie about the int/ptr API on our end.

Basically, if for a given provenance P, we never give away a pointer with provenance P to anything else, then under strict provenance the compiler may assume that nothing reads or write with that provenance. Giving away merely the address does not "expose" the memory to third-party accesses, because an address is insufficient to read or write memory -- you need the right provenance, too.

So, in the case of setting the thread name, the compiler may assume that nothing ever actually reads the name of the thread. Obviously that doesn't work. That's why having proper provenance all the way until the edge of the FFI boundary is important. (What matters on the other side is irrelevant, of course.)

Another way to think about this is: the compiler does not know what the other side of the FFI boundary does, but whatever it does must be rationalizable inside the Rust Abstract Machine. No Rust function could possibly access private Rust memory of which it just learned the address (but not the provenance), and hence the compiler may assume that when calling a function and giving it just the address of that memory, this function cannot access that memory.

arichardson · 2022-03-31T08:42:39Z

Does provenance matter across the user/kernel boundary? (Although most calls will pass via libc before the real syscall.)

These all seem to be ptr2int, and not int2ptr, which seems less bad, right? Then C can deal with provenance itself. If you then involve cross-lang LTO or something, I don't see how it makes a difference if we lie about the int/ptr API on our end.

It absolutely matters for CHERI-enabled architectures, any uintptr_t must be passed as a pointer (or rather using the uptr type), otherwise the bounds, permissions and validity are lost which results in the kernel returning an error such as EFAULT/EINVAL. It was even worse for the now pretty much deprecated CHERI-MIPS where integers and pointers (capabilities) are passed in distinct register files (like m68k), so if you get the types wrong the callee will read from an incorrect register and you'd get completely unpredicatable results instead of missing metadata.

I know that many Linux APIs like to use long/u64 rather than the correct (u)intptr_t. For CHERI those have to be modified to actually use uintptr_t at which point the bindings should be generated correctly.

thalesfragoso · 2022-05-19T21:06:44Z

Giving away merely the address does not "expose" the memory to third-party accesses, because an address is insufficient to read or write memory -- you need the right provenance, too.

@RalfJung I take that this also applies to DMA configuration through MMIO (bare-metal rust), right ?

Currently, we generate the MMIO interface through (mostly) vendor provided SVD files, where we usually get just the width of the registers, so we always use unsigned integers. It seems that we would need to manually patch the DMA address registers to have a pointer type. Should we use *mut () ? Is it okay to derive a *mut from an immutable reference if you just read from it ?

Here is an example: https://docs.rs/stm32f4/0.14.0/stm32f4/stm32f401/dma2/struct.ST.html#structfield.m0ar

RalfJung · 2022-05-20T08:06:57Z

Uh, DMA/MMIO touch on many aspects that are quite hard to catch formally.^^

But, generally speaking: if the compiler just sees ptr.addr(), it can still do the full suite of optimizations on ptr, treating it as "not exposed to other code". So that is probably not what you want with MMIO. You can use ptr.expose_addr() to give the integer address in the ptr away in such a way that the other side can (conceptually) turn it back into a pointer and use that to access memory.

If you want to not use expose_addr, you have to "do the transfer at pointer type". I have no idea what that means for MMIO, but basically it has to be pointer types all the way on the Rust side; if there is ever a usize, you need expose_addr.

RalfJung · 2022-05-20T08:08:44Z

Is it okay to derive a *mut from an immutable reference if you just read from it ?

Yes. When you convert a reference to a raw ptr, the mutability of the ptr matters: x as *const _ produces a read-only pointer, x as *mut _ can be used for mutation (if x is a reference). Correspondingly, if x is a shared reference, x as *mut _ is rejected by the compiler. But once you have a raw ptr, casting back and forth between const and mut is inconsequential: x as *const _ as *mut _ is fine as long as you only read from it.

Lokathor · 2022-05-20T12:56:12Z

Note that for DMA specifically, the llvm docs for volatile basically disallow you to use DMA with a volatile access if any non-volatile memory is involved in the DMA (in other words: if any rust memory is read/written).

So for DMA you want to use inline asm.

thalesfragoso · 2022-05-22T17:43:54Z

@RalfJung We define the MMIO "registers" as structs, and then we just cast a fixed usize to a ptr to the struct. Yes, this violates strict provenance, but it's already listed on things to consider and I don't think it will cause many problems, since the address is outside of the space managed by the compiler and we always use volatile operations.

So we just need to mark the type in the struct as *mut () and use ptr::write_volatile to write the pointer directly.

@Lokathor What do you mean by DMA with a volatile access ? We don't usually do volatile operations on the DMA buffer, just write to the register and add fences for synchronization, isn't it enough ? What's the difference between that and e.g. passing pointers to the kernel to do io_uring ?

bjorn3 · 2022-05-22T17:56:41Z

The write to the device register would have to be volatile. Otherwise I believe the compiler is allowed to move the write, load it again after the write or write twice. Volatile exists precisely for device register accesses (and signal handlers)

RalfJung · 2022-05-22T19:04:49Z

So we just need to mark the type in the struct as *mut () and use ptr::write_volatile to write the pointer directly.

So the pointer has type *mut *mut ()? Yeah that makes sense.

Lokathor · 2022-05-22T19:16:37Z

@Lokathor What do you mean by DMA with a volatile access ? We don't usually do volatile operations on the DMA buffer, just write to the register and add fences for synchronization, isn't it enough ? What's the difference between that and e.g. passing pointers to the kernel to do io_uring ?

Well i'm just gonna quote part of the docs:

A volatile load or store may have additional target-specific semantics. Any volatile operation can have side effects, and any volatile operation can read and/or modify state which is not accessible via a regular load or store in this module. Volatile operations may use addresses which do not point to memory (like MMIO registers). This means the compiler may not use a volatile operation to prove a non-volatile access to that address has defined behavior.

The allowed side-effects for volatile accesses are limited. If a non-volatile store to a given address would be legal, a volatile operation may modify the memory at that address. A volatile operation may not modify any other memory accessible by the module being compiled. A volatile operation may not call any code in the current module.

I read "A volatile operation may not modify any other memory accessible by the module being compiled." as being that volatile can't modify any normal memory your program can see, including between copying between buffers in your normal program. Maybe I'm being too paranoid.

bjorn3 · 2022-05-22T19:50:03Z

The DMA buffers wouldn't need volatile, just a synchronization fence or something similar. The device register write initiating the DMA transfer (and any other device register access) would need volatile.

RalfJung · 2022-05-22T19:58:03Z

I read "A volatile operation may not modify any other memory accessible by the module being compiled." as being that volatile can't modify any normal memory your program can see, including between copying between buffers in your normal program. Maybe I'm being too paranoid.

The volatile operation will not modify any other memory. It can of course modify the memory behind ptr.

All this sentence does is let the compiler reorder volatile and non-volatile operations.

thalesfragoso · 2022-05-22T20:03:48Z

The write to the device register would have to be volatile.

Yes, the write to the register must be volatile, what I meant is that we will use non-volatile operations on the destination buffer for the DMA (after DMA is done).

I read "A volatile operation may not modify any other memory accessible by the module being compiled." as being that volatile can't modify any normal memory your program can see, including between copying between buffers in your normal program. Maybe I'm being too paranoid.

I think that this is just limiting the side-effects of volatile operations, as in, a volatile write to memory address x can't cause a modification on memory address y, which is indeed the case for DMA. However, what "protects" us from miscompilation regarding the destination buffer is the fact that the pointer escaped, not the volatile write.

The problem is that what constitutes a pointer escape seems to be fuzzy. What I have seen is that it's usually done by writing/exposing the pointer to a place "outside of the compiled module", plus proper synchronization, e.g. fences.

This function is available on Linux since glibc 2.12, musl 1.1.16, and uClibc 1.0.20. The main advantage over `prctl` is that it properly represents the pointer argument, rather than a multi-purpose `long`, so we're better representing strict provenance (rust-lang#95496).

…ulacrum linux: Use `pthread_setname_np` instead of `prctl` This function is available on Linux since glibc 2.12, musl 1.1.16, and uClibc 1.0.20. The main advantage over `prctl` is that it properly represents the pointer argument, rather than a multi-purpose `long`, so we're better representing strict provenance (rust-lang#95496).

safinaskar · 2023-04-05T18:59:57Z

@Gankra , I have good example of liar system API. It is https://docs.rs/libc/latest/libc/fn.readdir.html with https://docs.rs/libc/latest/libc/struct.dirent.html . Struct dirent is declared with field pub d_name: [c_char; 256], but this is lie, because actual file name can be more than 256 bytes. See my bug report here: rust-lang/libc#2669

cuviper · 2023-04-05T19:10:05Z

We have also dealt with that liar in the standard library -- e.g. #103137 and its history:

This code got added in #93459 and tweaked in #94272 and #94750.

asquared31415 · 2023-04-05T22:36:44Z

From https://man7.org/linux/man-pages/man3/readdir.3.html

       The dirent structure definition shown above is taken from the
       glibc headers, and shows the d_name field with a fixed size.

       Warning: applications should avoid any dependence on the size of
       the d_name field.  POSIX defines it as char d_name[], a character
       array of unspecified size, with at most NAME_MAX characters
       preceding the terminating null byte ('\0').

(as an additional note: not all platforms enforce NAME_MAX, this field should not be treated as having any fixed size on any platform, even using NAME_MAX)

Fixed vita libc definitions This fixes definitions that were incorrect in my initial PR (#3209). As with the previous one, this relies on open-source newlib implementation and doesn't use Sony internal API's or expose Sony internal data structures. This fixes: - fs API that uses `dirent` and stat - sockets - pthreads A couple of explanations - Unfortunately `dirent` in vita newlib is not very POSIX'y, before `d_name` it has a field with an internal data structure, which is of no use for std (no inodes, no file type inside), so I've just stubbed it as an `__offset`. Also, Vita doesn't expose inodes - `dirent` doesn't have it at all, and in `stat` it is always `0`. I am not sure what would be the best approach here. Maybe I should move the POSIX `dirent` to `generic.rs` and reexpose it in all targets, and redefine it in `vita/mod.rs`? - All pthread types on Vita are pointers, and the backing data structure is allocated by corresponding init functions, so their sizeof's are all 4. I also changed `pthread_attr_t` and `pthread_rwlockattr_t` to reflect their sizes and not be constant. May be in relation to rust-lang/rust#95496 it would be better to move existing thread definitions to generic, and for vita specifically make them pointer types instead of byte arrays. The fixes in std will be in rust-lang/rust#111819

Gankra added A-strict-provenance Area: Strict provenance for raw pointers T-libs Relevant to the library team, which will review and decide on the PR/issue. O-unix Operating system: Unix-like labels Mar 30, 2022

workingjubilee mentioned this issue Mar 30, 2022

Increase the minimum linux-gnu versions #95026

Merged

RalfJung mentioned this issue Jun 9, 2022

One-way address exposure for FFI Gankra/sptr#11

Closed

Gankra mentioned this issue Aug 2, 2022

RFC: Improve C types for cross-language LLVM CFI support rust-lang/rfcs#3296

Open

cuviper mentioned this issue Aug 8, 2022

linux: Use pthread_setname_np instead of prctl #100287

Merged

thomcc mentioned this issue Sep 6, 2022

Incorrect pthread_t definition on Darwin (should be a raw pointer, not an uintptr_t) rust-lang/libc#2903

Open

nikarh mentioned this issue May 26, 2023

Fixed vita libc definitions rust-lang/libc#3255

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[strict provenance] Fix System APIs That Are Liars #95496

[strict provenance] Fix System APIs That Are Liars #95496

Gankra commented Mar 30, 2022 •

edited

Loading

Gankra commented Mar 30, 2022 •

edited

Loading

cuviper commented Mar 30, 2022

Gankra commented Mar 30, 2022

workingjubilee commented Mar 30, 2022

RalfJung commented Mar 31, 2022 •

edited

Loading

arichardson commented Mar 31, 2022

thalesfragoso commented May 19, 2022

RalfJung commented May 20, 2022 •

edited

Loading

RalfJung commented May 20, 2022 •

edited

Loading

Lokathor commented May 20, 2022 •

edited

Loading

thalesfragoso commented May 22, 2022

bjorn3 commented May 22, 2022

RalfJung commented May 22, 2022

Lokathor commented May 22, 2022

bjorn3 commented May 22, 2022

RalfJung commented May 22, 2022

thalesfragoso commented May 22, 2022

safinaskar commented Apr 5, 2023

cuviper commented Apr 5, 2023

asquared31415 commented Apr 5, 2023

[strict provenance] Fix System APIs That Are Liars #95496

[strict provenance] Fix System APIs That Are Liars #95496

Comments

Gankra commented Mar 30, 2022 • edited Loading

Gankra commented Mar 30, 2022 • edited Loading

cuviper commented Mar 30, 2022

Gankra commented Mar 30, 2022

workingjubilee commented Mar 30, 2022

RalfJung commented Mar 31, 2022 • edited Loading

arichardson commented Mar 31, 2022

thalesfragoso commented May 19, 2022

RalfJung commented May 20, 2022 • edited Loading

RalfJung commented May 20, 2022 • edited Loading

Lokathor commented May 20, 2022 • edited Loading

thalesfragoso commented May 22, 2022

bjorn3 commented May 22, 2022

RalfJung commented May 22, 2022

Lokathor commented May 22, 2022

bjorn3 commented May 22, 2022

RalfJung commented May 22, 2022

thalesfragoso commented May 22, 2022

safinaskar commented Apr 5, 2023

cuviper commented Apr 5, 2023

asquared31415 commented Apr 5, 2023

Gankra commented Mar 30, 2022 •

edited

Loading

Gankra commented Mar 30, 2022 •

edited

Loading

RalfJung commented Mar 31, 2022 •

edited

Loading

RalfJung commented May 20, 2022 •

edited

Loading

RalfJung commented May 20, 2022 •

edited

Loading

Lokathor commented May 20, 2022 •

edited

Loading