STM32: fix SPI write with data >8 Bit #15115

JojoS62 · 2021-09-28T08:41:02Z

Summary of changes

the low level SPI write calls msp_write_data and passes data in single bytes. Sending data formats with more than 8 Bit are sending only 8 Bit with leading zeros for larger data.
This PR passes the data as pointer to make the type casting depending on the bitshift value working.

Impact of changes

fix sending 16 Bit Problem as in #15113

Migration actions required

Documentation

Pull request type

[x] Patch update (Bug fix / Target update / Docs update / Test update / Refactor)
[] Feature update (New feature / Functionality change / New API)
[] Major update (Breaking change E.g. Return code change / API behaviour change)

Test results

urgent fix

[] No Tests required for this change (E.g docs only update)
[] Covered by existing mbed-os tests (Greentea or Unittest)
[] Tests / results supplied as part of this PR

Reviewers

jeromecoutant · 2021-09-28T08:45:00Z

@vznncv
@LMESTM

ciarmcom · 2021-09-28T09:00:17Z

@JojoS62, thank you for your changes.
@ARMmbed/mbed-os-maintainers please review.

0xc0170 · 2021-09-30T09:34:10Z

CI started

vznncv · 2021-09-30T09:43:04Z

Hi @JojoS62

When I implement SPI 3-Wire changes, I followed existing implementation of 4-wire logic:

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1196 to 1210 in bc01a4e

    
           int spi_master_block_write(spi_t *obj, const char *tx_buffer, int tx_length, 
        
                                      char *rx_buffer, int rx_length, char write_fill) 
        
           { 
        
               struct spi_s *spiobj = SPI_S(obj); 
        
               SPI_HandleTypeDef *handle = &(spiobj->handle); 
        
               int total = (tx_length > rx_length) ? tx_length : rx_length; 
        
               if (handle->Init.Direction == SPI_DIRECTION_2LINES) { 
        
                   for (int i = 0; i < total; i++) { 
        
                       char out = (i < tx_length) ? tx_buffer[i] : write_fill; 
        
                       char in = spi_master_write(obj, out); 
        
                       if (i < rx_length) { 
        
                           rx_buffer[i] = in; 
        
                       } 
        
                   } 
        
               } else {

So I think that the following places should be fixed:

4-wire logic:

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1203 to 1209 in bc01a4e

    
           for (int i = 0; i < total; i++) { 
        
               char out = (i < tx_length) ? tx_buffer[i] : write_fill; 
        
               char in = spi_master_write(obj, out); 
        
               if (i < rx_length) { 
        
                   rx_buffer[i] = in; 
        
               } 
        
           }

3-wire write logic:

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1077 to 1080 in bc01a4e

    
           for (int i = 0; i < tx_length; i++) { 
        
               msp_wait_writable(obj); 
        
               msp_write_data(obj, tx_buffer[i], bitshift); 
        
           }

3-wire read logic:

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1106 to 1109 in bc01a4e

    
           for (int i = 0; i < rx_length; i++) { 
        
               msp_wait_readable(obj); 
        
               rx_buffer[i] = msp_read_data(obj, bitshift); 
        
           }

Asynchronous API logic (spi_master_transfer) should be correct since it uses STM HAL library that processes 16-bit mode correctly.

Additionally I have question about SPI interface functions spi_transfer and spi_master_block_write. According documentation tx_length and rx_length are number of bytes to write/read:

mbed-os/hal/include/hal/spi_api.h

Lines 237 to 253 in bc01a4e

    
           /** Write a block out in master mode and receive a value 
        
            * 
        
            *  The total number of bytes sent and received will be the maximum of 
        
            *  tx_length and rx_length. The bytes written will be padded with the 
        
            *  value 0xff. 
        
            * 
        
            * @param[in] obj        The SPI peripheral to use for sending 
        
            * @param[in] tx_buffer  Pointer to the byte-array of data to write to the device 
        
            * @param[in] tx_length  Number of bytes to write, may be zero 
        
            * @param[in] rx_buffer  Pointer to the byte-array of data to read from the device 
        
            * @param[in] rx_length  Number of bytes to read, may be zero 
        
            * @param[in] write_fill Default data transmitted while performing a read 
        
            * @returns 
        
            *      The number of bytes written and read from the device. This is 
        
            *      maximum of tx_length and rx_length. 
        
            */ 
        
           int spi_master_block_write(spi_t *obj, const char *tx_buffer, int tx_length, char *rx_buffer, int rx_length, char write_fill);

mbed-os/hal/include/hal/spi_api.h

Lines 372 to 384 in bc01a4e

    
           /** Begin the SPI transfer. Buffer pointers and lengths are specified in tx_buff and rx_buff 
        
            * 
        
            * @param[in] obj       The SPI object that holds the transfer information 
        
            * @param[in] tx        The transmit buffer 
        
            * @param[in] tx_length The number of bytes to transmit 
        
            * @param[in] rx        The receive buffer 
        
            * @param[in] rx_length The number of bytes to receive 
        
            * @param[in] bit_width The bit width of buffer words 
        
            * @param[in] event     The logical OR of events to be registered 
        
            * @param[in] handler   SPI interrupt handler 
        
            * @param[in] hint      A suggestion for how to use DMA with this transfer 
        
            */ 
        
           void spi_master_transfer(spi_t *obj, const void *tx, size_t tx_length, void *rx, size_t rx_length, uint8_t bit_width, uint32_t handler, uint32_t event, DMAUsage hint);

But existing asynchronous API implementation (spi_master_transfer) and suggested changes interpreter them as total SPI frame number.

@LMESTM, @0xc0170 Which definition is correct? Are tx_length and rx_length size of corresponding buffers in bytes or number of SPI frames?

mbed-ci · 2021-09-30T10:00:39Z

Jenkins CI Test : ✔️ SUCCESS

Build Number: 1 | 🔒 Jenkins CI Job | 🌐 Logs & Artifacts

CLICK for Detailed Summary

jobs	Status
jenkins-ci/mbed-os-ci_unittests	✔️
jenkins-ci/mbed-os-ci_build-cloud-example-ARM	✔️
jenkins-ci/mbed-os-ci_cmake-cloud-example-ARM	✔️
jenkins-ci/mbed-os-ci_cmake-cloud-example-GCC_ARM	✔️
jenkins-ci/mbed-os-ci_build-cloud-example-GCC_ARM	✔️
jenkins-ci/mbed-os-ci_build-greentea-GCC_ARM	✔️
jenkins-ci/mbed-os-ci_build-greentea-ARM	✔️
jenkins-ci/mbed-os-ci_build-example-ARM	✔️
jenkins-ci/mbed-os-ci_cmake-example-ARM	✔️
jenkins-ci/mbed-os-ci_cmake-example-GCC_ARM	✔️
jenkins-ci/mbed-os-ci_build-example-GCC_ARM	✔️
jenkins-ci/mbed-os-ci_greentea-test	✔️

0xc0170 · 2021-09-30T10:50:32Z

Question about the buffer sizes. everything is defined in bytes.

JojoS62 · 2021-09-30T11:11:03Z

yes, the class documentation says also bytes. But a 16 Bit operation can not send 1 byte, so how to handle this case? The buffer may also be to small and the type cast can crash.
I have used and checked only the 3-wire case, but yes, also the 4-wire spi_master_block_write() looks wrong. It takes bytes and sends them as larger words.
I have checked the grandpa implementation for LPC1768, its the same there in

mbed-os/targets/TARGET_NXP/TARGET_LPC176X/spi_api.c

Line 199 in bc01a4e

char out = (i < tx_length) ? tx_buffer[i] : write_fill;

So it seems blockwrites with >8 bit are not working correctly for some longer time. But 16 bit writes are important e.g. for data transfer to a display, otherwise you have to fiddle with byte swapping for a large amount of data.

I'm also confused by

mbed-os/hal/include/hal/spi_api.h

Line 111 in bc01a4e

    
            * * ::spi_master_block_write writes `tx_length` words to the bus - Verified by ::fpga_spi_test_common_no_ss and ::fpga_spi_test_common

this says words, what I experienced also as the working behavior in <6.15.0

0xc0170 · 2021-09-30T13:03:24Z

yes, the class documentation says also bytes. But a 16 Bit operation can not send 1 byte, so how to handle this case? The buffer may also be to small and the type cast can crash.

My understanding is that it's user's responsibility to use format/write properly (. You set up the format and then provide buffer (size in bytes, but should be multiple of format specified bits).

JojoS62 · 2021-09-30T13:28:06Z

I have checked again the previous behaviour (mbed 6.14.0), it was calling HAL_SPI_Transmit which gets a pointer and a size for datawords, so with 16 bit setting it was sending size * 16 bit.
This is what I understand also from spi_hal.h defined behaviour, in this case the documentation does not fit.

same topics:
#12448
#10399

vznncv · 2021-10-02T20:06:09Z

@0xc0170

Question about the buffer sizes. everything is defined in bytes.
My understanding is that it's user's responsibility to use format/write properly (. You set up the format and then provide buffer (size in bytes, but should be multiple of format specified bits).

Do I understand correctly that tx_buffer, tx_length, rx_buffer, rx_length in the functions spi_master_block_write and spi_master_transfer should be interpreted in the following ways:

8-bit SPI mode:

const uint8_t *tx_frames = (const uint8_t *)tx_buffer;
int tx_frames_number = tx_length;
uint8_t *rx_frames = (uint8_t *)rx_buffer;
int rx_frames_number = rx_length;

// transmit/receive 8-bit frames ...

16-bit SPI mode:

const uint16_t *tx_frames = (const uint16_t *)tx_buffer;
MBED_ASSERT(tx_length % 2 == 0);
int tx_frames_number = tx_length / 2;
uint16_t *rx_frames = (uint16_t *)rx_buffer;
MBED_ASSERT(rx_length % 2 == 0);
int rx_frames_number = rx_length / 2;

// transmit/receive 16-bit frames ...

32-bit SPI mode:

const uint32_t *tx_frames = (const uint32_t *)tx_buffer;
MBED_ASSERT(tx_length % 4 == 0);
int tx_frames_number = tx_length / 4;
uint32_t *rx_frames = (uint32_t *)rx_buffer;
MBED_ASSERT(rx_length % 4 == 0);
int rx_frames_number = rx_length / 4;

// transmit/receive 32-bit frames ...

note: currently only spi_master_transfer has such behavior:

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1335 to 1337 in 4587080


	words = length >> bitshift;

mbed-os/targets/TARGET_STM/stm_spi_api.c

Lines 1353 to 1364 in 4587080

    
           case SPI_TRANSFER_TYPE_TXRX: 
        
               rc = HAL_SPI_TransmitReceive_IT(handle, (uint8_t *)tx, (uint8_t *)rx, words); 
        
               break; 
        
           case SPI_TRANSFER_TYPE_TX: 
        
               rc = HAL_SPI_Transmit_IT(handle, (uint8_t *)tx, words); 
        
               break; 
        
           case SPI_TRANSFER_TYPE_RX: 
        
               // the receive function also "transmits" the receive buffer so in order 
        
               // to guarantee that 0xff is on the line, we explicitly memset it here 
        
               memset(rx, SPI_FILL_CHAR, length); 
        
               rc = HAL_SPI_Receive_IT(handle, (uint8_t *)rx, words); 
        
               break;

Mbed OS has SPI tests for 16/32 bit mode (hal/tests/TESTS/mbed_hal_fpga_ci_test_shield/spi/main.cpp). Are they run during CI/CD? (although it seems that a bug with a word receiving compensates a bug with word transmitting, so the test doesn't fail).
Extra question about fill_char. spi_master_block_write has an extra argument char write_fill.
What is expected fill_char behavior in 16/32 bit mode?

JojoS62 · 2021-10-03T14:38:56Z

I have no preference about byte or framelength, it should just be the same for all targets and clearly documented in the API reference. There are some more implementations that use the template from LPC1768 I guess, and all are sending only a byte in larger frames. Sending blocks of data was a later extension, but of course it makes sense e.g. for sending display data fast.
The FPGA test is interesting, didn't knew that before.
the fill char should have the same size as the frames for reading / writing. It could be promoted to a larger type, but the SPI should be fast and not try to catch every possible combination.

edit to 1)
but I tend to use size in frames instead of bytes. It makes the asserts unnecessary and reduces the extra conversions. When I want to send 16 bit frame blocks, I usually have the number of 16 bit items. Compatibility cannot be an issue, the function cannot have worked for a long time. I wonder why this was not reported before, except the few issues that I mentioned already.

jeromecoutant · 2021-10-04T08:36:52Z

2. Mbed OS has SPI tests for 16/32 bit mode (`hal/tests/TESTS/mbed_hal_fpga_ci_test_shield/spi/main.cpp`). Are they run during CI/CD? (although it seems that a bug with a word receiving compensates a bug with word transmitting, so the test doesn't fail).

For example, in the above Jenkins CI Test comment, if you go in the Logs & Artifacts link,
you can see that 4 targets are with FPGA

jeromecoutant · 2021-10-06T14:39:22Z

So let's merge ? or it needs update ?

0xc0170 · 2021-10-06T15:29:42Z

If no objections, I'll merge this tomorrow.

JojoS62 · 2021-10-06T16:21:14Z

this PR makes the SPI working like before in mbed-os-6.14, I would appreciate this.
The point is only, that this does not comply with the documentation. This can be another issue, to check also other targets and the FPGA test.

vznncv · 2021-10-06T19:43:25Z

this PR makes the SPI working like before in mbed-os-6.14,

No, data reading isn't fixed.

this PR makes the SPI working like before in mbed-os-6.14, I would appreciate this.

I don't like idea of "bug" compatibility fixes.

If you run the following code with mbed-os-6.14:

#include "mbed.h"

static DigitalOut user_led(LED1, 1);

int main()
{
    PinName mosi = PB_5;
    PinName miso = PB_4;
    PinName sclk = PB_3;
    PinName ssel = PB_6;

    DigitalOut ssel_out(ssel, 1);

    int count = 0;

    size_t tx_word_len;
    size_t rx_word_len;
    const uint16_t tx_data[8] = {0x1122, 0x3344}; // allocate extra memory to prevent "out of range" error
    uint16_t rx_data[8] = {0};

    while (true) {
        user_led = 1;
        printf("Demo run %i\n", count);

        // 3-wire demo
        // note: don't read data in 3-wire mode to get the same 4-wire results
        tx_word_len = 2;
        rx_word_len = 0;
        {
            SPI spi(mosi, NC, sclk);
            spi.format(16);

            ssel_out = 0;

            // transfer by 1 word per SPI::write call
            for (size_t i = 0; i < tx_word_len; i++) {
                spi.write(tx_data[i]);
            }
            wait_us(64);
            // transfer by bulk SPI::write call
            spi.write((const char *)tx_data, tx_word_len * sizeof(*tx_data),
                      (char *)rx_data, rx_word_len * sizeof(*rx_data));
            wait_us(64);
            // asynchronous transfer
            spi.transfer((const char *)tx_data, tx_word_len * sizeof(*tx_data),
                         (char *)rx_data, rx_word_len * sizeof(*rx_data), nullptr);
            wait_us(64);

            ssel_out = 1;
        }

        // 4-wire demo
        tx_word_len = 2;
        rx_word_len = 2;
        {
            SPI spi(mosi, miso, sclk);
            spi.format(16);

            ssel_out = 0;

            // transfer by 1 word per SPI::write call
            for (size_t i = 0; i < tx_word_len; i++) {
                spi.write(tx_data[i]);
            }
            wait_us(64);
            // transfer by bulk SPI::write call
            spi.write((const char *)tx_data, tx_word_len * sizeof(*tx_data),
                      (char *)rx_data, rx_word_len * sizeof(*rx_data));
            wait_us(64);
            // asynchronous transfer
            spi.transfer((const char *)tx_data, tx_word_len * sizeof(*tx_data),
                         (char *)rx_data, rx_word_len * sizeof(*rx_data), nullptr);
            wait_us(64);

            ssel_out = 1;
        }


        ThisThread::sleep_for(100ms);
        user_led = 0;
        ThisThread::sleep_for(1900ms);
        count++;
    }

    return 0;
}

You will see the following SPI results:

3-wire mode

4-wire mode

I.e actual int SPI::write(const char *tx_buffer, int tx_length, char *rx_buffer, int rx_length) behavior differs from SPI::transfer and depends on SPI mode (3 wire or 4 wire). It isn't correct. I think that all SPI methods should handle data in 16 bit mode in the same way (i.e. result should be the same)

My understanding is that it's user's responsibility to use format/write properly (. You set up the format and then provide buffer (size in bytes, but should be multiple of format specified bits).

I have no preference about byte or framelength, it should just be the same for all targets and clearly documented in the API reference

Since poring guide (https://os.mbed.com/docs/mbed-os/v6.15/porting/spi-port.html) doesn't provide clear information about 16-bit mode implementation, I'd suggest to adhere to SPI class documentation where tx_length and rx_length are explicitly called as bytes:

mbed-os/drivers/include/drivers/SPI.h

Lines 195 to 209 in 9dd6fb4

    
               /** Write to the SPI Slave and obtain the response. 
        
                * 
        
                *  The total number of bytes sent and received will be the maximum of 
        
                *  tx_length and rx_length. The bytes written will be padded with the 
        
                *  value 0xff. 
        
                * 
        
                *  @param tx_buffer Pointer to the byte-array of data to write to the device. 
        
                *  @param tx_length Number of bytes to write, may be zero. 
        
                *  @param rx_buffer Pointer to the byte-array of data to read from the device. 
        
                *  @param rx_length Number of bytes to read, may be zero. 
        
                *  @return 
        
                *      The number of bytes written and read from the device. This is 
        
                *      maximum of tx_length and rx_length. 
        
                */ 
        
               virtual int write(const char *tx_buffer, int tx_length, char *rx_buffer, int rx_length);

mbed-os/drivers/include/drivers/SPI.h

Lines 241 to 266 in 9dd6fb4

    
               /** Start non-blocking SPI transfer using 8bit buffers. 
        
                * 
        
                * This function locks the deep sleep until any event has occurred. 
        
                * 
        
                * @param tx_buffer The TX buffer with data to be transferred. If NULL is passed, 
        
                *                  the default SPI value is sent. 
        
                * @param tx_length The length of TX buffer in bytes. 
        
                * @param rx_buffer The RX buffer which is used for received data. If NULL is passed, 
        
                *                  received data are ignored. 
        
                * @param rx_length The length of RX buffer in bytes. 
        
                * @param callback  The event callback function. 
        
                * @param event     The event mask of events to modify. @see spi_api.h for SPI events. 
        
                * 
        
                * @return Operation result. 
        
                * @retval 0 If the transfer has started. 
        
                * @retval -1 If SPI peripheral is busy. 
        
                */ 
        
               template<typename Type> 
        
               int transfer(const Type *tx_buffer, int tx_length, Type *rx_buffer, int rx_length, const event_callback_t &callback, int event = SPI_EVENT_COMPLETE) 
        
               { 
        
                   if (spi_active(&_peripheral->spi)) { 
        
                       return queue_transfer(tx_buffer, tx_length, rx_buffer, rx_length, sizeof(Type) * 8, callback, event); 
        
                   } 
        
                   start_transfer(tx_buffer, tx_length, rx_buffer, rx_length, sizeof(Type) * 8, callback, event); 
        
                   return 0; 
        
               }

Such behavior may be implemented with changes like the followings in the stm_spi_api.c:

...
static inline int spi_get_word_from_buffer(const void *buffer, int bitshift)
{
    if (bitshift == 1) {
        return *((uint16_t *)buffer);
#ifdef HAS_32BIT_SPI_TRANSFERS
    } else if (bitshift == 2) {
        return *((uint32_t *)buffer);
#endif /* HAS_32BIT_SPI_TRANSFERS */
    } else {
        return *((uint8_t *)buffer);
    }
}

static inline void spi_put_word_to_buffer(void *buffer, int bitshift, int data)
{
    if (bitshift == 1) {
        *((uint16_t *)buffer) = data;
#ifdef HAS_32BIT_SPI_TRANSFERS
    } else if (bitshift == 2) {
        *((uint32_t *)buffer) = data;
#endif /* HAS_32BIT_SPI_TRANSFERS */
    } else {
        *((uint8_t *)buffer) = data;
    }
}
...
static int spi_master_one_wire_transfer(spi_t *obj, const char *tx_buffer, int tx_length,
                                        char *rx_buffer, int rx_length) {
    ...
    const int word_size = 0x01 << bitshift;
    ...
       for (int i = 0; i < tx_length; i += word_size) {
            msp_wait_writable(obj);
            msp_write_data(obj, spi_get_word_from_buffer(tx_buffer + i, bitshift), bitshift);
       }
    ...
    // SPI_IP_VERSION_V2
    ...
    /* Receive data */
        for (int i = 0; i < rx_length; i += word_size) {
            msp_wait_readable(obj);
            spi_put_word_to_buffer(rx_buffer + i, bitshift, msp_read_data(obj, bitshift));
        }
    ...
    // SPI_IP_VERSION_V1
    ...
        for (int i = 0; i < rx_length; i += word_size) {
            core_util_critical_section_enter();
            LL_SPI_Enable(SPI_INST(obj));
            /* Wait single SPI clock cycle. */
            wait_ns(baudrate_period_ns);
            LL_SPI_Disable(SPI_INST(obj));
            core_util_critical_section_exit();

            msp_wait_readable(obj);
            spi_put_word_to_buffer(rx_buffer + i, bitshift, msp_read_data(obj, bitshift));
        }
    ...
int spi_master_block_write(spi_t *obj, const char *tx_buffer, int tx_length,
                           char *rx_buffer, int rx_length, char write_fill)
{
    ...
    const int bitshift = datasize_to_transfer_bitshift(handle->Init.DataSize);
    MBED_ASSERT(tx_length >> bitshift << bitshift == tx_length);
    MBED_ASSERT(rx_length >> bitshift << bitshift == rx_length);
    int total = (tx_length > rx_length) ? tx_length : rx_length;
    int write_fill_frame = write_fill;
    for (int i = 0; i < bitshift; i++) {
        write_fill_frame = (write_fill_frame << 8) | write_fill;
    }

    if (handle->Init.Direction == SPI_DIRECTION_2LINES) {
        const int word_size = 0x01 << bitshift;
        for (int i = 0; i < total; i += word_size) {
            int out = (i < tx_length) ? spi_get_word_from_buffer(tx_buffer + i, bitshift) : write_fill_frame;
            int in = spi_master_write(obj, out);
            if (i < rx_length) {
                spi_put_word_to_buffer(rx_buffer + i, bitshift, in);
            }
        }
    } else {
    ...

Such solution has the following advantages/disadvantages:

(+) behavior matches SPI class documentation (I think that main interface class documentation should be primary source of truth of API behavior)
(+) all methods handle data in 16-bit mode in the same ways (SPI::write or SPI::transfer, 4-wire or 3-wire mode)
(-) it doesn't match behavior of current version and mbed-os 6.14 or lower

@JojoS62 @0xc0170

What do you think about it? If it's ok, @JojoS62 please implement "bytes" approach.

JojoS62 · 2021-10-20T16:50:56Z

its still on my todo list, I was struggeling with cmake.
I will use the modifications that @vznncv suggested.
Found also your hidden secrets in https://github.com/ARMmbed/mbed-os/tree/master/targets/TARGET_STM#readme
That is also interesting about performance and a (target specific) DMA implementation.
@vznncv When you have tested already your modifications and you have more time, than you can take over and I'll close my PR, no problem.

Pull request has been modified.

JojoS62 · 2021-10-27T20:03:11Z

its driving me nuts. I have used the test program from @vznncv on a H743. There the known problem was shown, but also the last asynch transfer was completely missing.
Then I used a F407, with different result: 3-wire does not work at all with this mix, 4-wire looks ok exept bulk write.

F407VG / 16 bit

for 8 bit, the result is ok.

jeromecoutant · 2021-10-28T07:18:15Z

its driving me nuts. I have used the test program from @vznncv on a H743. There the known problem was shown, but also the last asynch transfer was completely missing. Then I used a F407, with different result: 3-wire does not work at all with this mix, 4-wire looks ok exept bulk write.

Note that ST SPI IP has changed with STM32H7:
https://github.com/ARMmbed/mbed-os/blob/master/targets/TARGET_STM/TARGET_STM32H7/spi_device.h#L22

JojoS62 · 2021-10-28T08:49:19Z

yes, thanks, I have seen also the different treatment in the source code.
It may also be a problem with contructing and destructing the SPI object (too fast), I haven't checked this yet. For a test, I will focus first now on the F4 and try static SPI instances to avoid possible initialization problems. This should work also.
I see also that my rebase pulled in a lot of commits, I will fix it also.
With F4 and mbed-os-6.14.0, I get the same test result as Konstantin.

JojoS62 · 2021-10-28T21:07:23Z

@vznncv I've tried now your modifications, but I get the same wrong result with mbed-os-6.15, my modification and also your modification. I hope I got it right, its better when you send a git patch or the whole file.

mbed-6.14.0 3-Wire 1 word:

mbed-6.15.0 3-Wire 1 word:

mbed-6.15.0 + modifications:

it is strange, it is sending too many clocks. How can this happen? Can you confirm?
I have tested on a F407.

in spi_master_write. This fixes the problem with too many spi_sck

JojoS62 · 2021-10-28T21:44:00Z

this is now working, but is not using spi_master_one_wire_transfer(), so it still needs to be checked why this is not working.

JojoS62 · 2021-10-29T14:31:44Z

now it looks fine on the F407, also with LL SPI. I added a SPI_1LINE_TX(handle);before enabling SPI, this prevented it to start generating clocks. There is only one glitch in clock, but before setting spi enable.
Now it needs to be tested also on H7, is someone else in for testing?

on F407:

on H743:

JojoS62 · 2021-10-30T11:57:49Z

some points I've found so far:

the driver for F4 is working now
disabling/enabling is expensive and should be avoided in read/write. For the the F4, it is working with a fixed setting
F4: after initializing, a set direction avoids start firing clocks
H7: set direction is locked when SPI is enabled. disable/enable is also expensive here
the testprogram had an error in casting to (uint8_t*) for transfer function. The template evaluates the bit_length to 8 bits instead of 16.
the testprogram is not working yet with transfer async 4-wire.
transfer async 4 wire calls HAL_SPI_TransmitReceive_IT. Validatings are ok, data is written to TXDR, but no data is sent on SPI
transfer async 4 wire also does not work in mbed-os-6.14.0, so before Konstantins changes
when this function is used with HAL_SPI_TransmitReceive instead of HAL_SPI_TransmitReceive_IT, the data is sent properly
when used with 8 Bit format, the async 4 wire is sending
a test program with CubeMX, same settings and same hardware is working with HAL_SPI_TransmitReceive_IT

the problem is definetly the complex enable/disable sequence on the H7. And also the mix of LL and HAL makes it damned hard to handle.
I recommend to switch back to HAL functions, they are safer more compatible between MCU series. For better performance when sending data blocks, the transfer function should be used.

vznncv · 2021-10-31T20:25:05Z

Hi @JojoS62

I'll check your changes with my boards (F411 and H743) in 2-3 day.

Notes:
I see that you have changed 3-Wire SPI logic for optimization purposes: keep SPI enabled with TX during idle. But there are some notes:

according F4 reference manual (and code of HAL library), the SPI should disabled when we need to change transfer direction
you have deleted LL_SPI_SetTransferDirection(SPI_INST(obj), LL_SPI_HALF_DUPLEX_TX); line from transmit part, but I don't see any line that restores TX mode after data receiving.

JojoS62 · 2021-11-01T15:00:28Z

* according F4 reference manual (and code of HAL library), the SPI should disabled when we need to change transfer direction

yes, that will be better. Also the transfer for H7 needs to start in disabled state. I wanted to shorten the gap between two writes, it takes also on the fast H7 a few microsecnds.

I have published the current testcode on github:
https://github.com/JojoS62/testSPI

And played also with higher SPI speeds. When the H7 is used with PLL3P, then a modification in the init code is neccessary, it destroys a previous set PLL3 setting.
But for higher speeds, DMA would be better and that is missing in mbed-os. Because there are so many options, it will be easier to use CubeMX code in a subclassed SPI. Not portable, but I think such features are beyond a versatile OS.

for SPI in H7, there is a valuable resources in AN5543 and www.st.com/content/ccc/resource/training/technical/product_training/group0/52/17/7a/28/2b/90/41/7d/STM32H7-Peripheral-Serial_Peripheral_interface_SPI/files/STM32H7-Peripheral-Serial_Peripheral_interface_SPI.pdf/_jcr_content/translations/en.STM32H7-Peripheral-Serial_Peripheral_interface_SPI.pdf

jeromecoutant · 2021-11-12T08:47:45Z

FYI: I executed SPI non regression basic tests with all STM32 families with the current patches
They are all OK

JojoS62 · 2021-11-12T09:23:34Z

I will try to continue tomorrow and fix the issues that @vznncv mentioned.

vznncv · 2021-11-28T20:25:04Z

Hi @JojoS62

Sorry for delayed answer. I was busy and didn't have enough time to check changes/fixes.

But finally I have updated my test example and test/check spi.

demo/example results

Example: https://github.com/vznncv/mbed-os-stm32-spi-3-wire-demo/tree/iss_spi_16bit

The example communicates with sensor (it sends data to 6 "free" 8-bit registers and then reads them).
I have updated this example to use 16-bit mode for data writing/reading. The test code isn't
efficient, since I need to switch between 8-bit and 16-bit mode often to send address byte, but anyway it allows to
automatically check the correctness of 16-bit transfer.
Additionally, I have added basic "loopback" example to test 16-bit mode with 4 wires.

For testing purposes I have used patches that I suggested (+some bug fixes that I made during debugging). The code with
patches: https://github.com/vznncv/mbed-os/tree/iss_spi_16bit (git patch: 0001-Fix-SPI-16-bit-logic.patch.txt)

I have tested it with 3 MCU:

STM32F103C8, result logs:
- stm32f103c8_3wire_release_logs.txt
- stm32f103c8_4wire_release_logs.txt
STM32F411CE, result logs:
- stm32f411ce_3wire_release_logs.txt
- stm32f411ce_4wire_release_logs.txt
STM32H743VI, result logs:
- stm32h743vi_3wire_release_logs.txt
- stm32h743vi_4wire_release_logs.txt

Summary:

synchronous API (3 and 4 wire modes) works without problems (or at least I haven't found any issues)
asynchronous API works in some cases, but it has the following problems:
- STM32H7 - 16 bit and 4-wire mode doesn't work, as you mentioned above
- STM32F4, STM32F1 - 3-wire mode doesn't work correctly (there are some problems with HAL library)

SPI enabling/disabling optimization notes

The default HAL library behavior for 3-wire mode (HAL_SPI_Transmit, HAL_SPI_Receive, HAL_SPI_Receive_IT
, HAL_SPI_Transmit_IT) - disable SPI between transaction.
For enabling/disabling optimization you need to adjust spi_irq_handler_asynch interrupt handler,
since it calls the HAL_SPI_IRQHandler, that calls SPI_CloseRx_ISR (via SPI_RxISR_16BIT, ... callbacks), that disables SPI.
disabling/enabling is expensive and should be avoided in read/write
H7: set direction is locked when SPI is enabled. disable/enable is also expensive here

I temporarily update spi_master_one_wire_transfer for data writing:
```
LL_SPI_Disable(SPI_INST(obj));
for (int i = 0; i < tx_length; i += word_size) {
    LL_SPI_Enable(SPI_INST(obj));
    msp_wait_writable(obj);
    msp_write_data(obj, spi_get_word_from_buffer(tx_buffer + i, bitshift), bitshift);
    msp_wait_not_busy(obj);
    LL_SPI_Disable(SPI_INST(obj));
}
```
But I don't note any significant changes. Only tiny delay (~0.5 us) between frames (I have checked it with
STM32F411CE and 1 MHz SPI frequency). The base SPI transaction setup/validation code gives more overhead. I'm not
sure if such optimization gives significant performance boost.

Do you have any comparison table of SPI performance with/without optimization?
I recommend to switch back to HAL functions, they are safer more compatible between MCU series. For better performance when sending data blocks, the transfer function should be used.

Yes, they are safer and more compatible between MCU series, but unfortunately they don't implement correct 3-wire
mode for SPI data reading (except STM32H7 series).

Summary

I'd suggest to split this pull request into 2 ones:

Basic SPI fix to work with 16 bit mode correctly (at least synchronous API). If it's needed, I can create it.
SPI 3-wire mode optimization, that you have suggested.

What do you think about it?

vznncv · 2021-12-02T22:22:12Z

STM32H7 SPI async API (16 bit and 4-wire mode) notes

I have compared CubeMX demo program and stm_spi_api.c and found problem: STM32H7 HAL API assumes that SPI is disabled between function (HAL_SPI_TransmitReceive_IT, HAL_SPI_TransmitReceive, etc.) calls. It's needed for transfer size modification (CR2 register), that can be set only if SPI is disables. If SPI is enabled before synchronous API call (HAL_SPI_TransmitReceive) it doesn't hinder transfer, but may cause problems with asynchronous API.

After adding helper code that disables SPI before HAL_SPI_TransmitReceive_IT call and enables it after transmission, the problems with async API disappears: vznncv@44e2735

mergify · 2022-01-17T15:41:53Z

This PR cannot be merged due to conflicts. Please rebase to resolve them.

JojoS62 · 2022-01-24T14:19:01Z

closed in favour of #15206
thanks @vznncv

ciarmcom added the release-type: patch Indentifies a PR as containing just a patch label Sep 28, 2021

ciarmcom requested a review from a team September 28, 2021 09:00

ciarmcom added needs: review devices: st labels Sep 28, 2021

jeromecoutant approved these changes Sep 30, 2021

View reviewed changes

0xc0170 previously approved these changes Sep 30, 2021

View reviewed changes

mergify bot added needs: CI and removed needs: review labels Sep 30, 2021

mergify bot added ready for merge and removed needs: CI labels Sep 30, 2021

0xc0170 added needs: review and removed ready for merge labels Sep 30, 2021

0xc0170 added ready for merge and removed needs: review labels Oct 6, 2021

pass data as pointer to msp_write_data

a5de5bf

JojoS62 force-pushed the fix-spi-16bit-data branch from ee70315 to a5de5bf Compare October 28, 2021 10:25

merge Konstantins code

8328b3d

revert spi_master_one_wire_transfer

9e52610

in spi_master_write. This fixes the problem with too many spi_sck

JojoS62 marked this pull request as draft October 28, 2021 21:46

set 1LineTx mode again before enabling SPI

8634d04

JojoS62 mentioned this pull request Oct 30, 2021

SPI::transfer is defined in spi.h and spi.cpp #15160

Closed

version tested for V1 and V2

af81798

vznncv mentioned this pull request Jan 15, 2022

STM32: fix SPI 16 bit mode #15206

Merged

0xc0170 closed this Jan 26, 2022

mergify bot removed needs: work devices: st release-type: patch Indentifies a PR as containing just a patch labels Jan 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

STM32: fix SPI write with data >8 Bit #15115

STM32: fix SPI write with data >8 Bit #15115

JojoS62 commented Sep 28, 2021

jeromecoutant commented Sep 28, 2021

ciarmcom commented Sep 28, 2021

0xc0170 commented Sep 30, 2021

vznncv commented Sep 30, 2021

mbed-ci commented Sep 30, 2021

0xc0170 commented Sep 30, 2021

JojoS62 commented Sep 30, 2021 •

edited

Loading

0xc0170 commented Sep 30, 2021

JojoS62 commented Sep 30, 2021 •

edited

Loading

vznncv commented Oct 2, 2021

JojoS62 commented Oct 3, 2021 •

edited

Loading

jeromecoutant commented Oct 4, 2021

jeromecoutant commented Oct 6, 2021

0xc0170 commented Oct 6, 2021

JojoS62 commented Oct 6, 2021

vznncv commented Oct 6, 2021

JojoS62 commented Oct 20, 2021 •

edited

Loading

JojoS62 commented Oct 27, 2021 •

edited

Loading

jeromecoutant commented Oct 28, 2021

JojoS62 commented Oct 28, 2021 •

edited

Loading

JojoS62 commented Oct 28, 2021 •

edited

Loading

JojoS62 commented Oct 28, 2021

JojoS62 commented Oct 29, 2021 •

edited

Loading

JojoS62 commented Oct 30, 2021 •

edited

Loading

vznncv commented Oct 31, 2021

JojoS62 commented Nov 1, 2021 •

edited

Loading

jeromecoutant commented Nov 12, 2021

JojoS62 commented Nov 12, 2021

vznncv commented Nov 28, 2021

vznncv commented Dec 2, 2021

mergify bot commented Jan 17, 2022

JojoS62 commented Jan 24, 2022

STM32: fix SPI write with data >8 Bit #15115

STM32: fix SPI write with data >8 Bit #15115

Conversation

JojoS62 commented Sep 28, 2021

Summary of changes

Impact of changes

Migration actions required

Documentation

Pull request type

Test results

Reviewers

jeromecoutant commented Sep 28, 2021

ciarmcom commented Sep 28, 2021

0xc0170 commented Sep 30, 2021

vznncv commented Sep 30, 2021

mbed-ci commented Sep 30, 2021

Jenkins CI Test : ✔️ SUCCESS

Build Number: 1 | 🔒 Jenkins CI Job | 🌐 Logs & Artifacts

0xc0170 commented Sep 30, 2021

JojoS62 commented Sep 30, 2021 • edited Loading

0xc0170 commented Sep 30, 2021

JojoS62 commented Sep 30, 2021 • edited Loading

vznncv commented Oct 2, 2021

JojoS62 commented Oct 3, 2021 • edited Loading

jeromecoutant commented Oct 4, 2021

jeromecoutant commented Oct 6, 2021

0xc0170 commented Oct 6, 2021

JojoS62 commented Oct 6, 2021

vznncv commented Oct 6, 2021

JojoS62 commented Oct 20, 2021 • edited Loading

JojoS62 commented Oct 27, 2021 • edited Loading

jeromecoutant commented Oct 28, 2021

JojoS62 commented Oct 28, 2021 • edited Loading

JojoS62 commented Oct 28, 2021 • edited Loading

JojoS62 commented Oct 28, 2021

JojoS62 commented Oct 29, 2021 • edited Loading

JojoS62 commented Oct 30, 2021 • edited Loading

vznncv commented Oct 31, 2021

JojoS62 commented Nov 1, 2021 • edited Loading

jeromecoutant commented Nov 12, 2021

JojoS62 commented Nov 12, 2021

vznncv commented Nov 28, 2021

demo/example results

SPI enabling/disabling optimization notes

Summary

vznncv commented Dec 2, 2021

STM32H7 SPI async API (16 bit and 4-wire mode) notes

mergify bot commented Jan 17, 2022

JojoS62 commented Jan 24, 2022

JojoS62 commented Sep 30, 2021 •

edited

Loading

JojoS62 commented Sep 30, 2021 •

edited

Loading

JojoS62 commented Oct 3, 2021 •

edited

Loading

JojoS62 commented Oct 20, 2021 •

edited

Loading

JojoS62 commented Oct 27, 2021 •

edited

Loading

JojoS62 commented Oct 28, 2021 •

edited

Loading

JojoS62 commented Oct 28, 2021 •

edited

Loading

JojoS62 commented Oct 29, 2021 •

edited

Loading

JojoS62 commented Oct 30, 2021 •

edited

Loading

JojoS62 commented Nov 1, 2021 •

edited

Loading