Branch predictor implementation #135

jiristefan · 2024-05-21T16:31:35Z

Resolves #64
Should also address the branch predictor TODOs from #1 and #2

This PR implements:

Branch Target Table (BTB) for storing instruction address / target address pairs for jump and branch instruction, along with a UI dock widgets for visualizing the table
3 simple static branch predictors (Always Taken / Not Taken, Backward Taken Forward Not Taken)
3 Smith predictors (1-bit, 2-bit, 2-bit with hysteresis), along with a Branch History Table (BHT)
Variable BTB and BHT sizes, from 0 to 8 bits (0 to 16 bits for BHT, might decrease to 8 in the future)
UI dock widget for visualizing the BHT contents, predictions, and predictor statistics
Optional Branch History Register for addressing the BHT

Predictor info widget with BHT:

BTB widget:

Configuration dialog:

jdupak

@ppisa I have found no major issue with the code. It could be simplified a lot and I don't like the repetition of information in many places, but it can be accepted and improved later.

.gitignore

src/gui/dialogs/new/newdialog.cpp

src/gui/mainwindow/mainwindow.h

jdupak · 2024-05-21T18:30:26Z

src/gui/windows/predictor/predictor_bht_dock.h

+    uint8_t number_of_bhr_bits;
+    uint8_t number_of_bht_bits;
+
+    QGroupBox *content;


Please mark these pointers explicitly QT_OWNED.

src/gui/windows/predictor/predictor_btb_dock.cpp

src/machine/predictor.h

ppisa · 2024-05-22T09:48:43Z

@jiristefan please focus on the thesis now and then follow @jdupak suggestions. When you push to your branch the pull request should automatically update as well.

I have tested next code on your branch 233b331 with sequence tuned to have no aliases and results seems to be correct

   addi a0, zero, 10
l: nop
   j a
   nop
a: nop
   bne zero, zero, b
   nop
   nop
b: beq zero, zero, c
   nop
c: addi a0, a0, -1
   bne a0, zero, l
   ebreak

with Smith 1 bit BTB bits 3, BHR 0, BHT 2, initial NT and results are strange on single-cycle and pipelined and results seems to be correct.

When I add BHR 2, I get correct 35, incorrect 5 accuracy 87%.

But when I switch to pipelined I get correct 28, incorrect 12 accuracy 70%.

Adding BHR support above the assignment you have complicated things for yourselves. It is nice to have it there, but you probably need to pass through pipeline the index of the entry used at the prediction time because BHR updates when some branches are near. Even this solution would have problems depending on the variable length stalls caused by possible different cache hit miss in the sequences. Using local history would be easier there probably a little. But it can be added in future to the selection of the Smitch etc...

I do not see there problems with global BHR as blocker, it should be documented that for precise expected behavior used in the textbooks the core has to be used in the single-cycle variant. But some analysis how to resolve problem for pipelined and even for superscalar cores should be considered to update behavior in future.

jiristefan · 2024-05-22T10:30:41Z

Thank you for the review. I will fix the highlighted issues during next week, I admit the frontend code is more messy than the internal implementation, I'll try to implement all the suggestions and generally clean it up.

As for the branch history register, I will probably use the suggested approach of storing the index with the instruction address locally in the predictor, I believe it should not take too much time to implement properly.

ppisa · 2024-05-22T12:00:59Z

@jiristefan I have solution to make BHR work correctly for pipelined version. You need two copies of BHR. The one used to form index at PC computation stage is updated from outcomes of predictions and then the second one is updated in MEM stage from actual results of the branch condition evaluations. They will be in delayed sync which will be maintained as long as there is no missprediction. At the flush event MEM stage BHR will be copied to PC stage BHR. This way the predictor should work exactly the same way/with same outcome for singlecycle and pipelined version. So when you submit the text you can try to implement this approach.

jiristefan · 2024-05-22T12:27:51Z

@ppisa That sounds like the simplest implementation: BHR is already implemented as a class, so a simple solution would be just to create another instance and update and use that one in the prediction function. Then I'll only add another member function of the predictor, which will copy over the values during flush as you said. Thank you for the idea.

jiristefan · 2024-06-01T21:50:53Z

I should mention that I moved the contents of the "Core" tab in the dialog window for configuring the simulation into a scroll area widget:

The branch predictor config added a bit of height to the tab, and there are couple more settings I would like to add to the tab regarding the predictor in the future. So, to prevent the window from getting too tall, I moved everything to the scroll area. Hopefully there will be no issue with this.

jiristefan · 2024-06-02T20:35:50Z

@jdupak I think the reason the Qt6 check is failing is probably because of the predictor_types.h file which I added with the predictor enums I use in the code. To use the enums inside QVariant in the UI objects, I had to register the enum using the provided macros Q_NAMESPACE and Q_ENUM_NS.

I got a very similar error with Qt5 when I tried to compile the code after that, before noticing I did not add the predictor_types.h to the CMakeLists.txt file, then it worked fine.

I'm not sure what changed in Qt6, or if this is a good way to handle this, so please let me know how to fix this. An alternative would be to just convert the enums to integers before storing them in the QVariant, and then back when reading them.

…f the code

…t properties before updating them

…n, without connection to main window

…o be shown by default for easy access during testing

…leanup

… of bits

ppisa · 2024-07-08T11:01:04Z

The branch prediction demonstration developed by Jiri Stefan in the frame of his master's thesis is welcomed step for QtRvSim to cover more computer architectures lectures topics. The visualization and correct/wrong prediction statistic counting matches teaching needs of the classes when demonstrated on single-cycle processor. For pipelined version there are more topics to discussion, see #143. It is questionable if solution matching basic textbooks principles on the pipelined version without classification of branched during cache fills can be found. Probably not, but some updates would provide better insight to the problem and code should be enhanced.

The branch prediction demonstration developed by Jiri Stefan in the frame of his master's thesis is welcomed step for QtRvSim to cover more computer architectures lectures topics. The visualization and correct/wrong prediction statistic counting matches teaching needs of the classes when demonstrated on single-cycle processor. For pipelined version there are more topics to discussion, see #143. It is questionable if solution matching basic textbooks principles on the pipelined version without classification of branched during cache fills can be found. Probably not, but some updates would provide better insight to the problem and code should be enhanced. Signed-off-by: Pavel Pisa <[email protected]>

jdupak reviewed May 22, 2024

View reviewed changes

jdupak force-pushed the master branch from ab0469b to 3439e35 Compare July 6, 2024 10:27

jdupak marked this pull request as ready for review July 6, 2024 10:28

jdupak self-requested a review July 6, 2024 10:28

jiristefan added 11 commits July 6, 2024 13:13

Renamed predictor class and function

e00a162

Added predictor.ccp file implementation and types header file

bd3f29c

Full branch predictor implementation without connection to the rest o…

79f142d

…f the code

Added predictor update to core.cpp, predictor still disabled by default

aca1cc2

Added predictor config to machineconfig and init to machine.cpp

0ad6f71

Added step_started signal to core, to be used to clear some GUI widge…

6d23cd0

…t properties before updating them

Added predictor configuration GUI to newdialog

3ee56b0

Added predictor BHT and BTT dock widgets, only internal implementatio…

8f93ff0

…n, without connection to main window

Connected BHT and BTT dock widgets to the main window, they are set t…

3135528

…o be shown by default for easy access during testing

Renamed Target Table to Target Buffer, split BTB and BHT bits, code c…

bae027e

…leanup

Moved UI function code to lambdas & Fixed BTB having incorrect number…

6c82d44

… of bits

jdupak marked this pull request as draft July 7, 2024 11:42

jiristefan force-pushed the master branch from 3439e35 to 6c82d44 Compare July 7, 2024 23:04

jiristefan marked this pull request as ready for review July 7, 2024 23:18

ppisa mentioned this pull request Jul 8, 2024

Branch predictor enhancements,corrections and coexistence with pipelined execution #143

Open

ppisa added this pull request to the merge queue Jul 8, 2024

Merged via the queue into cvut:master with commit e54b47d Jul 8, 2024
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Branch predictor implementation #135

Branch predictor implementation #135

jiristefan commented May 21, 2024 •

edited

Loading

jdupak left a comment

jdupak May 21, 2024

jiristefan Jun 1, 2024

ppisa commented May 22, 2024

jiristefan commented May 22, 2024

ppisa commented May 22, 2024

jiristefan commented May 22, 2024

jiristefan commented Jun 1, 2024

jiristefan commented Jun 2, 2024

ppisa commented Jul 8, 2024

Branch predictor implementation #135

Branch predictor implementation #135

Conversation

jiristefan commented May 21, 2024 • edited Loading

jdupak left a comment

Choose a reason for hiding this comment

jdupak May 21, 2024

Choose a reason for hiding this comment

jiristefan Jun 1, 2024

Choose a reason for hiding this comment

ppisa commented May 22, 2024

jiristefan commented May 22, 2024

ppisa commented May 22, 2024

jiristefan commented May 22, 2024

jiristefan commented Jun 1, 2024

jiristefan commented Jun 2, 2024

ppisa commented Jul 8, 2024

jiristefan commented May 21, 2024 •

edited

Loading