Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

#10270: yolov4 implementation #10068

Closed
wants to merge 1 commit into from
Closed

Conversation

punithsekar
Copy link
Contributor

@punithsekar punithsekar commented Jul 10, 2024

This PR contains ttnn yolov4 implementation with new conv API.

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 2 times, most recently from 5c45326 to 49fbdb0 Compare July 11, 2024 05:45
@punithsekar
Copy link
Contributor Author

punithsekar commented Jul 11, 2024

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 6 times, most recently from 3892644 to fa81d68 Compare July 12, 2024 16:46
@punithsekar
Copy link
Contributor Author

Performance sheet for neck sub-module : ops_perf_results_neck_2024_07_12_16_43_36.csv

We are not able to generate perf sheets for head sub-module as the test hangs after completing the sub-module and also the yolov4 pipeline fails due to OOM issue in DS5 sub-module first conv itself ,but DS5 sub-module is passing when tested separately.

Currently, Using torch maxpool in neck sub-module as ttnn.maxpool are making issues for those configuration #7746.
And the current pipeline of neck sub-modules outputs face low PCC.

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 6 times, most recently from 196a75f to 805d26a Compare July 15, 2024 14:46
@punithsekar
Copy link
Contributor Author

Latest neck and head sub-module perf sheets:
Neck sub-module: ops_perf_results_neck_latest_2024_07_15_17_20_08.csv
Head sub_module: ops_perf_results_head_2024_07_15_17_21_58.csv

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 4 times, most recently from fdfea45 to e6b824e Compare July 16, 2024 12:56
@punithsekar
Copy link
Contributor Author

All the sub-modules work fine, the YOLOv4 model produces three outputs, and their PCC are greater than 0.95, 0.96, and 0.98, respectively.

Performance sheets of all modules:

Downsample1 sub_module : ops_perf_results_downsample1_17_2024_07_16_12_42_56.csv

Downsample2 sub_module : ops_perf_results_downsample2_17_2024_07_16_12_44_45.csv

Downsample3 sub_module : ops_perf_results_downsample3_17_2024_07_16_12_45_27.csv

Downsample4 sub_module : ops_perf_results_downsample4_17_2024_07_16_12_46_15.csv

Downsample15sub_module : ops_perf_results_downsample5_17_2024_07_16_12_46_55.csv

Neck sub_module : ops_perf_results_neck_17_2024_07_16_12_48_17.csv

Head sub_module : ops_perf_results_head_17_2024_07_16_12_47_31.csv

yolov4 module : ops_perf_results_yolov4_17_2024_07_16_12_49_50.csv

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 4 times, most recently from a5f91fd to edb0d05 Compare July 18, 2024 17:34
@punithsekar
Copy link
Contributor Author

punithsekar commented Jul 18, 2024

Latest performance sheet,
Downsample 1: ops_perf_results_down_1_18_2024_07_18_17_35_49.csv
Downsample 2: ops_perf_results_down_2_18_2024_07_18_17_36_29.csv
Downsample 3: ops_perf_results_down_3_18_2024_07_18_17_37_22.csv
Downsample 4: ops_perf_results_down_4_18_2024_07_18_17_37_55.csv
Downsample 5: ops_perf_results_down_5_18_2024_07_18_17_38_27.csv
Neck : ops_perf_results_neck_sub_18_2024_07_18_17_39_51.csv
Head : ops_perf_results_head_sub_18_2024_07_18_17_39_12.csv
Yolov4 : ops_perf_results_yolov4_18_latest_2024_07_18_17_41_31.csv

Below are the current FPS,

  • FPS (MatMul/Conv Ops only): 263.297
  • FPS (Other Device Ops): 415.307
  • FPS (All Ops): 187.65

Note : Currently using 3 torch maxpool which is not visible in the perf sheets.

@punithsekar punithsekar changed the title WIP- yolov4 implementation #10270: yolov4 implementation Jul 18, 2024
@punithsekar
Copy link
Contributor Author

Latest perf sheet,
yolov4 model: yolov4_19_07_2024.csv

Below are the current FPS,

  • FPS (MatMul/Conv Ops only): 263.459
  • FPS (Other Device Ops): 15.494
  • FPS (All Ops): 14.826

@punithsekar punithsekar force-pushed the punith/yolov4_with_new_conv branch 4 times, most recently from 7552b26 to 7abcdce Compare July 25, 2024 08:41
@punithsekar
Copy link
Contributor Author

Closing this PR since it's outdated and the YOLOv4 model is in the main.

@punithsekar punithsekar closed this Oct 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant