SambaTune Reports

Use one of the following reports to understand how to improve model performance:

Host-device breakdown report. Learn to identify if the application is host-bound or device-bound.
Section report. Learn how to read generated data for sections.
Stage latency report. Learn how to read generated latency data for stages in a section.
PMU stalls report. Learn how to read the generated PMU read stalls data for stages in a section.
Stack Tracing report. Trace a PyTorch operator or hyperfunction across different layers in the software stack.

Performance insights report (Beta). Learn about bottlenecks, including RDU versus Host, critical sections, and critital stages.
Hypertuner (Beta). Explore how to fine-tune model configuration by running the model with a combination of the model arguments and sweep arguments.
Check compilation and running logs. Learn about SambaTune’s detailed logs, which you can use for checking the high-level status, debugging errors, and reproducing substeps.