SambaTune Release Notes
Release 1.16 (2023-07-14)
Features
-
Added full stack tracing capability. See Stack Tracing report.
-
Added the ability to profile apps that do training using actual run command. See unet_e2e.yaml example below.
-
Added the ability to separate compile and run apps in YAML inputs. See unet_e2e.yaml example below.
-
Improvements to reports:
-
Improved collated report output to consolidate with other reports.
-
Improved section report output to support hypersection.
-
Improved DDR and PCIe reports to support hypersection.
-
Improved performance insights to support hypersection.
-
Bug Fixes
-
Fixed model sweep and samba sweep failure when
mac_resources.json
outputted empty or arch was not specified.
unet_e2e.yaml example
This unet_e2e.yaml
example illustrates some changes we made in this release.
# This example assumes that the sambaflow-apps-datascale-image-segmentation package is installed.
app: /opt/sambaflow/apps/image/segmentation/compile.py
compile-args: >
--batch-size 64
--mac-v2
...
run-app: /opt/sambaflow/apps/image/segmentation/hook.py
run-args: >
--num-workers 4
--data-dir $DATA_DIR
--log-dir $LOG_DIR
...
env:
DATA_DIR: DATA_DIR
LOG_DIR: LOG_DIR