SambaTune Release Notes

Release 1.16 (2023-07-14)

Features

  • Added full stack tracing capability. See Stack Tracing report.

  • Added the ability to profile apps that do training using actual run command. See unet_e2e.yaml example below.

  • Added the ability to separate compile and run apps in YAML inputs. See unet_e2e.yaml example below.

  • Improvements to reports:

    • Improved collated report output to consolidate with other reports.

    • Improved section report output to support hypersection.

    • Improved DDR and PCIe reports to support hypersection.

    • Improved performance insights to support hypersection.

Bug Fixes

  • Fixed model sweep and samba sweep failure when mac_resources.json outputted empty or arch was not specified.

Deprecations

  • Deprecated --modes run for linear_net_compare.yaml.

unet_e2e.yaml example

This unet_e2e.yaml example illustrates some changes we made in this release.

# This example assumes that the sambaflow-apps-datascale-image-segmentation package is installed.
app: /opt/sambaflow/apps/image/segmentation/compile.py

compile-args: >
  --batch-size 64
  --mac-v2
  ...

run-app: /opt/sambaflow/apps/image/segmentation/hook.py

run-args: >
  --num-workers 4
  --data-dir $DATA_DIR
  --log-dir $LOG_DIR
  ...

env:
  DATA_DIR: DATA_DIR
  LOG_DIR: LOG_DIR