SambaNova Runtime release notes
Release 1.15 (2023-03-30)
New features and other improvements
-
Added timing information to description for faults that are manually cleared.
-
Added
snconfig
options to configure host hugepage settings. Runsnconfig set hugepages -–help
for details.
Fault management improvements
-
Improved error and fault handling policies and descriptions for RDU tiles, PCIe links and device memory.
-
Enhanced error messages in case of resource allocation failure.
-
Tile reset status is recorded in the fault management Error log. Use the snfadm tool to access the SNFM logs.
-
PCIe link faults will include a list of potential components that may be the reason for link issues in the link connectivity path.
Bug fixes
-
Improved device memory initialization and recycling
-
Fixed bugs in RDU tile resource management and allocation.
Supported components and versions
XRDU firmware
For XRDU firmware information, see the Hardware release notes:
Operating systems
-
Red Hat Enterprise Linux 8.5.
-
Ubuntu Linux 20.04 LTS.
Deprecated components
-
FTYPE_BAD_TILE tile fault has been renamed to FTYPE_TILE_EXCLUDED.
-
FTYPE_BAD_RDU is deprecated. Instead, you see FTYPE_RDU_INIT, FTYPE_RDU_RESET, or FTYPE_RDU_REMOVED, depending upon the reason for the RDU fault.
-
RampUpErrorException: Starting with release 1.16, this exception is no longer one of the SambaNova Runtime exceptions.
Release 1.14 (2023-01-10)
New features and other improvements
-
Configuration of RoCE before running any Data Parallel workloads is no longer required.
-
Enhanced fault management for both inventory and fault reporting.
Release 1.13 (2022-11-03)
New features and other improvements
-
New features
-
Enabled RDU reset from VM.
-
Improved fault management policies for PCIe link errors and on-chip memory errors.
-
Added option to
sntilestat
to skip idle tiles. -
Updated SambaNova Runtime APIs for equal functionality between C and C++ interfaces.
-
Enhanced multi-processing support for SambaNova Runtime APIs.
-
Enhanced host profiling information and detailed timeline view in SambaTune.
-
Enhanced
snprof
and added more robust fault reporting insnstat
. -
Added requirement to properly configure RoCE before running any DP workloads.
-
-
Fault management improvements
-
Enhanced fault management for both inventory and fault reporting.
-
-
Infrastructure
-
Improved packaging and simplified software dependencies.
-
-
Performance improvements
-
Faster SambaFlow context creation.
-
More efficient CPU usage.
-
Better performance for scaleout operations.
-
Release 1.12.7 (2022-07-30)
New features
-
Added SambaTune: a tool that supports profiling application performance.
-
Enabled C++ SambaRuntime Tensors ZeroCopy via pinned memory.
-
Improved Scale-out performance through parallel reduce.
-
Enhanced RDU reset support with VM.