Welcome to the Cerebras Inference API demo repository! This repository contains various examples showcasing the power of the Cerebras Wafer-Scale Engines and CS-3 systems for AI model inference. The ...
model_name: nvidia/nemotron-speech-streaming-en-0.6b  # Pre-trained CTC/hybrid model from NGC/HuggingFace or local .nemo file path
model_path: null  # The path to built '.nemo' boosting tree model ...