Deployment Reference

The Deployment Reference includes reference documentation for configuring a Determined deployment.

Title | Description
Common Configuration Options | Common configuration option reference for security, networking, checkpointing, and telemetry options.
Master Configuration Reference | Master node configuration option reference.
Agent Configuration Reference | Agent node configuration option reference.
Helm Chart Configuration Reference | Helm chart configuration option reference.

By hello@determined.ai

© Copyright 2023, Determined AI.