    Repositories list

    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      Updated Dec 8, 2025
    • core

      Public
      The core library and APIs implementing the Triton Inference Server.
      C++
      Updated Dec 8, 2025
    • tensorrtllm_backend

      Public
      The Triton TensorRT-LLM Backend
      Updated Dec 8, 2025
    • Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.
      C++
      Updated Dec 7, 2025
    • model_analyzer

      Public
      Triton Model Analyzer is a CLI tool for understanding the compute and memory requirements of Triton Inference Server models.
      Python
      Updated Dec 6, 2025
    • The Triton backend for PyTorch TorchScript models.
      C++
      Updated Dec 6, 2025
    • openvino_backend

      Public
      OpenVINO backend for Triton.
      C++
      Updated Dec 5, 2025
    • onnxruntime_backend

      Public
      The Triton backend for the ONNX Runtime.
      C++
      Updated Dec 5, 2025
    • fil_backend

      Public
      FIL backend for the Triton Inference Server
      Jupyter Notebook
      Updated Dec 3, 2025
    • common

      Public
      Common source, scripts and utilities shared across all Triton repositories.
      C++
      Updated Dec 3, 2025
    • client

      Public
      Triton Python, C++, and Java client libraries, plus gRPC-generated client examples for Go, Java, and Scala.
      Python
      Updated Nov 27, 2025
    • Python
      Updated Nov 26, 2025
    • tensorflow_backend

      Public
      The Triton backend for TensorFlow.
      C++
      Updated Nov 22, 2025
    • Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
      Python
      Updated Nov 14, 2025
    • tutorials

      Public
      This repository contains tutorials and examples for the Triton Inference Server.
      Python
      Updated Nov 14, 2025
    • third_party

      Public
      Third-party source packages that are modified for use in Triton.
      C
      Updated Nov 14, 2025
    • The Triton backend for TensorRT.
      C++
      Updated Nov 14, 2025
    • Simple Triton backend used for testing.
      C++
      Updated Nov 14, 2025
    • An example Triton backend that demonstrates sending zero, one, or multiple responses for each request.
      C++
      Updated Nov 14, 2025
    • A TRITONCACHE implementation backed by a Redis cache.
      C++
      Updated Nov 14, 2025
    • Python
      Updated Nov 14, 2025
    • Implementation of a local in-memory cache for the Triton Inference Server's TRITONCACHE API.
      C++
      Updated Nov 14, 2025
    • Example Triton backend that demonstrates most of the Triton Backend API.
      C++
      Updated Nov 14, 2025
    • developer_tools

      Public
      C++
      Updated Nov 14, 2025
    • The Triton repository agent that verifies model checksums.
      C++
      Updated Nov 14, 2025
    • backend

      Public
      Common source, scripts and utilities for creating Triton backends.
      C++
      Updated Nov 14, 2025
    • dali_backend

      Public
      The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API.
      C++
      Updated Nov 13, 2025
    • pytriton

      Public
      PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
      Python
      Updated Aug 13, 2025
    • Triton Model Navigator is an inference toolkit for optimizing and deploying deep learning models, with a focus on NVIDIA GPUs.
      Python
      Updated Apr 22, 2025
    • .github

      Public
      Community health files for NVIDIA Triton
      Updated Mar 27, 2025