Skip to content

partition64/nvidia-triton-vs-python-backend

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NVIDIA Triton vs Python Backend

This repository contains the requirements for deploying an NLLB model in NVIDIA Triton Inference Server and a standalone Python-based server for deep learning model inference performance and scalability.

Triton container version used: nvcr.io/nvidia/tritonserver:24.01-py

Quick Steps to Serve the Model

### Step 1: Clone the Repository
git clone https://github.com/Siddhartha-11/nvidia-triton-vs-python-backend.git
cd nvidia-triton-vs-python-backend

### Step 2: Launch Triton from the NGC Triton Container and Deploy the Model
docker run --rm --gpus=all -p8000:8000 -p8001:8001 -p8002:8002 \
  -v$(pwd)/Modelrepo/nllb:/models/nllb \
  --name triton-client nvcr.io/nvidia/tritonserver:24.01-py3 \
  tritonserver --model-repository=/models

### Step 3: Deploy the Model in Standard Server
python3 model.py

### Step 4: Sending Inference request
##NVIDIA Triton Server
cd python script file
##To enter a text and get the returned translated text:
python3 entertext2.py
##To send a text file containing 11 sentences for translation:
python3 triton_translation.py

##Standard Server
cd ServerNormal
##To enter a text and get the returned translated text:
python3 client.py
##To send a text file containing 11 sentences for translation:
python3 client2.py

To set up Docker on your system, follow the official Docker installation guide for your operating system : Install Docker

To know more about NVIDIA Triton Inference server, follow the link to official NVIDIA Triton Inference Server guide : NVIDIA Triton

List of all the Triton Inference Server Container Image : Containers

Model used : NLLB

About

comparative analysis of NVIDIA triton inference server and standalone server for deep learning model's inference performance and scalability.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages