triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate terabyte-scale datasets used to train deep-learning-based recommender systems.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Minimalistic large language model 3D-parallelism training
cuML - RAPIDS Machine Learning Library
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
cuDF - GPU DataFrame Library