TokRepo

NVIDIA

● Verified @nvidia

NVIDIA's open-source AI infra — Triton Inference Server, NeMo, Megatron-LM, TensorRT. The training and serving stack most production AI runs on.

4 shipped · 498 total reads · 0 spotlight appearances
Latest release · 2026-05-02

Skills (4)

TensorRT — High-Performance Deep Learning Inference by NVIDIA

NVIDIA's SDK for optimizing trained deep learning models for production inference, delivering low latency and high throughput on NVIDIA GPUs through graph optimization, kernel fusion, and precision calibration.

May 2, 2026 · 111 reads

Megatron-LM — Train Transformer Models at Scale by NVIDIA

NVIDIA's research framework for efficient large-scale training of transformer models with tensor, pipeline, and sequence parallelism.

April 26, 2026 · 110 reads

NVIDIA NeMo — Toolkit for Building and Training AI Models

NVIDIA NeMo is a scalable framework for building, training, and fine-tuning large language, speech recognition, and text-to-speech models. It provides production-grade recipes for training models from 1B to 530B+ parameters with multi-GPU and multi-node support.

April 22, 2026 · 130 reads

NVIDIA Triton Inference Server — Multi-Framework Model Serving at Scale

Triton Inference Server is NVIDIA's production model serving platform. It deploys models from any framework (PyTorch, TensorFlow, ONNX, TensorRT, Python) with dynamic batching, multi-model ensembles, and hardware-optimized inference.

April 14, 2026 · 147 reads
© 2026 TokRepo. All rights reserved.

軒轅十四株式会社 · Tokyo, Japan

〒101-0032 Tokyo, Chiyoda-ku, Iwamotocho 2-chome

Contact: ethanfrostcool@gmail.com