Compiled TensorRT-LLM (TRT-LLM) engines for running Whisper with much faster inference.
baseten
company
Verified
Collections: 1
Models: 666
baseten/z-engine-fp8-tp1-16k-paged-context
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4-FP8 • 7
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4 • 12
baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1 • 2
baseten/RandomQwen2ForSequenceClassification-0.5B • Text Classification • 337
baseten/example-Meta-Llama-3-8B-InstructForSequenceClassification • 19
baseten/example-Meta-Llama-3-70B-InstructForSequenceClassification • 8
baseten/r1-nextn-heads • 5
baseten/deepseek-v3-engine-32k • 5
baseten/deepseek-v3-engine • 12