Compare Runtimes

Select runtimes to compare side by side. Click chips below to toggle selection.

MetricPython (CPython)containerdCUDA RuntimeAWS Lambdallama.cppText Generation Inference
ScoreB71B-66B-65B-65C+63F39
TypeLanguageContainerServerless
Executioninterpretedhybridaothybridaothybrid
Interfacecliplatformsdkplatformcliapi
Cold Start50ms100ms100ms200ms100ms10000ms
Memory15MB20MB500MB128MB50MB2000MB
Startup10ms20ms50ms100ms10ms5000ms
Isolationprocesscontainerhardwaremicrovmprocesscontainer
Maturityproductionproductionproductionproductionproductionproduction
LanguagesPythonAnyC, C++, PythonJavaScript, TypeScript, Python, Java, Go, Ruby, .NET, RustC, C++Rust, Python
LicenseOtherApache-2.0ProprietaryProprietaryMITApache-2.0
Links