Compare Runtimes


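| Metric | Python (CPython) | CUDA Runtime | AWS Lambda | llama.cpp | Google Cloud Functions | Text Generation Inference | KoboldCpp |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Score | B (71) | B- (65) | B- (65) | C+ (63) | C (59) | F (39) | F (38) |
| Type | Language | | Serverless | | Serverless | | |
| Execution | interpreted | aot | hybrid | aot | hybrid | hybrid | hybrid |
| Interface | cli | sdk | platform | cli | platform | api | gui |
| Cold Start | 50 ms | 100 ms | 200 ms | 100 ms | 300 ms | 10000 ms | 1500 ms |
| Memory | 15 MB | 500 MB | 128 MB | 50 MB | 128 MB | 2000 MB | 400 MB |
| Startup | 10 ms | 50 ms | 100 ms | 10 ms | 50 ms | 5000 ms | 300 ms |
| Isolation | process | hardware | microvm | process | container | container | process |
| Maturity | production | production | production | production | production | production | stable |
| Languages | Python | C, C++, Python | JavaScript, TypeScript, Python, Java, Go, Ruby, .NET, Rust | C, C++ | Python, JavaScript, TypeScript, Go, Java, Ruby, PHP | Rust, Python | C++, Python |
| License | Other | Proprietary | Proprietary | MIT | Proprietary | Apache-2.0 | AGPL-3.0 |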