Compare Runtimes


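| Metric | Python (CPython) | containerd | llama.cpp | ONNX Runtime | Text Generation Inference | KoboldCpp |
| --- | --- | --- | --- | --- | --- | --- |
| Score | B (71) | B- (66) | C+ (63) | C- (50) | F (39) | F (38) |
| Type | Language | Container | | | | |
| Execution | interpreted | hybrid | aot | hybrid | hybrid | hybrid |
| Interface | cli | platform | cli | sdk | api | gui |
| Cold Start | 50 ms | 100 ms | 100 ms | 500 ms | 10000 ms | 1500 ms |
| Memory | 15 MB | 20 MB | 50 MB | 300 MB | 2000 MB | 400 MB |
| Startup | 10 ms | 20 ms | 10 ms | 100 ms | 5000 ms | 300 ms |
| Isolation | process | container | process | process | container | process |
| Maturity | production | production | production | production | production | stable |
| Languages | Python | Any | C, C++ | Python, C++, C#, Java | Rust, Python | C++, Python |
| License | Other | Apache-2.0 | MIT | MIT | Apache-2.0 | AGPL-3.0 |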
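To compare the numeric rows programmatically, the figures can be loaded into a small data structure. The following is a minimal sketch: the values are copied from the comparison above, while the dictionary layout and the helper names `rank_by` and `within_budget` are hypothetical, chosen only for illustration.

```python
# Cold start, memory, and startup figures copied from the comparison above.
# The dictionary layout and key names are assumptions for this sketch.
runtimes = {
    "Python (CPython)": {"cold_start_ms": 50, "memory_mb": 15, "startup_ms": 10},
    "containerd": {"cold_start_ms": 100, "memory_mb": 20, "startup_ms": 20},
    "llama.cpp": {"cold_start_ms": 100, "memory_mb": 50, "startup_ms": 10},
    "ONNX Runtime": {"cold_start_ms": 500, "memory_mb": 300, "startup_ms": 100},
    "Text Generation Inference": {"cold_start_ms": 10000, "memory_mb": 2000, "startup_ms": 5000},
    "KoboldCpp": {"cold_start_ms": 1500, "memory_mb": 400, "startup_ms": 300},
}


def rank_by(metric):
    """Return runtime names sorted ascending by the metric (lower is better)."""
    return sorted(runtimes, key=lambda name: runtimes[name][metric])


def within_budget(max_cold_start_ms, max_memory_mb):
    """Return runtimes whose cold start and memory both fit the given limits."""
    return [
        name
        for name, metrics in runtimes.items()
        if metrics["cold_start_ms"] <= max_cold_start_ms
        and metrics["memory_mb"] <= max_memory_mb
    ]


if __name__ == "__main__":
    print(rank_by("cold_start_ms"))
    print(within_budget(max_cold_start_ms=100, max_memory_mb=50))
```

Because Python's sort is stable, runtimes that tie on a metric (containerd and llama.cpp both list a 100 ms cold start) keep the column order of the comparison.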