deepseek-ai/DeepSeek-R1-Distill-Llama-8B

vRAM: 24 GB

Disk: 18 GB

Context window: 20k to 100k tokens

AWS: g6.xlarge

Runpod cost: $0.22

On Prem: 1x RTX 4090

ai21labs/AI21-Jamba-1.5-Mini

vRAM: 80 GB

Disk: 110 GB

Context window: 100k tokens

AWS: Not feasible

Runpod cost: $0.80

On Prem: 1x A100 80 GB or 4x RTX 4000 Ada
