Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents

Container Purpose

Option 1 (GPU)

Option 2 (CPU)

Gateway (Linux)

16 GB

4 vCPUs

32 GB

8 vCPUs

Embedder (Linux)

16 GB

16GB GPU

None. Embedding happens on Gateway.

LLM (Linux)

16 GB

24 GB GPU

To be determined.

Dashboard

t3a.large

t3a.large

Info
  1. Example CPU processors: Intel Xeon Platinum 8000

  2. Example GPU for embedding: NVIDIA T4 Tensor Core

  3. Example GPU for LLM: NVIDIA L4 Tensor Core

AI model servers

The main hardware cost of the BusinessGPT deployment are the AI servers responsible for answering questions. These utilize GPUs.

...

Linux server Ubuntu 2CPU, 8GB RAM HDD/SSD with a R/W speed of at least 100MB/s.

GPU: CUDA 11.8+, Min 12GB 24GB RAM.
The disk size should be 30% larger than the original content base file size.

...