
AI model servers

...

The main hardware cost of a BusinessGPT deployment is the AI servers that answer questions. These servers use GPUs.

...

GPU: CUDA 11.8 or later, minimum 12GB RAM.
The disk size should be 30% larger than the original content base file size.
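
The 30% rule above can be turned into a quick sizing calculation. The sketch below is illustrative only: the function name and the percentage parameter are assumptions, not part of BusinessGPT. It uses integer arithmetic and rounds up so that the result never falls below the stated margin.

```python
def required_disk_bytes(content_bytes: int, headroom_percent: int = 30) -> int:
    """Return the minimum disk size for a given content size.

    headroom_percent=30 mirrors the "30% larger" guideline; the name
    and default are illustrative assumptions, not a product setting.
    """
    if content_bytes < 0:
        raise ValueError("content_bytes must be non-negative")
    # Integer ceiling division avoids floating-point rounding surprises.
    return (content_bytes * (100 + headroom_percent) + 99) // 100


if __name__ == "__main__":
    content = 10 * 1024**3  # hypothetical 10 GiB content base
    print(required_disk_bytes(content))
```

For example, 100 bytes of content calls for at least 130 bytes of disk under this rule.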

Multiple Docker container servers behind a load balancer may be used for higher performance or high availability.
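
The source does not say which load balancer or balancing strategy is used. As a minimal sketch of the idea, assuming a simple round-robin policy and hypothetical backend addresses, requests can be spread across several AI server containers like this:

```python
from itertools import cycle


class RoundRobinBalancer:
    """Hand out backends in rotation (illustrative, not BusinessGPT code)."""

    def __init__(self, backends):
        self._backends = list(backends)
        if not self._backends:
            raise ValueError("at least one backend is required")
        self._rotation = cycle(self._backends)

    def next_backend(self) -> str:
        # Each call returns the next backend, wrapping around at the end.
        return next(self._rotation)


if __name__ == "__main__":
    # Hypothetical container addresses, used only for illustration.
    lb = RoundRobinBalancer(["http://ai-1:8000", "http://ai-2:8000"])
    for _ in range(4):
        print(lb.next_backend())
```

In production this role is normally filled by a dedicated load balancer with health checks rather than application code. The sketch only shows how requests alternate between containers.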

...

Linux server (Ubuntu), 4 CPUs, 8GB RAM. HDD/SSD with a read/write speed of at least 100MB/s.
The disk size should be 30% larger than the original content file size.

The server can optionally have an NVIDIA GPU with a minimum of 14GB vRAM for embedding the content.