AI model servers
...
The main hardware cost of the BusinessGPT deployment is the AI servers responsible for answering questions. These servers use GPUs.
...
GPU: CUDA 11.8 or later, minimum 12 GB RAM.
The disk size should be 30% larger than the original content base file size.
Multiple Docker container servers behind a load balancer may be used for higher performance or high availability.
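
As a rough illustration of the disk sizing rule above, the following sketch computes the minimum disk size from the size of the original content. The function name `required_disk_bytes` and its `headroom_percent` parameter are hypothetical and not part of BusinessGPT; only the 30% figure comes from the requirements.

```python
def required_disk_bytes(content_bytes: int, headroom_percent: int = 30) -> int:
    """Return the minimum disk size in bytes for a given content size.

    Hypothetical helper: applies the "30% larger than the original
    content" rule using integer arithmetic, rounding up to a whole byte.
    """
    if content_bytes < 0:
        raise ValueError("content_bytes must be non-negative")
    total = content_bytes * (100 + headroom_percent)
    # Ceiling division by 100 without going through floating point.
    return -(-total // 100)


GIB = 1024 ** 3

# Example: 200 GiB of original content needs at least 260 GiB of disk.
print(required_disk_bytes(200 * GIB) // GIB)
```

The same calculation applies wherever the 30% rule is stated. Treat the result as a lower bound, since the requirement does not say how much extra space logs or other data need.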
...
Linux server (Ubuntu) with 4 CPUs and 8 GB RAM. HDD or SSD with a read/write speed of at least 100 MB/s.
The disk size should be 30% larger than the original content file size.
The server can optionally have an NVIDIA GPU with a minimum of 14 GB VRAM for embedding the content.