...
Purpose | Option 1 (GPU) | Option 2 (CPU) |
---|---|---|
Gateway (Linux) (Vector DB, SQL DB, Gateway API) | 16 GB RAM (32 GB for larger databases) 4 cores 150GB SSD | 16 GB RAM 4 cores |
Embedder (Linux) | 16 GB RAM 16 GB GPU 4 cores 60 GB SSD | 16 GB RAM 4 cores |
LLM (Linux) | 16 GB RAM 24 GB GPU 4 coresTo be determined.80 GB SSD | 16 GB RAM 8 cores |
Dashboard/Ingestor (Windows Server- Not a container) | 8 GB RAM 4 cores 80 GB SSD | 8 GB RAM 4 cores |
Info |
---|
|
Sizing
Above server hardware requirements are estimated to be sufficient for up to
50 GB of ingested data
300 Licensed users
20 concurrent users asking questions.
Add more LLM containers with GPUs to support more concurrent users.
AI model servers
The main hardware cost of the BusinessGPT deployment are the AI servers responsible for answering questions. These utilize GPUs.
...