AI model servers
The main cost of a BusinessGPT deployment is the AI servers that answer questions. These servers run on graphics cards with GPUs (Graphics Processing Units).
Graphics Cards
Below are two graphics cards that can be purchased for standard servers. The cards differ in the number of questions they can process per minute.
To answer more questions per minute, the system supports multiple servers behind a load balancer.
Card | Answer speed | Purchase cost* (one-time) |
---|---|---|
Nvidia RTX 4090, 24 GB VRAM, 4 vCPU | 2.1 sec (29 questions/min) | $2289 |
Nvidia RTX 4070 Ti, 12 GB VRAM, 8 vCPU | 4.2 sec (14 questions/min) | |
*Costs were taken from Amazon.com; these graphics cards can also be purchased elsewhere.
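
The questions-per-minute figures in the table follow from the answer speed (60 seconds divided by the seconds per answer). The sketch below reproduces that arithmetic and gives a rough estimate of how many servers a target load would require. It rests on two assumptions that are not stated in the table: each card answers one question at a time, and throughput scales linearly when servers are added behind the load balancer. The function names are illustrative only and are not part of BusinessGPT.

```python
import math

# Answer times in seconds, taken from the table above.
ANSWER_SECONDS = {
    "RTX 4090": 2.1,
    "RTX 4070 Ti": 4.2,
}


def questions_per_minute(answer_seconds: float) -> float:
    """Throughput of one card, assuming it answers one question at a time."""
    return 60.0 / answer_seconds


def servers_needed(target_qpm: float, answer_seconds: float) -> int:
    """Rough server count for a target load.

    Assumes linear scaling behind the load balancer (an idealization;
    real deployments usually lose some throughput to overhead).
    """
    return math.ceil(target_qpm / questions_per_minute(answer_seconds))


if __name__ == "__main__":
    target = 50  # hypothetical target load, in questions per minute
    for card, seconds in ANSWER_SECONDS.items():
        qpm = questions_per_minute(seconds)
        count = servers_needed(target, seconds)
        print(f"{card}: {qpm:.1f} questions/min per card, "
              f"{count} server(s) for {target} questions/min")
```

Under these assumptions, a hypothetical load of 50 questions per minute would need two RTX 4090 servers or four RTX 4070 Ti servers. Actual capacity should be confirmed by measurement, since question length and concurrency affect answer time.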