Number of Required Servers
POC Requirements
1x Linux Instance (Gateway)
t3a.2xlarge - 100 GB SSD disk
1x Linux GPU Instance (LLM)
g6.xlarge - 60 GB SSD disk
1x Windows Instance (Dashboard/Ingestor)
t3a.large - 80 GB SSD disk
1x Microsoft SQL Server Express. May be co-located on the Windows instance.
SSL certificate (optional)
The customer may provide an SSL certificate to secure access to the dashboard website.
Network Access
Ensure that the BGPT dashboard server has network access to any privately hosted services containing data that needs to be ingested, e.g. Confluence Data Center.
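The network access requirement above can be checked from the dashboard server before ingestion is configured. The following is a minimal sketch, not part of the BGPT product: the function name can_reach is hypothetical, and the host and port to test must be replaced with the real internal service.

```python
import socket


def can_reach(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and opens a TCP socket;
        # DNS failures, refusals and timeouts all surface as OSError.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, calling can_reach("confluence.internal.example", 443) with a placeholder hostname such as this one returns False when the name does not resolve or the port is blocked. A True result only shows that the TCP port is open; it does not confirm that credentials or API access are working.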
Production
2+ GPU Linux Instances (1 for Gateway + AI model, 1 or more for extra AI model capacity)
2+ Windows Instances (Dashboard, Ingestor, Database). More depending on capacity + HA requirements.
1x Application Load Balancer
1x Microsoft SQL Server (Express/Standard). May be co-located on a Windows instance.
SSL certificate (optional)
The customer may provide an SSL certificate to secure access to the dashboard website.
Machine Types
EC2 Linux Instances
For LLM + Gateway Services
NVIDIA GPU required for the LLM, optional for Gateway Services
E.g. g6.xlarge
$0.803 / Hour (Llama 3)
1x required
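To put the hourly rate above in monthly terms, here is a rough sketch of the arithmetic. It assumes the instance runs continuously and uses 730 hours as an average month (365 x 24 / 12). The rate is the figure quoted above, and actual AWS pricing varies by region and over time. The function name monthly_cost is hypothetical.

```python
HOURLY_RATE_USD = 0.803  # g6.xlarge rate quoted in this document; verify current pricing
HOURS_PER_MONTH = 730.0  # assumption: average month, instance always on


def monthly_cost(hourly_rate: float, instances: int = 1,
                 hours: float = HOURS_PER_MONTH) -> float:
    """Estimate the monthly on-demand cost in USD, rounded to cents."""
    return round(hourly_rate * instances * hours, 2)


poc_estimate = monthly_cost(HOURLY_RATE_USD)          # one GPU instance
two_gpu_estimate = monthly_cost(HOURLY_RATE_USD, 2)   # two GPU instances
```

Under these assumptions one instance comes to roughly 586 USD per month, and two instances to roughly 1,172 USD. Storage, data transfer and Windows instance costs are not included.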
EC2 Windows Instances
For Dashboard, Database and Ingestor services.
E.g. m5a.large
Load Balancer
EC2 Application Load Balancer
Required if any component runs on more than one server.
Minimum Required Permissions for AGAT
During Deployment
Continuous RDP/SSH/all-ports access to the servers from the AGAT office's static IP address
Optional - IAM access to deploy instances and set up security configuration
After Deployment
Temporary access for support from the AGAT office's static IP address
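The access requirements above amount to inbound rules restricted to a single source address. The sketch below builds rule dictionaries in the shape of the IpPermissions structure accepted by the EC2 AuthorizeSecurityGroupIngress API. It covers only SSH (22) and RDP (3389), not the all-ports option. The address 203.0.113.10 is a documentation placeholder, not the real AGAT office IP, and the helper name ingress_rule is hypothetical.

```python
import ipaddress


def ingress_rule(office_ip: str, port: int, description: str) -> dict:
    """Build one TCP ingress rule limited to a single IPv4 source address."""
    addr = ipaddress.ip_address(office_ip)  # raises ValueError on malformed input
    if addr.version != 4:
        raise ValueError("this sketch only handles IPv4 addresses")
    return {
        "IpProtocol": "tcp",
        "FromPort": port,
        "ToPort": port,
        # /32 restricts the rule to exactly one host
        "IpRanges": [{"CidrIp": f"{addr}/32", "Description": description}],
    }


OFFICE_IP = "203.0.113.10"  # placeholder; substitute the static IP supplied by AGAT
rules = [
    ingress_rule(OFFICE_IP, 22, "SSH from AGAT office"),
    ingress_rule(OFFICE_IP, 3389, "RDP from AGAT office"),
]
```

Keeping the source to a /32 range means the rules can simply be revoked after deployment and re-added when temporary support access is needed.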
Deployment Process - Optionally managed by AGAT
Open network access from the BGPT servers to any locally hosted resources, e.g. Confluence.
Create Databases
Dashboard
Install Dashboard services
Gateway
Deploy Docker containers using the Docker Compose script.
Configure DB connection string
Configure LLM VPC IP
Deploy AGAT LLM Amazon Image
Configure Load Balancers to provide external access to Dashboard
Configure Dashboard to access company data stores (Google Drive, Confluence, etc.)
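For the "Configure DB connection string" step above, the sketch below assembles a SQL Server connection string in the common ADO.NET key/value style. This is an assumption: the exact format and keys the Gateway expects are not stated here and should be confirmed with AGAT. The host, database name and credentials shown are placeholders, and the function names are hypothetical.

```python
def _quote(value: str) -> str:
    """Wrap a value in double quotes if it contains characters that would break parsing."""
    if any(ch in value for ch in ";\"'") or value != value.strip():
        return '"' + value.replace('"', '""') + '"'
    return value


def sql_server_connection_string(host: str, database: str, user: str,
                                 password: str, port: int = 1433,
                                 encrypt: bool = True) -> str:
    """Build an ADO.NET-style connection string (assumed format, see note above)."""
    parts = {
        "Server": f"{host},{port}",  # SQL Server separates host and port with a comma
        "Database": database,
        "User Id": user,
        "Password": password,
        "Encrypt": "True" if encrypt else "False",
    }
    return ";".join(f"{key}={_quote(val)}" for key, val in parts.items()) + ";"


# Placeholder values only; use the real private address and credentials.
example = sql_server_connection_string("10.0.1.20", "bgpt_dashboard", "bgpt_app", "s3cret")
```

Building the string from parts, with quoting for special characters, avoids a malformed string when a password contains a semicolon.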