The gateway stores the text of the content uploaded.
Assuming a five-char average per word and 500 words per page, it is estimated that 1 page of content should take 5kb.
The system stores the original text and a vector representation (chunk) of the content, approximately 2kb per page.
A total of 7kb per page, or in other words, 1M can contain around 140 pages of content.
Assuming the average document has 10 pages, you can ingest 14 documents in 1M.
If you have 100,000 documents - you would need 7200M=7.2G.
Depending on how many pages of content - you can calculate the estimated size of DB needed for the product
Here is a sample sizing from our company:
| Site A | Site B |
Word No. of Files Size Average word file size | 1214 240 MB 200Kb | 480 400 MB 800Kb |
Excel No. of Files Size
| 283 37 MB | 281 24 MB
|
PowerPoint No. of Files Size
| 472 2368 MB | 27 74 MB |
Pdf No. of Files Size Average file size | 1372 2262 MB 1600 Kb | 975 407 MB 400Kb |