Embedding model: gte-Qwen
...
Depends on source of documents (e.g. SharePoint) - would need extra time to download each file
GPU | System Name | Chunks | Size | Tokens | Amount Of Docs | Embedding Time | Tokens / Minute | Chunks / Minute |
---|---|---|---|---|---|---|---|---|
2 x L4 | Small | 17,980 | 113 MB | 4,537,270 | 138 | 36 Mins | 126,035 | 499 |
1 x H100 NVL | Medium | 17,980 | 113 MB | 4,537,270 | 138 | 53 Mins | 85,609 | 339 |
2 x H100 NVL | Large | 17,980 | 113 MB | 4,537,270 | 138 | 24 Mins | 188,340 | 746 |
Examples
System Name | Size | Tokens | Est Num Pages | Files | Time |
---|---|---|---|---|---|
Small 2 x L4 | 10 Million | 30,000 pages | 80 Mins | ||
Small 2 x L4 | 12 Million | 60 Days | |||
Large 2 x H100 | 12 Million | ||||
Large 2 x H100 | 30 TB / 31m MB |