Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Embedding model: gte-Qwen

...

Depends on source of documents (e.g. SharePoint) - would need extra time to download each file

GPU

System Name

Chunks

Size

Tokens

Amount Of Docs

Embedding Time

Tokens / Minute

Chunks / Minute

2 x L4

Small

17,980

113 MB

4,537,270

138

36 Mins

126,035

499

1 x H100 NVL

Medium

17,980

113 MB

4,537,270

138

53 Mins

85,609

339

2 x H100 NVL

Large

17,980

113 MB

4,537,270

138

24 Mins

188,340

746

Examples

System Name

Size

Tokens

Est Num Pages

Files

Time

Small 2 x L4

10 Million

30,000 pages

80 Mins

Small 2 x L4

12 Million

60 Days

Large 2 x H100

12 Million

Large 2 x H100

30 TB / 31m MB