| Model | # HPU | Precision | Throughput | Accuracy | Time to Train (TTT) | Batch Size | Task | Framework Version |
|---|---|---|---|---|---|---|---|---|
| Llama2-70B Fine Tuning FSDP (LoRA with torch.compile) | 8 | bf16 | 1.81 sentences/sec | 2.13 | 60 min | 10 | language-modeling | Optimum Habana 1.12.1 |
| Llama2-70B Fine Tuning (LoRA) | 8 | bf16 | 2.66 sentences/sec | 2.13 | 38.86 min | 10 | language-modeling | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| Falcon-180B Fine Tuning (LoRA) | 8 | bf16 | 2.47 sentences/sec | 3.74 | 162.13 min | 1 | language-modeling | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| GPTJ-CLM | 8 | bf16 | 22.17 sentences/sec | 0.53 | 21.56 min | 4 | language-modeling | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| GPTNEOX-20B-CLM | 16 | bf16 | 257 sentences/sec | 0.53 | 41 min | 2 | language-modeling | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| BridgeTower | 8 | bf16 | 1031 sentences/sec | | 7.28 min | 40 | contrastive-image-text | Optimum Habana 1.12.1 |
| GPT2-XL | 8 | bf16 | 95.69 sentences/sec | 0.47 | 8.81 min | 4 | language-modeling | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| ALBERT-XXL | 8 | bf16 | 422 sentences/sec | 94.8 | 7.4 min | 16 | question-answering | Optimum Habana 1.12.1 |
| BERT Base (torch.compile) | 8 | bf16 | 4513 sentences/sec | 85.29 | 0.93 min | 24 | question-answering | Optimum Habana 1.12.1 |
| BERT-Large Fine Tuning (torch.compile) | 8 | bf16 | 2099 sentences/sec | 93.18 | 1.93 min | 32 | question-answering | Optimum Habana 1.12.1 |
| ClipRoBERTa (torch.compile) | 8 | bf16 | 6420 images/sec | | 8.95 min | 64 | contrastive-image-text | Optimum Habana 1.12.1 |
| DistilBERT (torch.compile) | 8 | bf16 | 12192 sentences/sec | 82.02 | 0.56 min | 64 | question-answering | Optimum Habana 1.12.1 |
| Flan-T5 XXL | 8 | bf16 | 27.11 sentences/sec | 37.06 | 356 min | 22 | summarization | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| RoBERTa Large (torch.compile) | 8 | bf16 | 2084 sentences/sec | 94.84 | 1.95 min | 32 | question-answering | Optimum Habana 1.12.1 |
| Swin Transformer | 8 | bf16 | 5830 images/sec | 99.09 | 1.8 min | 160 | question-answering | Optimum Habana 1.12.1 |
| T5-LARGE | 8 | bf16 | 86 sentences/sec | 44.34 | 226 min | 4 | image-classification | DeepSpeed 0.14.0, Optimum Habana 1.12.1 |
| Vision Transformer | 8 | bf16 | 6273 images/sec | 98.85 | 0.91 min | 128 | image-classification | Optimum Habana 1.12.1 |
| Wav2Vec2.0 AC | 8 | bf16 | 1933 sentences/sec | 81.47 | 2.46 min | 16 | speech-recognition | Optimum Habana 1.12.1 |
| Wav2Vec2.0 ASR | 8 | bf16 | 88 sentences/sec | 3.96 | 17.5 min | 4 | speech-recognition | Optimum Habana 1.12.1 |