Text Summarization using LLAMA2-70b
Dataset
The benchmark implementation run command will automatically download the validation and calibration datasets and do the necessary preprocessing. In case you want to download only the datasets, you can use the below commands.
LLAMA2-70b validation run uses the Open ORCA dataset.
Get Validation Dataset
cm run script --tags=get,dataset,openorca,validation -j
Model
The benchmark implementation run command will automatically download the required model and do the necessary conversions. In case you want to only download the official model, you can use the below commands.
Get the Official MLPerf LLAMA2-70b Model
Pytorch
cm run script --tags=get,ml-model,llama2-70b,_pytorch -j
Tip
Downloading llama2-70B model from Hugging Face will prompt you to enter the Hugging Face username and password. Please note that the password required is the access token generated for your account. Additionally, ensure that your account has access to the llama2-70B model.