Use official Nvidia base image

mirror of https://github.com/hiyouga/LLaMA-Factory.git synced 2026-06-21 22:58:58 +08:00

Note that the flash-attn library is installed in this image and the qwen model will use it automatically.
However, if the the host machine's GPU is not compatible with the library, an exception will be raised during the training process as follows:
FlashAttention only supports Ampere GPUs or newer.
So if the --flash_attn flag is not set, an additional patch for the qwen model's config is necessary to set the default value of use_flash_attn from "auto" to False.


Former-commit-id: e75407febd

This commit is contained in:

S3Studio

2024-03-14 18:03:33 +08:00

committed by

liuzhao2

parent dcbc8168a8

commit 46ef7416e6

2 changed files with 4 additions and 1 deletions

									
										2

Dockerfile
									
												View File
												
				@@ -1,4 +1,4 @@

				FROM cnstark/pytorch:2.0.1-py3.9.17-cuda11.8.0-ubuntu20.04

				FROM nvcr.io/nvidia/pytorch:24.01-py3

				WORKDIR /app

Use official Nvidia base image

2 Dockerfile Unescape Escape View File

2

Dockerfile

View File