Commit Graph

34 Commits

Author SHA1 Message Date
leo-pony
e86cb8a4fa [npu] update cann base image and torch 2.4 (#7061)
* Update base npu container image version:The Python version required for Hugging Face Transformers is >= python3.10

* Fix the bug: arg type of INSTALL_DEEPSPEED shoud been string now.

* Update Ascend CANN, CANN-Kernel and corresponding torch and torch-npu version

* Upgrade torch-npu needs packages' version: torch==2.1.0 and torch-npu==2.4.0.post2

Former-commit-id: acc52e0fe7
2025-02-25 23:32:01 +08:00
hiyouga
a02a140840 update wechat
Former-commit-id: cd14336506
2025-01-04 07:59:57 +00:00
shibingli@yeah.net
c76c33ddb1 Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.
Former-commit-id: f1d76786e0
2024-12-27 18:31:14 +08:00
shibingli@yeah.net
a37ef0eaae Add ARG HTTP_PROXY in Dockerfile to support HTTP proxy during image building.This commit introduces an ARG parameter named HTTP_PROXY in the Dockerfile. This addition allows for the configuration of an HTTP proxy, facilitating image building in environments with network restrictions.
Former-commit-id: a3a49b1ea4
2024-12-27 18:17:17 +08:00
XYZliang
1c1e48a570 Increase shm_size to 16GB in docker-compose.yml to optimize shared memory allocation for large-scale model fine-tuning tasks.
This pull request increases the shm_size parameter in docker-compose.yml to 16GB. The goal is to enhance the LLaMA-Factory framework’s performance for large model fine-tuning tasks by providing sufficient shared memory for efficient data loading and parallel processing.

This PR also addresses the issues discussed in [this comment](https://github.com/hiyouga/LLaMA-Factory/issues/4316#issuecomment-2466270708) regarding Shared Memory Limit error.


Former-commit-id: 64414905a3
2024-11-13 10:13:59 +08:00
hoshi-hiyouga
5ad86fc71d Update Dockerfile
Former-commit-id: 5e8a619d1d
2024-11-02 20:20:26 +08:00
Valerio Mariani
1ab3ea21ce make base image parametric.
default `BASE_IMAGE` is nvcr.io/nvidia/pytorch:24.02-py3 for retro-compatibility


Former-commit-id: d5b2def7c0
2024-10-30 21:53:32 +01:00
hiyouga
e90a1199da tiny fix
Former-commit-id: 3af57795dd
2024-10-11 23:51:54 +08:00
StrangeBytesDev
d1e766898c Add additional install options to Dockerfiles
Former-commit-id: 237e302b5c
2024-09-24 16:54:46 -07:00
HardAndHeavy
a39dc400ed update the ROCm version to 6.2
Former-commit-id: b1f9ccb29f
2024-09-12 23:46:33 +03:00
MengqingCao
a620fa8be3 update npu base image
Former-commit-id: b3f4acd1b4
2024-08-21 09:12:38 +00:00
hiyouga
884b0bbb4f tiny fix
Former-commit-id: 668654b5ad
2024-07-26 11:51:00 +08:00
HardAndHeavy
27f42f6319 Add ROCm support
Former-commit-id: c8e18a669a
2024-07-25 21:29:28 +03:00
hzhaoy
93ba3bd5b0 fix #4780
Former-commit-id: 642c6d666f
2024-07-12 00:25:48 +08:00
hzhaoy
b3e4793ded fix #4779
Former-commit-id: a8bf1abf0f
2024-07-12 00:15:15 +08:00
hiyouga
4b1ab6c83d update npu docker
Former-commit-id: 93e6fbb37d
2024-06-30 21:05:31 +08:00
hiyouga
cba994b9dc fix docker flashattn
Former-commit-id: e3141f5f1b
2024-06-28 01:28:59 +08:00
hiyouga
5e60c29971 fix docker file
Former-commit-id: 0f421055da
2024-06-27 20:29:16 +08:00
hiyouga
7c488cea57 tiny fix
Former-commit-id: e44a4f07f0
2024-06-27 20:14:48 +08:00
hoshi-hiyouga
37d3adb1f8 Merge pull request #4461 from hzhaoy/feature/support-flash-attn
support flash-attn in Dockerfile

Former-commit-id: 64b131dcfa
2024-06-27 20:05:26 +08:00
hoshi-hiyouga
ffe7cda93d Update Dockerfile
Former-commit-id: 5fcd33fd4f
2024-06-27 19:57:40 +08:00
hoshi-hiyouga
030dda5861 Update Dockerfile
Former-commit-id: 9624af1df3
2024-06-27 19:51:25 +08:00
fanjunliang
9e93ff9479 support docker-npu-[amd64|arm64] build
Former-commit-id: bdda0827b3
2024-06-27 15:25:12 +08:00
hzhaoy
c662c2e56f add flash-attn installation flag in Dockerfile
Former-commit-id: e19491b0f0
2024-06-27 00:13:30 +08:00
fanjunliang
e7bd3ab6c3 fix torch-npu dependency
Former-commit-id: 8096f94a7d
2024-06-26 18:21:42 +08:00
MengqingCao
7c7d6614d8 fix docker-compose path
Former-commit-id: 106647a99d
2024-06-26 02:15:00 +00:00
hzhaoy
08a221443c support flash-attn in Dockerfile
Former-commit-id: c88b1be9f3
2024-06-25 15:13:07 +08:00
hiyouga
dafc9268bc fix #4419
Former-commit-id: efb81b25ec
2024-06-25 01:51:29 +08:00
hoshi-hiyouga
af5b2b9299 Update docker-compose.yml
Former-commit-id: 721acd8768
2024-06-25 00:54:28 +08:00
hoshi-hiyouga
6cd45e95f7 Update Dockerfile
Former-commit-id: 3af936a76d
2024-06-25 00:50:34 +08:00
hoshi-hiyouga
62e63d74ec Update docker-compose.yml
Former-commit-id: 15608d0558
2024-06-25 00:46:47 +08:00
hoshi-hiyouga
cfa2dbefcb Update Dockerfile
Former-commit-id: fce146ab68
2024-06-25 00:46:08 +08:00
hoshi-hiyouga
f84bce3638 Update Dockerfile
Former-commit-id: dcc2e24f5c
2024-06-24 23:41:35 +08:00
MengqingCao
3b499948a5 update docker files
1. add docker-npu (Dockerfile and docker-compose.yml)
  2. move cuda docker to docker-cuda and tiny changes to adapt to the new path


Former-commit-id: d7207e8ad1
2024-06-24 10:57:36 +00:00