mini-queue

所属分类:其他
开发工具:Shell
文件大小:0KB
下载次数:0
上传日期:2023-05-31 05:34:05
上 传 者sh-1993
说明:  超级计算机中心子队列系统,实现按需独占计算资源的功能,
(Supercomputer center subqueue system, to achieve the function of exclusive computing resources on demand,)

文件列表:
LICENSE (35149, 2023-05-30)
__detect_cache__/ (0, 2023-05-30)
__detect_cache__/task_206.log (52, 2023-05-30)
__detect_cache__/task_207.log (52, 2023-05-30)
__detect_cache__/task_210.log (52, 2023-05-30)
__detect_cache__/task_215.log (52, 2023-05-30)
__task_cache__/ (0, 2023-05-30)
__task_cache__/task_1.sh (461, 2023-05-30)
__task_cache__/task_105.sh (884, 2023-05-30)
__task_cache__/task_113.sh (884, 2023-05-30)
__task_cache__/task_114.sh (884, 2023-05-30)
__task_cache__/task_115.sh (884, 2023-05-30)
__task_cache__/task_116.sh (938, 2023-05-30)
__task_cache__/task_120.sh (62, 2023-05-30)
__task_cache__/task_138.sh (884, 2023-05-30)
__task_cache__/task_139.sh (884, 2023-05-30)
__task_cache__/task_140.sh (884, 2023-05-30)
__task_cache__/task_141.sh (884, 2023-05-30)
__task_cache__/task_142.sh (884, 2023-05-30)
__task_cache__/task_143.sh (884, 2023-05-30)
__task_cache__/task_144.sh (884, 2023-05-30)
__task_cache__/task_145.sh (884, 2023-05-30)
__task_cache__/task_165.sh (884, 2023-05-30)
__task_cache__/task_174.sh (884, 2023-05-30)
__task_cache__/task_175.sh (884, 2023-05-30)
__task_cache__/task_176.sh (884, 2023-05-30)
__task_cache__/task_177.sh (884, 2023-05-30)
__task_cache__/task_196.sh (812, 2023-05-30)
__task_cache__/task_206.sh (907, 2023-05-30)
__task_cache__/task_207.sh (907, 2023-05-30)
__task_cache__/task_208.sh (907, 2023-05-30)
__task_cache__/task_209.sh (907, 2023-05-30)
__task_cache__/task_210.sh (907, 2023-05-30)
__task_cache__/task_215.sh (907, 2023-05-30)
__task_cache__/task_231.sh (50, 2023-05-30)
__task_cache__/task_233.sh (167, 2023-05-30)
__task_cache__/task_239.sh (214, 2023-05-30)
__task_cache__/task_64.sh (884, 2023-05-30)
bin/ (0, 2023-05-30)
... ...

# mini-queue Supercomputer center subqueue system, to achieve the function of exclusive computing resources on demand ## Installation Guide ```shell cd /Any_path_you_like; git clone https://github.com/yxchen123/mini-queue.git cd mini-queue; ./setup.sh; source ~/.bashrc; ``` ## Instructions ### Step 1:make a Worker_submit.sh ```shell #!/bin/bash #An example for MPI job. Note:It may vary from supercomputer center to supercomputer center #SBATCH -J job_name #SBATCH -o job-%j.log #SBATCH -e job-%j.err #SBATCH -p CPU-64C256GB #SBATCH -N 2 -n 128 NODELIST=$SLURM_JOB_NODELIST while true;do run_tasks.py $NODELIST done ``` This worker will constantly detect if a task exists in the mini-queue, and if it does, it runs, otherwise it waits. Of course, you can submit as many workers as you want to grab tasks from the mini-queue at the same time ## Step 2: Manage, view, and submit tasks ```shell #Once the Worker has applied for computing resources in the cluster, the queue is up to you. #such as slurm system. >> sbatch Worer_submit.sh Submitted batch job 19672 >> squeue -u user_name JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON) 19672 CPU-64C25 job_name yxchen R 0:07 2 cnode[66-67] #your worker is warmed up and ready to go >> cd Workdir (optional) >> ssbatch task.sh Submit mini-queue batch job 6 >> qqa TASKID NAME ST TIME NodeList 6 mpi_job.sh R 0-00:00:05 cnode[66-67] >> ssbatch task.sh Submit mini-queue batch job 7 >> qqa TASKID NAME ST TIME NodeList 6 mpi_job.sh R 0-00:00:10 cnode[66-67] 7 mpi_job.sh PD 0-00:00:00 (Priority) >> qqw TASKID ST WorkDir 6 R /gpfs/home/scms/yxchen/cyx/mini-test 7 PD /gpfs/home/scms/yxchen/cyx/mini-test #if you want to cancel a job, you can sscancel + TASKID >> sscancel 6 >> qqa TASKID NAME ST TIME NodeList 6 mpi_job.sh C 0-00:00:00 (Priority) 7 mpi_job.sh R 0-00:00:01 (Priority) #wait a moment >> qqa TASKID NAME ST TIME NodeList 7 mpi_job.sh R 0-00:00:05 (Priority) ```

近期下载者

相关文件


收藏者