I am using a computer cluster with 20 nodes and each node has 16 CPU. I tried to submit 1000 jobs to all nodes with the command "sbatch XX.sbatch". What I want is that 320 jobs are running simultaneously, i.e., 16 jobs per node, or 1 job per CPU.
When I use the . sbatch file with the parameters in the XX sbatch file is
#!/bin/bash
# Interpreter declaration
#SBATCH -N 1
#SBATCH -n 1
#SBATCH -c 1
#SBATCH -J job_XX
./example.sh
I noticed only 1 job is running on each node.
Then I tried
#!/bin/bash
# Interpreter declaration
#SBATCH -N 20
#SBATCH -n 1
#SBATCH -c 1
#SBATCH -J job_XX
./example.sh
I noticed only 1 job is running in 20 nodes, i.e., 1 job per 20 nodes.
Then I tried
#!/bin/bash
# Interpreter declaration
#SBATCH -N 20
#SBATCH -n 320
#SBATCH -c 1
#SBATCH --ntasks-per-node=16
#SBATCH -J job_XX
./example.sh
Still, 1 job is using all 20 nodes.
Does anyone know how to fix it? Thanks.