TORQUE Resource Manager- Basic Configuration

By default, make install installs all files in /usr/local/bin/usr/local/lib/usr/local/sbin,/usr/local/include, and /usr/local/man . You can specify an installation prefix other than/usr/local using --prefix as an argument to ./configure, for example:

./configure --prefix=$HOME

Verify you have environment variables configured so your system can find the shared libraries and binary files for TORQUE.

To set the library path, add the directory where the TORQUE libraries will be installed. For example, if your TORQUE libraries are installed in /opt/torque/lib, execute the following:

> set LD_LIBRARY_PATH=$(LD_LIBRARY_PATH):/opt/torque/lib
> ldconfig

Note Cluster Resources recommends that the TORQUE administrator be root.

1.2.1 Initialize/Configure TORQUE on the Server (pbs_server)

$TORQUEHOME/server_priv/ contains configuration and other information needed for pbs_server. One of the files in this directory is serverdbserverdb contains configuration parameters forpbs_server and its queues. In order for pbs_server to run, serverdb has to be initialized.

serverdb can be initialized in two ways:

  • pbs_server -t create
  • Execute ./torque.setup from the build directory.

Restart pbs_server after initializing serverdb.

> qterm
> pbs_server

1.2.1.1 pbs_server -t create

The '-t create' option tells pbs_server to create the serverdb file and initialize it with a minimum configuration to run pbs_server. To see the configuration, use qmgr:

> pbs_server -t create
> qmgr -c 'p s'

#
# Set server attributes.
#
set server acl_hosts = kmn
set server log_events = 511
set server mail_from = adm
set server scheduler_iteration = 600
set server node_check_rate = 150
set server tcp_timeout = 6

A single queue named 'batch' and a few needed server attribues are created.

1.2.1.2 ./torque.setup

The torque.setup script uses pbs_server -t create to initialize serverdb, and then adds a user as a manager and operator of TORQUE and other commonly used attributes. The syntax is:

  • ./torque.setup <username>
> ./torque.setup ken
> qmgr -c 'p s'

#
# Create queues and set their attributes.
#
#
# Create and define queue batch
#
create queue batch
set queue batch queue_type = Execution
set queue batch resources_default.nodes = 1
set queue batch resources_default.walltime = 01:00:00
set queue batch enabled = True
set queue batch started = True
#
# Set server attributes.
#
set server scheduling = True
set server acl_hosts = kmn
set server managers = ken@kmn
set server operators = ken@kmn
set server default_queue = batch
set server log_events = 511
set server mail_from = adm
set server scheduler_iteration = 600
set server node_check_rate = 150
set server tcp_timeout = 6
set server mom_job_sync = True
set server keep_completed = 300

1.2.2 Specify Compute Nodes

The environment variable $TORQUEHOME is where configuration files are stored. For TORQUE 2.1 and later, $TORQUEHOME is /var/spool/torque/. For earlier versions, $TORQUEHOME is/usr/spool/PBS/.

The pbs_server needs to know which systems on the network are its compute nodes. Each node must be specified on a line in the server's nodes file. This file is located at$TORQUEHOME/server_priv/nodes. In most cases, it is sufficient to specify just the names of the nodes on individual lines; however, various properties can be applied to each node.

Syntax of nodes file:
node-name[:ts] [np=] [gpus=] [properties]

The [:ts] option marks the node as timeshared. Timeshared nodes are listed by the server in the node status report, but the server does not allocate jobs to them.

The [np=] option specifies the number of virtual processors for a given node. The value can be less than, equal to, or greater than the number of physical processors on any given node.

The [gpus=] option specifies the number of GPUs for a given node. The value can be less than, equal to, or greater than the number of physical GPUs on any given node.

The node processor count can be automatically detected by the TORQUE server ifauto_node_np is set to TRUE. This can be set using the command qmgr -c "set server auto_node_np = True". Setting auto_node_np to TRUE overwrites the value of np set in$TORQUEHOME/server_priv/nodes.

The [properties] option allows you to specify arbitrary strings to identify the node. Property strings are alphanumeric characters only and must begin with an alphabetic character.

Comment lines are allowed in the nodes file if the first non-white space character is the pound sign (#).

The example below shows a possible node file listing.

$TORQUEHOME/server_priv/nodes :
# Nodes 001 and 003-005 are cluster nodes
#
node001 np=2 cluster01 rackNumber22
#
# node002 will be replaced soon
node002:ts waitingToBeReplaced
# node002 will be replaced soon
#
node003 np=4 cluster01 rackNumber24
node004  cluster01 rackNumber25
node005 np=2 cluster01 rackNumber26 RAM16GB
node006
node007 np=2
node008:ts np=4
...

1.2.3 Configure TORQUE on the Compute Nodes

If using TORQUE self extracting packages with default compute node configuration, no additional steps are required and you can skip this section.

If installing manually, or advanced compute node configuration is needed, edit the$TORQUEHOME/mom_priv/config file on each node. The recommended settings are below.

$TORQUEHOME/mom_priv/config :
$pbsserver      headnode          # note: hostname running pbs_server
$logevent       255               # bitmap of which events to log

This file is identical for all compute nodes and can be created on the head node and distributed in parallel to all systems.

1.2.4 Finalize Configurations

After serverdb and the server_priv/nodes file are configured, and MOM has a minimal configuration, restart the pbs_server on the server node and the pbs_mom on the compute nodes.

Compute Nodes:
> pbs_mom


Server Node:
> qterm -t quick
> pbs_server

After waiting several seconds, the pbsnodes -a command should list all nodes in state free.

See Also

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
资源包主要包含以下内容: ASP项目源码:每个资源包中都包含完整的ASP项目源码,这些源码采用了经典的ASP技术开发,结构清晰、注释详细,帮助用户轻松理解整个项目的逻辑和实现方式。通过这些源码,用户可以学习到ASP的基本语法、服务器端脚本编写方法、数据库操作、用户权限管理等关键技术。 数据库设计文件:为了方便用户更好地理解系统的后台逻辑,每个项目中都附带了完整的数据库设计文件。这些文件通常包括数据库结构图、数据表设计文档,以及示例数据SQL脚本。用户可以通过这些文件快速搭建项目所需的数据库环境,并了解各个数据表之间的关系和作用。 详细的开发文档:每个资源包都附有详细的开发文档,文档内容包括项目背景介绍、功能模块说明、系统流程图、用户界面设计以及关键代码解析等。这些文档为用户提供了深入的学习材料,使得即便是从零开始的开发者也能逐步掌握项目开发的全过程。 项目演示与使用指南:为帮助用户更好地理解和使用这些ASP项目,每个资源包中都包含项目的演示文件和使用指南。演示文件通常以视频或图文形式展示项目的主要功能和操作流程,使用指南则详细说明了如何配置开发环境、部署项目以及常见问题的解决方法。 毕业设计参考:对于正在准备毕业设计的学生来说,这些资源包是绝佳的参考材料。每个项目不仅功能完善、结构清晰,还符合常见的毕业设计要求和标准。通过这些项目,学生可以学习到如何从零开始构建一个完整的Web系统,并积累丰富的项目经验。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值