slurm-gcp | Slurm on Google Cloud Platform | GCP library
kandi X-RAY | slurm-gcp Summary
kandi X-RAY | slurm-gcp Summary
Slurm on Google Cloud Platform
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Resume all nodes from a list
- Generate instance properties
- Run a subprocess
- Compute a Google App Service
- Constructs a batch request
- Callback for pubsub messages
- Chown the given path
- Get metadata for a given path
- Fetch the deployment scripts from the metadata
- Resume placement groups
- Resume all nodes in a list
- Setup the controller
- Recursively sync GCP instances
- Run subprocess
- Delete resource policies
- Configure root logger
- List all subscriptions
- Start the MOTD server
- Synchronously update subscriptions
- Suspend jobs on the specified nodes
- Return the status of a node
- Configure the directories
- Find subscription status
- Compute a Google App
- Sets up compute resources
- Fetchvel scripts from metadata
- Sync GCP nodes and slurm nodes
- Load slurm jobs
- Return machine configuration
slurm-gcp Key Features
slurm-gcp Examples and Code Snippets
Community Discussions
Trending Discussions on slurm-gcp
QUESTION
I would like to automatically add network-tag(http-server, https-server) to compute node which is automatically created by slurm-gcp .
After creating a node, the method of manually adding it using the gcloud command is temporarily used.
However, as more nodes are created, the time of manual addition becomes too slow.
I created a custom image and created an instance template using that image. (including http-server and https-server)
However, it does not appear to utilize the instance template at all when created.
I wonder how to solve it.
Thank you.
...ANSWER
Answered 2021-Nov-01 at 07:57According to the documentation:
If you need to create a tag on a VM, you must create the tag manually.
You can assign network tags to new VMs at creation time, or you can edit the set of assigned tags at any time later. You can edit network tags without stopping a VM.You can also add tags to, and remove tags from, an existing VM.
Check out the documentation on Configuring network tags.
QUESTION
I distributed slurm-gcp using Terraform through the GitHub and it was available successfully. Source:
Slurm on Google Cloud Platform
But I want to change the image I use when using node to a custom image.
I am trying to edit /slurm/scripts/config.yaml
.
Among the contents of the file:
...ANSWER
Answered 2021-Sep-14 at 07:08First you need to create your own image.
Create a new VM with the image you want to modify; make appropriate changes and stop the VM. Then create a new image from the VM's disk.
Next create a custom image from that disk and your path in the config.yaml
file can look like this:
image: projects/my-project-name/global/images/your-image-name
You can get exact path to your custom image by running:
QUESTION
I installed slurm-gcp from that github using terraform in cloud shell.
https://github.com/SchedMD/slurm-gcp#install-using-terraform
There are two VMs created.
- controller
- login
I installed several applications in /apps to confirm that they exist.
I would like to edit node's /etc/hosts and add network tag (http-server, https-server) to run the applications.
But nowhere does it say how to modify the image used by node .
Is there no way to edit the image of the slurm?
Thank you.
...ANSWER
Answered 2021-Sep-03 at 07:46As of the latest version of Slurm on GCP, the images are pre-built and hosted by SchedMD, and are used directly without the requirement to make build an image at deployment time. You can continue to modify the image by creating your own image. This is done by spinning up a VM with the public image pointed to in the tfvars, making your modifications, saving the image, and pointing your tfvars to your new image.
You can continue to use the custom compute/controller install scripts found in the scripts directory.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install slurm-gcp
To deploy, you must have a GCP account and either have the GCP Cloud SDK and Terraform installed on your computer or use the GCP Cloud Shell. There are 3 types of network storage sections that can be provided to the TF modules: network_storage, login_network_storage, and partitions[].network_storage.
cd to tf/examples/basic
Copy basic.tfvars.example to basic.tfvars
Edit basic.tfvars with the required configuration See the tf/examples/basic/io.tf file for more detailed information on available configuration options.
Deploy the cluster $ terraform init $ terraform apply -var-file=basic.tfvars
Tearing down the cluster $ terraform destroy -var-file=basic.tfvars NOTE: If additional resources (instances, networks) are created other than the ones created from the default deployment then they will need to be destroyed before deployment can be removed. This includes bursted instances that Slurm has not yet suspended.
network_storage is mounted on all instances in the cluster.
login_network_storage is mounted on the controller and all login nodes.
partitions[].network_storage is mounted on compute instances within the specified partition.
server_ip
remote_mount
local_mount
fs_type
mount_options
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page