What We Can Offer
- Full social insurance package
- Global working environment
- Training and development
- 13th month salary
- Healthcare plan
Job Description
• Build/manage system S/W components such as GPU/NPU device drivers, communication libraries, directory services, distributed file systems, AI acceleration, and object storage for clustering.
• Automate S/W provisioning processes through IaC tools such as Ansible and Terraform or programming.
• Build/manage container orchestration tools such as Kubernetes (K8s) in clusters.
• Analyze and resolve the causes of various S/W or H/W errors.
• Provide overall management and technical consulting for Moreh's customer operating infrastructure.
• Install/operate various equipment in data centers, including CPU/GPU/NPU servers, high-speed interconnection networks such as InfiniBand and RoCE, storage servers, and firewalls.
Job Requirements
• 3 years of experience operating and managing Linux-based cluster systems
• Extensive understanding of various H/W and S/W components of computer systems.
• Knowledge of Docker and Kubernetes, and experience building a Kubernetes cluster oneself.
• Experience in analyzing various logs and operating monitoring solutions for large-scale IT infrastructure.
• Experience in developing high-availability (HA) S/W and related knowledge.
• Experience in installing and maintaining Linux systems at an IT system/solution distributor or reseller.
• Fluent English conversation skills (Writing & Reading)
• Excellent logical thinking and problem-solving skills.