Design, implement, and maintain CI/CD pipelines to automate deployments and testing processes.
Collaborate with development teams to streamline infrastructure provisioning and application deployment.
Manage and optimize cloud infrastructure (AWS, GCP, or Azure) to ensure scalability, security, and performance.
Automate server provisioning, configuration, and management using tools like Terraform, Ansible, Chef, or Puppet.
Ensure high availability, disaster recovery, and fault tolerance for production systems.
Monitor infrastructure performance, troubleshoot issues, and optimize resources for cost-effectiveness and reliability.
Work with containerization technologies like Docker and container orchestration platforms like Kubernetes.
Implement and manage version control systems and branching strategies using Git.
Establish and maintain system monitoring, alerting, and logging solutions using tools such as Prometheus, Grafana, ELK Stack, or similar.
Perform regular system audits and implement security measures to protect cloud resources.
Provide mentorship to junior DevOps engineers and share best practices.
Continuously improve infrastructure and processes to ensure faster development cycles, better scalability, and reduced downtime.
Participate in on-call support for production systems and handle escalations when necessary.
Required Skills and Experience:
Experience: 3 to 10 years in DevOps, cloud infrastructure management, and automation.
Technical Expertise: Proficiency in cloud platforms (AWS, GCP, or Azure) and in managing infrastructure as code with tools like Terraform or CloudFormation.
Strong experience with CI/CD tools like Jenkins, GitLab CI, CircleCI, or similar.
Hands-on experience with containerization tools like Docker and container orchestration platforms like Kubernetes.
Expertise in automation and configuration management tools such as Ansible, Puppet, Chef, or similar.
Solid experience with monitoring and observability tools such as Prometheus, Grafana, the ELK Stack, or APM tools.
Familiarity with managing and scaling SQL and NoSQL databases such as MySQL, PostgreSQL, and MongoDB.
Knowledge of version control systems (preferably Git) and branching strategies.
Experience in implementing and managing logging, monitoring, and alerting systems.
Understanding of infrastructure security best practices, including IAM, VPNs, SSL/TLS, firewalls, and network segmentation.
Problem-Solving: Strong troubleshooting skills with the ability to diagnose and resolve complex infrastructure and application issues.
Collaboration & Communication: Excellent communication skills and the ability to collaborate with cross-functional teams.
Process Improvement: Passion for continuous improvement, automation, and optimization in development and deployment pipelines.
Preferred Skills:
Experience with serverless architecture and tools (AWS Lambda, Azure Functions, etc.).
Knowledge of microservices architecture and experience in deploying and managing microservices.
Familiarity with service mesh technologies (Istio, Linkerd).
Understanding of container security and compliance.
Knowledge of Agile and DevOps methodologies and best practices.
Education & Certifications:
Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
DevOps or cloud-related certifications (AWS Certified DevOps Engineer - Professional, Google Cloud Professional Cloud DevOps Engineer, or similar) are a plus.