Job Description
Client: A US-based deep tech company that focuses on Developer APIs and Developer Tools.
Description:
What will you DO?
- Build robust, fault-tolerant and highly scalable systems that support their growth.
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Measure and optimize system performance to push their capabilities forward, get ahead of customer needs, and innovate to improve continually.
- Create tools the team can use to do their jobs more efficiently.
- Work closely with development and support teams to solve production escalation cases.
What are we looking for in a Candidate?
Required Skills:
- SRE/DevOps Experience (3+ years).
- Hands-on experience with zero-downtime upgrades of production environments.
- Prior experience designing, deploying, and scaling production environments for consumer-facing (both internal and external) and enterprise-facing products.
- Experience running and deploying applications on Kubernetes.
- Knowledge of some or all of their tech stack – AWS, Kubernetes, MySQL, Postgres, Redis, RabbitMQ, HAProxy, Traefik, Terraform, Pulumi, Prometheus, OpsGenie.
- Knowledge of cloud infrastructure principles – load balancing, high availability, server-based and serverless architecture, database configurations.
- Kafka experience.