DevOps Engineer
Job Description
Posted on:
November 7, 2024
Federato is on a mission to defend the right to efficient, equitable insurance for all. We enable insurers to provide affordable coverage to people and organizations facing the issues of today - the climate crisis, cyber-attacks, social inflation, etc. Our vision is understood and well funded by those behind Salesforce, Veeva, Zoom, Box, etc.
Federato’s AI/ML-driven platform leverages deep reinforcement learning to help insurance companies optimize the portfolio of risks they insure, allowing them to continue to provide fair and equitable pricing in difficult-to-price areas. Our category-defining ‘RiskOps’ solution drives better underwriting decisions by operationalizing underutilized data investments and surfacing real-time risk and portfolio insights. We focus on putting insurance underwriters back in the driver’s seat, helping them meet their goals while providing an important service to society.What You'll Be Doing:
Federato’s AI/ML-driven platform leverages deep reinforcement learning to help insurance companies optimize the portfolio of risks they insure, allowing them to continue to provide fair and equitable pricing in difficult-to-price areas. Our category-defining ‘RiskOps’ solution drives better underwriting decisions by operationalizing underutilized data investments and surfacing real-time risk and portfolio insights. We focus on putting insurance underwriters back in the driver’s seat, helping them meet their goals while providing an important service to society.What You'll Be Doing:
- Collaborate with cross-functional teams to provide infrastructure support for new features and services.
- Enhance and automate infrastructure management using tools like Terraform, Argo, and GitHub Actions to streamline deployments and operational processes.
- Manage and optimize our GCP and Kubernetes (GKE) environment, ensuring performance, scalability, and reliability.
- Improve our observability stack, working with tools such as Sentry, Prometheus, Grafana, and Alert Manager to provide real-time monitoring and proactive alerting.
- Set up and maintain performance monitoring tools, including k6 and other solutions, to ensure optimal performance and capacity planning.
- Identify areas for infrastructure improvement and take the lead on implementing these changes.
- Support incident management and resolution, performing root cause analysis and creating processes to prevent recurrence.
- 3+ years of hands-on experience managing cloud infrastructure and automation.
- Experience with Node.js or Python for backend services in a microservices architecture.
- 2+ years of experience with cloud providers, preferably Google Cloud Platform (GCP).
- 2+ years of experience managing Kubernetes clusters in production environments.
- Knowledge of asynchronous processing, message queues (e.g., Kafka, Pub/Sub), and event-driven architecture for backend applications.
- Experience focused on the internal engineer team success
Originally posted on Himalayas
Salary & Benefits
•
⚠ Sorry, application are closed for this job. Check more jobs here.