Site Reliability Engineer (SRE)

Date posted: 27.09.2024

Employer: Americor Funding Inc

Salary range:

From 2300 to 3100 RUR

City:

Moscow

Required work experience:

3 to 6 years

About Americor

Americor is a leader in debt relief solutions, helping thousands of clients in the USA achieve financial freedom through innovative services.

We are looking for a Site Reliability Engineer (SRE) to support and enhance the infrastructure of our mission-critical CRM project, ensuring seamless service delivery to help people regain control of their financial futures. As a recognized ‘Top Place to Work’ and ‘Best Company’ in customer service, quality, and value, we value collaboration, growth, and innovation in every team member.

Project Features

Hosted mainly on OVHcloud (US), with MS Azure and AWS also in use
OVHcloud hosting includes both bare-metal servers and VMs
OS: CentOS / AlmaLinux OS
Components: Nginx, KeyDB/Redis, OpenSearch
Database: MariaDB/MySQL, Percona / Galera Cluster, ProxySQL
Storage: GlusterFS
Language: PHP 8 (Yii2, Symfony, Laravel)
Monitoring tools: Datadog, Sentry, Pingdom
IaC: Terraform, Ansible
Alerting: OpsGenie

Responsibilities

Ensure the reliability of infrastructure supporting mission-critical services, minimizing downtime and optimizing performance.
Proactively monitor, respond to, diagnose and resolve incidents, improving response time and minimizing customer impact.
Work closely with Russian-speaking developers, QA engineers, and system analysts.
Enhance CI/CD pipelines, monitoring tools, and automation processes to streamline workflows and increase system efficiency.
Keep infrastructure-related documentation up to date.

Requirements:

5+ years of experience in a Site Reliability Engineering role, with a proven track record of maintaining high-availability infrastructure in a high-load environment.
Expertise in Linux systems and web stacks (Nginx, PHP, MySQL/MariaDB, Redis/KeyDB) to ensure smooth and efficient operation.
Strong experience with MySQL/MariaDB Galera Cluster and GlusterFS storage to optimize data reliability and scalability.
Deep knowledge of network architectures, including TCP/IP, DNS, VPNs, and load balancing techniques, with hands-on experience in troubleshooting and optimizing network performance to support distributed systems across multiple regions.
Solid understanding of CI/CD and security best practices to drive efficiency and ensure the protection of our systems.
Fluency in Russian and English, written and spoken (you will work daily with the Director of DevOps, who is a native speaker).

Desirable but not mandatory

Proficiency in PHP and Docker for seamless integration and deployment of services
Understanding of the principles of Infrastructure-as-Code, Monitoring-as-Code, and GitOps (we use Ansible and Terraform)
Experience with Cloudflare and AWS services (EKS, S3, OpenSearch)
Experience in building fault-tolerant systems and compliance audits (SOC, FFIEC, etc.)
Familiarity with Jira and Agile software development
Familiarity with modern container orchestration and deployment tools (Kubernetes, Helm)

What we offer:

Remote work with flexible schedule
Competitive salary based on performance, with payments in USD through Deel.com
Payments to foreign account
Assistance with registering as an individual entrepreneur (IE) in Georgia (if necessary)
Paid holidays, sick leave, sports, and English lessons with Skyeng
Participation in an interesting project, with team-building opportunities
Support for initiatives and opportunities for development
