Site Reliability Engineer (SRE)

Дата размещения вакансии: 27.09.2024
Работодатель: Americor Funding Inc
Уровень зарплаты:
от 2300 до 3100 RUR
Город:
Москва
Требуемый опыт работы:
От 3 до 6 лет

About Americor

Americor is a leader in debt relief solutions, helping thousands of clients in the USA achieve financial freedom through innovative services.

We are looking for a Site Reliability Engineer (SRE) to support and enhance the infrastructure of our mission-critical CRM project, ensuring seamless service delivery to help people regain control of their financial futures. As a recognized ‘Top Place to Work’ and ‘Best Company’ in customer service, quality, and value, we value collaboration, growth, and innovation in every team member.

Project Features

  • Hosted in MS Azure, AWS, but mainly OVHcloud (US)

  • OVHcloud contains Bare Metal and VMs
  • OS: CentOS / AlmaLinux OS

  • Components: Nginx, KeyDB/Redis, OpenSearch

  • Database: MariaDB/MySQL, Percona / Galera Cluster, ProxySQL

  • Storage: GlusterFS

  • Language: PHP 8 (Yii2, Symfony, Laravel)

  • Monitoring tools: Datadog, Sentry, Pingdom

  • IaC: Terraform, Ansible

  • Alerting: OpsGenie

Responsibilities

  • Ensure the reliability of infrastructure supporting mission-critical services, minimizing downtime and optimizing performance.

  • Proactively monitor, respond to, diagnose and resolve incidents, improving response time and minimizing customer impact.

  • Work closely with Russian speaking Developers, QA and System Analysts.

  • Enhance CI/CD pipelines, monitoring tools, and automation processes to streamline workflows and increase system efficiency.

  • Keep infrastructure-related documentation up to date.

Requirements:

  • 5+ years of experience in a Site Reliability Engineering role, with a proven track record of maintaining high-availability infrastructure in a high-load environment.

  • Expertise in Linux systems and web stacks (Nginx, PHP, MySQL/MariaDB, Redis/KeyDB) to ensure smooth and efficient operation.

  • Strong experience with MySQL/MariaDB Galera cluster and Gluster storage to optimize data reliability and scalability.

  • Deep knowledge of network architectures, including TCP/IP, DNS, VPNs, and load balancing techniques, with hands-on experience in troubleshooting and optimizing network performance to support distributed systems across multiple regions.

  • Solid understanding of CI/CD and security best practices to drive efficiency and ensure the protection of our systems.

  • Fluent in both Russian and English, both written and spoken (You will work on a daily basis with the Director of DevOps who is a native speaker)

Desirable but not mandatory

  • Proficiency in PHP and Docker for seamless integration and deployment of services

  • Understanding of the principles of Infrastructure-as-Code, Monitoring-as-Code, and GitOps (we use Ansible and Terraform)

  • Experience with Cloudflare and AWS services (EKS, S3, OpenSearch)

  • Experience in building fault-tolerant systems and compliance audits (SOC, FFIEC, etc.)

  • Familiarity with Jira and Agile software development

  • Familiarity with modern container orchestration and deployment tools (Kubernetes, Helm)

What do we offer:

  • Remote work with flexible schedule
  • Competitive salary based on performance, with payments in USD through Deel.com
  • Payments to foreign account
  • Assistance with opening an IE in Georgia (if necessary)
  • Paid holidays, sick leave, sports, English in Skyeng
  • Participation in an interesting project with the possibility of team building
  • Support for initiatives and opportunities for development