Operations and Performance Monitoring Engineer

Digikala Tehran

Posted 2 months ago

Job Description

We are searching for an experienced engineer to join our team and help us achieve our mission by empowering us with a rich feature set, high availability, and stellar performance; As we expand our infrastructure, we are improving our processes and pipelines; The selected candidate will manage and maintain our monitoring services, analyze problems, troubleshoot, and find appropriate customer resolutions.

Responsibilities:

  • Manage day-to-day operations, monitoring alerts, servers, and backup platforms.
  • Maintain and configure monitoring services to ensure reliability and uptime.
  • Identify hardware, software, and environmental issues.
  • Document problems and define solutions, prioritize problems, and assess the impact of issues.
  • Perform or delegate regular backup operations and implement appropriate processes for data protection, disaster recovery, and failover procedures.
  • Develop, implement, and maintain procedures to measure and track service performance and quality.

Requirements:

  • The ideal candidate should be self-motivated, proactive, capable of multi-tasking, meeting deadlines, and working in a collaborative environment.
  • Must be able to work in a 24/7 environment and work second/third shifts, weekends, and holidays.
  • Must have at least 2 years of work experience as an NOC Engineer or related positions.
  • Excellent problem-solving mindset and the ability to diagnose complex technical issues.
  • Detail-oriented and the be able to manage multiple projects.
  • Strong communication and collaboration skills, which are essential to execute duties to the others in the team.
  • Good knowledge of Linux system management and administration, and interest in knowledge upgrading.
  • Ample experience configuring and automating Monitoring tools (Prometheus, Grafana, Zabbix, etc.).
  • Knowledge of one Logging stack (Preferably ELK).
  • Hands-on experience with networking principles (DNS, Routing, Firewalls, Load Balancing, etc.).

Preferred Qualifications:

  • Possess vast knowledge and experience in system automation, deployment, and implementation.
  • Familiarity with CDN (Content Delivery Network) systems.
  • Basic knowledge of container concepts.
  • Familiarity with open-source services such as HAProxy, MySQL, Redis, and Memcached.

To see more jobs that fit your career