Site Reliability Engineer
7 days ago
Welcome to Winspire Group
We are a leading online entertainment company at the forefront of the digital gaming industry, offering both real money and free-to-play casino experiences to players worldwide. Since 2018, we've grown into a global force with 200+ team members, operations in 24+ countries, and multiple gaming licenses.
About Us
At Winspire Group, we combine cutting-edge technology with immersive entertainment to create unforgettable gaming experiences. Our story is built on innovation, player satisfaction, and responsible gaming practices.
Our Mission
Our mission is to leverage expertise and passion to provide unmatched value to our customers and stakeholders. We are dedicated to creating a safe, engaging, and secure gaming environment while setting new standards of excellence in online entertainment.
Innovation & Technology
Through continuous innovation and strategic partnerships, our platform integrates advanced technology with gameplay, delivering an experience that goes beyond geographical boundaries. We embrace change and push boundaries to create solutions that exceed expectations.
Join Our Team
At Winspire Group, we believe in collaboration, growth, and inclusivity. Our values—integrity, innovation, excellence, customer focus, continuous improvement, and diversity—guide everything we do. Join our global team and be part of a company shaping the future of online entertainment.
Role Summary
As a
Site Reliability Engineer
(SRE), you will play a critical role in maintaining the reliability, performance, and scalability of our systems and applications. You will design and manage monitoring infrastructure, respond to incidents, automate processes, and support system improvements—all with the aim of ensuring exceptional service for our users worldwide.
Key Responsibilities
Monitoring & Observability
- Design and implement proactive monitoring and alerting solutions using tools like Prometheus, Grafana, Loki, CloudWatch, and Datadog
- Analyze telemetry data to identify anomalies and root causes before they affect end users
- Maintain high visibility into system health and performance through dashboards and SLO/SLI tracking
Incident & Problem Management
- Respond to incidents, perform thorough analysis, and contribute to postmortems
- Collaborate with Engineering, DevOps, and QA teams to reduce MTTR and eliminate recurring issues
- Use tools like Opsgenie, Jira, and Slack for efficient alerting, coordination, and documentation
Automation & Efficiency
- Automate deployment, configuration, and monitoring tasks using Bash, Python, Ansible,
- Monitor and maintain self-healing infrastructure using Kubernetes, Docker, and IaC tooling
Collaboration & Communication
- Serve as a reliability champion across technical teams
- Lead and document incident response practices and cross-functional drills
- Partner with Data, Product, and Engineering teams to optimize end-to-end delivery
- Fluent English, Russian will be a plus
Key Requirements
Must-Have Skills
- Proficiency in observability tools: Prometheus, Grafana, Loki, LogQL, Cloud-native monitoring (AWS/GCP), Jaeger, DataDog, Checkly
- Strong scripting abilities in Bash, Python, and/or PowerShell
- Hands-on experience with Kubernetes, Docker, IaC tools (e.g., Ansible, Git), and CI/CD flows
- Fluency in incident lifecycle management (ITIL familiarity is a plus)
- Knowledge of SLIs/SLOs/SLAs and how to manage them effectively
Preferred Skills
- Familiarity with, Opsgenie, Playwright (for automated testing), and Checkly.
- Working knowledge of databases: SQL Server, BigQuery, MySQL
- Linux system administration experience
- Security awareness (basic understanding of tools like Kali Linux, Wireshark, nmap, etc)
- Experience with monitoring as code, specifically using Checkly's CLI, JavaScript/TypeScript SDKs, or Terraform
- Strong ability to debug complex user flows with Playwright trace viewer, DOM snapshots, and network waterfalls for failing browser checks in distributed environments
Courses and Certifications (a plus)
- AWS Cloud Practitioner
- Google Cloud Professional, Associate Certifications or Azure Fundamentals
- LPIC-1 Linux Administrator
- Fundamentals of Infrastructure as Code (IaC)
- Datadog Fundamentals, APM & Distributed Tracing Fundamentals and Log Management Fundamentals
- Getting Started with Synthetic Monitoring and Browser Testing by Datadog Learning
What We Offer
- Corporative events and team building activities
- Participation in corporate sports events and team challenges – stay active and bond with colleagues outside the office
- Lunch allowance
- Hybrid work format as per WFH internal policy
- Health insurance from Day 1
- Birthday and anniversary gifts
- Inclusive, dynamic, and innovative work culture
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus JUJUR Full time €90,000 - €120,000 per yearPosition: Senior Site Reliability EngineerLocation: Limassol, CyprusDUTIES AND RESPONSIBILITIES: Develop and maintain monitoring, alerting, and observability tools to ensure system reliabilityEnsure consistent visibility into system performance and overall healthRespond to system incidents and service disruptions with detailed investigation and...
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus Toptalent Full time €90,000 - €120,000 per yearPosition: Senior Site Reliability EngineerLocation: Limassol, CyprusDUTIES AND RESPONSIBILITIES: Develop and maintain monitoring, alerting, and observability tools to ensure system reliabilityEnsure consistent visibility into system performance and overall healthRespond to system incidents and service disruptions with detailed investigation and...
-
Senior Site Reliability Engineer
5 days ago
Limassol, Limassol, Cyprus TOPTALENT Full time €70,000 - €100,000 per yearPosition: Senior Site Reliability Engineer Location: Limassol, Cyprus DUTIES AND RESPONSIBILITIES: Develop and maintain monitoring, alerting, and observability tools to ensure system reliabilityEnsure consistent visibility into system performance and overall healthRespond to system incidents and service disruptions with detailed investigation and...
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus Salve Consulting Full time €45,000 - €75,000 per yearLocation: Limassol, CyprusAs part of our expanding technology teams, we are looking for a Senior Site Reliability Engineer to join our innovative and diverse engineering group. You will play a key role in guaranteeing the reliability, scalability, and performance of our cloud platforms and mission-critical services. Working closely with cross-functional...
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus theHRchapter Full time €60,000 - €80,000 per yearWe are looking for a driven and technically-adept professional, Senior Site Reliability Engineer, to ensure the stability, performance, and scalability of our systems in a fast-paced, globally distributed environment.If you thrive on solving complex infrastructure challenges, love automation, and enjoy collaborating across engineering, DevOps, and...
-
Senior Site Reliability Engineer
1 day ago
Limassol, Limassol, Cyprus Nordicrecruiters Full timeAre you an experienced SRE who thrives on solving complex challenges, scaling high-availability systems, and driving operational excellence? This is an opportunity to take ownership of reliability, performance, and automation within a high-growth global organisation all while enjoying life in one of the Mediterraneans most desirable coastal cities: Limassol,...
-
Senior Site Reliability Engineer
1 day ago
Limassol, Limassol, Cyprus Nordicrecruiters Full timeAbout the Role We are looking for a seasoned Senior Site Reliability Engineer to join our growing engineering team based in Limassol. In this role, you will ensure system stability, reliability, and performance by building and maintaining robust infrastructure, automating deployment and monitoring processes, responding to incidents, and collaborating with...
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus Playnetic Full time €90,000 - €120,000 per yearEstablished in 2023, Playnetic is a new player in the world of gaming entertainment. We design and build slot games from scratch - from idea to release. Our games will be played in regulated markets globally through industry-leading operators. Our innovative gaming content is centred around our core values: quality gaming, dedicated customer service, and...
-
Senior Site Reliability Engineer
7 days ago
Limassol, Limassol, Cyprus JUJUR Full time €60,000 - €120,000 per yearPosition: Senior Site Reliability EngineerLocation: Limassol, CyprusDUTIES AND RESPONSIBILITIES: Develop and maintain software applications based on designs provided by the analysis and design team.Prepare setup and user manuals for the developed software solutions.Handle requests for new features, maintenance, and bug fixes, collaborating with quality...
-
Site Engineer
7 days ago
Limassol, Limassol, Cyprus Axia-Search Full time €40,000 - €60,000 per yearJob Title:Site EngineerLocation:Limassol & NicosiaJob Type:PermanentAre you a Site Engineer seeking a new opportunity for growth & development, or maybe seeking a new challenge?Axia-Search, isexclusivelypartnered with one of Cyprus'largest and respectedConstruction & Engineering Firms who are seeking toSite Engineersto join their growing teams based in...