
Site Reliability Engineer
2 weeks ago
Welcome to TLF
We are a leading innovator in the exciting world of online gambling, providing a thrilling and responsible gaming experience to players in Asia, South America and Africa.
About Us:
At TLF, we pride ourselves on our cutting-edge technology, exceptional customer service, and commitment to integrity. With years of industry expertise, we offer a wide range of thrilling casino games and immersive online experiences that keep our players engaged and entertained.
Our Mission:
Our mission is to create an exhilarating and secure gambling environment where players can enjoy their favorite games responsibly. We believe in promoting responsible gaming practices and maintaining the highest standards of fairness, transparency, and player protection. Our customers' satisfaction and enjoyment are at the core of everything we do.
Innovation & Technology:
At TLF, we embrace innovation and leverage the latest technologies to enhance the gambling experience for our players. We own a state-of-the-art gaming platform and strive to deliver a user-friendly and immersive experience that keeps players coming back for more.
Join Our Team:
Are you passionate about the gambling industry? Are you ready to enjoy a dynamic and fast-paced environment? Join our talented team of professionals who are dedicated to pushing the boundaries of online gambling. We offer exciting career opportunities, a supportive work culture, and a chance to be part of an industry-leading company that's shaping the future of online gaming.
Role Summary
As a Site Reliability Engineer (SRE), you will play a critical role in maintaining the reliability, performance, and scalability of our systems and applications. You will design and manage monitoring infrastructure, respond to incidents, automate processes, and support system improvements, all with the aim of ensuring exceptional service for our users worldwide.
Key Responsibilities
Monitoring & Observability
- Design and implement proactive monitoring and alerting solutions using tools like Prometheus, Grafana, Loki, CloudWatch, and Datadog
- Analyze telemetry data to identify anomalies and root causes before they affect end users
- Maintain high visibility into system health and performance through dashboards and SLO/SLI tracking
Incident & Problem Management
- Respond to incidents, perform thorough analysis, and contribute to postmortems
- Collaborate with Engineering, DevOps, and QA teams to reduce MTTR and eliminate recurring issues
- Use tools like Opsgenie, Jira, and Slack for efficient alerting, coordination, and documentation
Automation & Efficiency
- Automate deployment, configuration, and monitoring tasks using Bash, Python, and Ansible
- Monitor and maintain self-healing infrastructure using Kubernetes, Docker, and IaC tooling
Collaboration & Communication
- Serve as a reliability champion across technical teams
- Lead and document incident response practices and cross-functional drills
- Partner with Data, Product, and Engineering teams to optimize end-to-end delivery
- Fluent English is required; Russian is a plus
Key Requirements
Must-Have Skills
- Proficiency in observability tools: Prometheus, Grafana, Loki, LogQL, cloud-native monitoring (AWS/GCP), Jaeger, Datadog, Checkly
- Strong scripting abilities in Bash, Python, and/or PowerShell
- Hands-on experience with Kubernetes, Docker, IaC tools (e.g., Ansible, Git), and CI/CD flows
- Fluency in incident lifecycle management (ITIL familiarity is a plus)
- Knowledge of SLIs/SLOs/SLAs and how to manage them effectively
Preferred Skills
- Familiarity with Opsgenie, Playwright (for automated testing), and Checkly
- Working knowledge of databases: SQL Server, BigQuery, MySQL
- Linux system administration experience
- Security awareness (basic understanding of tools like Kali Linux, Wireshark, nmap, etc.)
- Experience with monitoring as code, specifically using Checkly's CLI, JavaScript/TypeScript SDKs, or Terraform
- Strong ability to debug complex user flows with Playwright trace viewer, DOM snapshots, and network waterfalls for failing browser checks in distributed environments
Courses and Certifications (a plus)
- AWS Cloud Practitioner
- Google Cloud Associate or Professional certifications, or Azure Fundamentals
- LPIC-1 Linux Administrator
- Fundamentals of Infrastructure as Code (IaC)
- Datadog Fundamentals, APM & Distributed Tracing Fundamentals and Log Management Fundamentals
- Getting Started with Synthetic Monitoring and Browser Testing by Datadog Learning
What We Offer
- Brand new 5-floor office with rooftop and bar
- Corporate events and team-building activities
- Participation in corporate sports events and team challenges – stay active and bond with colleagues outside the office
- Lunch allowance
- Hybrid work format as per WFH internal policy
- Health insurance from Day 1
- Birthday and anniversary gifts
- Inclusive, dynamic, and innovative work culture