Click here to revoke the Cookie consent

Senior Site Reliability Engineer, IMG ARENA

  • IMG
  • Chiswick Park, Chiswick, London W4, UK
  • 09/11/2023
Full time Broadcast Engineering Engineering

Job Description

Who We Are:

Launched in 2012, IMG Arena leveraged our rights expertise to pioneer the mainstream introduction of official data rights. We paved the way for a new revenue stream for our Rightsholder clients, whilst at the same time delivering greater value to the sports betting industry. We have gone on to expand our sports betting product suite with the addition of Event Centres and Official Virtual Sports. At IMG Arena the job is yours. We want you to be yourself. We are focused on building an inclusive and empowering environment that’s welcoming for everyone, where you are trusted and empowered to do what you’re good at. It’s your arena and your opportunity to shape it and your career. You’ll have the opportunity to pick up the latest technology, develop your own ideas and run with them. There’s loads of room for improvement and innovation. You get to prove and own the process from start to finish.

Endeavor is a global sports and entertainment company, home to the world’s most dynamic and engaging storytellers, brands, live events and experiences. The company is comprised of industry leaders including entertainment agency WME; sports, fashion, events and media company IMG; and premier mixed martial arts organization UFC. The Endeavor network specializes in talent representation, sports operations & advisory, event & experiences management, media production & distribution, experiential marketing and brand licensing.

Site Reliability Engineer

IMG ARENA are providing premium official sports content services to the betting and digital media industries. They have effective partnerships with a diverse range of international sports federations and associations including the MLS, PGA and European Tour. IMG ARENA offer multiple solutions that deliver low-latency data feeds and video streams of global sporting events at large scale. IMG ARENA’s engineering community are looking to minimise the challenges of shipping, rapidly iterating, and securing their applications, as well as ensuring they operate in a reliable and performant manner.

The Role:

As a senior SRE, you will join a growing SRE team that works closely with an established Platform team, multiple agile software development teams who embrace DevOps culture, and business stakeholders. You will be responsible for designing, implementing and monitoring solutions for our cloud infrastructure that powers services with millions of users whilst adopting and evangelising SRE principles and best practices. You will ensure adherence to security, best practice and standards for all application development, deployment and testing practices. You will design, develop, and maintain high-quality toolings and automation frameworks that helps tracking and ensuring the services' SLOs are met. You will help IMG ARENA’s engineering community learn and grow as industry best practices for DevOps and SRE evolve.

What will you be doing ?

  • Ensure the services' SLOs are met through aspects such as reliability and performance; support software development teams to meet their SLOs when necessary

  • Partner with DevOps and development teams to establish SLIs and SLOs for IMG ARENA’s internal and external-facing services

  • Implement toolings to automate away repetitive tasks and improve observability (such as tracking SLOs, monitoring custom platform metrics etc)

  • Work closely with the DevOps team to manage and improve IMG ARENA’s cloud infrastructure that mainly uses AWS, Terraform and Kubernetes

  • Work closely with the Security and QA teams to implement and improve automated testing frameworks that are used by IMG ARENA services

  • Improve IMG ARENA’s GitOps-based Continuous Delivery pipelines

  • Establish and maintain SRE best practices across the organisation through high-quality documentations,

knowledge-sharing workshops and training sessions etc.

  • Contribute to the design of project solutions and architectures that directly impact the business

  • Participate in on-call/support rota

Essential Requirements:

  • Strong track record as an SRE or software engineer who has managed large-scale applications in production whilst adopting SRE principles and best practices

  • Proficiency in at least one modern programming language (Full-stack development experience is a strong plus)

  • Experience in AWS services (including EC2, EBS, EKS, S3, CloudFront, VPC, IAM, CloudWatch, Lambda) and Infrastructure-as-Code (Terraform)

  • Good knowledge on TCP/IP, HTTP, websockets, load balancer, and DNS technologies

  • Strong expertise in Kubernetes

Passion about SRE principles and best practices

Senior grade leadership, including knowledge sharing, outreach, and fostering development in less experienced colleagues

Strong communication (written & verbal) and collaboration skills across both technical and non-technical stakeholders

Nice-to-have Skills:

  • Experience with GitOps and Kubernetes operator implementation

  • Experience with automated tests (such as load tests and chaos tests)

  • Experience with modern observability tools (Prometheus/Thanos, Loki, Grafana, Tempo)

  • Experience in an event based or sports tech organisation

Endeavor unites and brings people together in our love of sport, culture, and entertainment. We understand this can only be accomplished when we lead with a lens of diversity, equity, and inclusion in everything we do. As a global company that drives culture, we strive to reflect the world’s diverse voices.  

Endeavor is an equal opportunities employer and encourages applications from suitably qualified and eligible candidates regardless of sex, race, disability, age, sexual orientation, or religion or belief.