Site Reliability Engineer, India
- Department:
- R&D
- Location:
- Bangalore, Karnataka, India
Join a passionate and diverse team of developers in creating the next generation of clustered media content management systems, with the opportunity to work alongside some of the biggest content distribution providers and broadcasters in the world.
Responsibilities:
- Diagnose challenging technical issues within an AWS serverless environment
- Build and maintain tools to support the smooth running of a broadcast grade SaaS platform
- Active analysis of day-to-day technical operations with a goal to streamline and improve
- Bring an operational perspective to the team, ensuring that systems are developed with a mind towards reliability, maintainability, and operational ease
- Clearly and effectively communicate with team and management
- Support and troubleshoot systems, occasionally outside of regular office hours
- Encourage others to follow industry and platform best practices for the development and operation of systems
- An active interest in new and upcoming software engineering technologies and practices
- Ability to explain technical ideas and concepts
- Extensive understanding of the best practices for serverless and SaaS operations
- Comprehensive understanding of high availability, scalable and distributed systems; familiar with limitations of CAP
- In-depth understanding of cloud technologies, in particular AWS Services and tools (Cloud Formation, Lambda, DynamoDB, SQS, SNS, EC2, S3, AWS CLI, Boto3, etc.)
- Experience working with and building RESTful APIs
- Familiarity with the use and configuration of CI & CD pipelines such as Jenkins & AWS CodePipeline
- Use of the Git version control system
- Understanding of fundamental databases concepts for both relational and 'NoSQL' type persistence stores
- Extensive experience with Python is an asset
- Ability to manage your own time and priorities in pursuit of the team's objectives
- Strong sense of ownership, customer service, and integrity
Experience: 5 years +
Work mode: Remote/Hybrid
Office Timing: 1pm to 9pm IST
The Team:
The evertz.io Engineering Team builds next-generation systems for content management and distribution in the Media and Entertainment industry. Disney, NBCUniversal, Discovery, BBC, and many other content producers and publishers use our products and services to make the most of their file-based and live content for the least effort.
We work with high quality video in real-time and non-real-time scenarios across a wide range of cutting-edge tech. Specializations within the group span from low-level video manipulation and analysis, through back-end management and orchestration services, to web delivered UIs. Working in parallel with these teams is the Scientific Computing Group who work in computer vision, data science and machine learning, taking experiments in Jupyter notebooks through to deployment in production. This makes for a challenging and rewarding engineering experience of continual learning and plenty of opportunity to explore different parts of the stack.
Our technology stack includes a Serverless microservice architecture that capitalizes on the full breadth of AWS services with code written in Python, Rust and Java, our UI uses the latest versions of Angular, Typescript and NgRx, our CI/CD pipelines leverage AWS, Jenkins, Nexus, and Bazel in addition to our in-house release-management application to build and release 100’s of software components.
As a Site Reliability Engineer, you will join our talented and passionate team building evertz.io: a collection of services that will be used by the biggest names in the exciting broadcast and media industry. Our services are hosted in AWS, with a Serverless First mindset.
We work in agile, low-bureaucracy, high-creativity, cross-functional teams spread across the world. It’s a highly creative work environment where we support your growth with opportunities for career progression, mentoring others and third-party education. The team is built on trust and is relaxed, open and welcoming to all.