Data?1548802806
Site Reliability Engineer @ Cocus

Description

At COCUS we are working at the critical intersection of IT and business. True to our name COCUS – Company for Customers – we are proud to develop tailored solutions focus on the Internet of Things, Blockchain, Data Analytics and Information Security. Our customers are world-leaders in der respective industries – telecommunications, tourism, media, automotive, transport and logistics – impacting the life of millions across the world.

To help our customers shape the future, we need the brightest minds today. This is a fantastic opportunity for someone with the passion to explore and the right experience to apply that passion and knowledge to the solutions we offer our customers, and experience one truly international, fun and productive working environment.

 

What you will be doing:

  • Design, build and support the infrastructure to support applications and supporting systems
  • Help to define standards and best practices around AWS technologies and how they should be adopted by internal and external IT teams
  • Provide support around Identity and Access management lifecycles across AWS accounts
  • Support CI/CD tooling for technical teams to be able to build and deploy using pipelines
  • Be on a duty rotation to respond to availability incidents and customer incidents (level 2 and level 3 support)
  • Use your on-call shift to prevent incidents from ever happening again
  • Spend at least 50% of your time on development and automation
  • Run our infrastructure with CloudFormation and Terraform (all environments)
  • Make monitoring and alerting alert on symptoms and not on outages (application and infrastructure in all environments)
  • Document every action so your findings turn into repeatable actions–and then into automation
  • Improve the deployment process to make it as boring as possible
  • Design, build and maintain core infrastructure pieces that allow scaling
  • Debug production issues across services and levels of the stack
  • Plan the growth of infrastructure
  • Creating blog posts
  • Contributions to handbook, runbooks, general documentation
  • Maintaining good relationships with other engineering teams that help improve the Enabling Data Platforms product.

 

What we are looking for:

  • Experience of networks, security, load balancers and DNS
  • AWS Certified SysOps Administrator – Associate or AWS Certified DevOps Engineer – Professional
  • Experience using and build pipelines using Jenkins
  • Security tooling (e.g. Tenable)
  • Experience using Hashicorp suite including Terraform, Packer, Vagrant and Vault
  • Good understanding AWS security features and best practices
  • Experience with AWS Data persistence services (e.g. S3, Dynamo, Aurora, Neptune), Lake formation, Data Analytics Services (e.g. Sage Maker, Athena, EMR, Redshift) and AWS IAM.
  • Experience in monitoring and logging (e.g. ELK, CloudWatch, CloudTrail, Grafana, Dynatrace, Datadog)
  • Design of self-healing and fault-tolerant services
  • Techniques and strategies for maintaining high availability
  • Certificate management
  • Fluent in written and spoken English
  • Bachelor’s Degree in Computer Engineering or similar.

 

What can we offer you:

  • The ability to work in Innovative projects for global projects in a fast-paced environment where you can have a direct impact on the application
  • Informal and friendly culture that rewards innovation and teamwork
  • Salary according to experience
  • Permanent Contract
  • Annual performance bonus
  • Gym Membership
  • Ticket meal
  • Continuous Development and Training
  • Health Insurance
  • Flexible schedules and remote work.
     

Send us your application through: https://talent.cake.hr/jobs/045eb5fb-f4d5-4021-9a55-cc2ddb9c2505