Data?1663689639
Site Reliability Engineer @ Cocus

Description

COCUS PORTUGAL

COCUS is all about People! We are proud to deliver skilled services and products developed by great talent, with attitude and ambition to work in innovative IT solutions. 

We are partnering with worldwide industry leaders and always looking for the brightest minds to have fun working in Web and Mobile development, Cloud Computing, Data Engineering, Machine Learning and IoT. 

Emotions are part of us, we encourage everyone to be what they truly are in our collaborative, informal, transparent, and open environment where everyone can contribute to the path to achieve our goals as a Team!

What you will be doing:

As a Site Reliability Engineer, you will be part of a cross-functional or practice team that enables site reliability engineering skills and capabilities across a whole domain. Being an enthusiast in SRE, with a strong DevSecOps mindset, and thanks to your excellent collaboration skills you will work with your team to deliver the best answers to our customer’s needs and to take over full responsibility for its applications, from design to operation.

  • You care diligently about the quality of your work, including proper documentation and security aspects.
  • You will use your deep technical skills to enable your team to deliver operational excellence and ensure and improve the reliability, performance and maintainability of systems and services.
  • You work closely with your team to understand the operational processes, technical and business needs of the products and services your team is responsible for.
  • You ensure observability of systems and services, support change and configuration management.
  • You will be involved in raising operational readiness requirements as part of the development life cycle and validate software development and delivery is consistent, meeting the specified requirements.
  • You are hunting for performance optimizations and recognizing upcoming problems before our customers are impacted.
  • You will continuously improve CI/CD and automation maturity and efficiency. You will support your team with efficient incident handling and quick reaction to production problems. For this, you can expect to take part in an on-call rota.
  • You can work hands-on, being able to tackle the whole design, build, test, and deploy cycle and thus also take proactive corrective action where required.
  • You are able to verbalize your thoughts and ideas and take the initiative to translate ideas into outcomes.
  • You are demonstrating active contribution to Communities of Practice, including collaboration in shared initiatives.
  • You love to work in an international, intercultural team.
  • You always drive for technical excellence, ownership and self-organization at the team and personal levels.
  • You love to learn and acquire new skills and keep up to date with the latest developments in your focus areas.

What we are looking for:

  • Experience working with highly available, distributed systems
  • Strong experience with monitoring/observability solutions, preferably Datadog, as well as with incident response solutions, like PagerDuty
  • Strong hands-on experience with Amazon Web Services (AWS), AWS Certification at Associate level or above
  • Willingness to take part in on-call rota
  • Infrastructure as code (Ansible, Chef, Puppet, Terraform, AWS CloudFormation) and Kubernetes, Docker
  • Good experience with CI/CD, preferably Gitlab CI
  • Deep automation expertise, and hands-on with some programming languages, e.g. Bash Script, Java or JavaScript/NodeJS
  • Ability to apply server operating system administrative knowledge, mostly Linux based
  • Good knowledge of networking and security aspects
  • Customer-centric, passionate about delivering great digital products and services
  • Demonstrating true engineering craftsmanship mindset
  • Passionate about continuous improvement, collaboration and great teams
  • Strong problem-solving skills coupled with good communication skills
  • Understanding of social and ethical implications of software engineering (e.g. like described in the ACM Code of Ethics)
  • Open minded, inquisitive, life-long learner
  • Comfortable with ambiguity, highly autonomous.

What we can offer you:

  • The opportunity to work on innovative and global projects in a fast-paced environment, having a direct impact on the solution/application
  •  Informal and friendly culture that rewards innovation and teamwork
  • Permanent work contract and salary according to experience
  • Annual performance bonus
  • FlexOffice - You choose from where to work in Portugal and receive an individual budget to set up your workstation
  • 24 vacation days + 1 day per year of tenure + your birthday
  • Continuous development and training + internal knowledge sharing
  •  Pet friendly office in Matosinhos-Sul, a 5-minute walk from the beach with public transportation around
  • 3K referral program - invite a friend to join the team!
  • Co-payment for monthly gym subscription or public transport pass
  • Besides other standard perks (Coverflex ticket meal, health insurance for you and your family…)
  • Flexible schedules and higher wage liquidity using Tickets® “Infância”, “Educação”, “Ensino”. 

Interested? Please apply through: https://talent.sage.hr/jobs/72672523-7a96-42ff-b14d-4ac7b9d2a5dd