DevOps Engineer
Web Gaming
Work Model
Hybrid
Location
Porto, Portugal
Experience
1 - 3 years
Contract type
Full-time
We are seeking a highly skilled and motivated DevOps Engineer with a focus on observability to join our team. As our Observability Specialist, you will be responsible for designing, implementing, and maintaining monitoring and alerting systems across our infrastructure, applications, and services. You will work closely with our development teams to ensure that all our systems are highly available, reliable, and performant.
- Design and implement observability solutions that provide visibility into our systems, applications, and services
- Develop and maintain monitoring and alerting systems that ensure the health and performance of our infrastructure
- Work closely with our development teams to ensure that our applications and services are highly available, reliable, and performant
- Continuously improve our observability capabilities by researching and implementing new tools and technologies
- Collaborate with other teams to troubleshoot and resolve production issues
- Integrate logs with time series data for event correlation
- Help us unlock the power of distributed tracing
- Participate in on-call rotations and respond to alerts in a timely manner
- Write clear and concise documentation for all observability systems and processes
- Mentor and coach other team members on observability best practices and principles.
WHAT YOU WILL DO
- 3+ years of experience working in Observability/DevOps/SRE or similar
- Strong experience with observability tools (e.g Prometheus, Zabbix, Jaeger, ELK)
- Strong experience with log management tools (e.g. ElasticSearch, Splunk, Graylog)
- Experience with building dashboards (e.g. Grafana, Kibana, Loki)
- Experience with alert and notification management (e.g. Alertmanager, PagerDuty, Discord)
- Strong experience with scripting and automation languages (e.g. Bash, Python, Ruby)
- Strong experience with cloud infrastructure (we use AWS)
- Experience with version control systems (we use Git)
- Demonstrable experience instrumenting applications for observability (e.g. OpenTelemetry, UpTrace)
- Excellent problem-solving and troubleshooting skills, with a willingness to take ownership of projects
- Strong communication and collaboration skills
- A desire for absolute clarity regarding the state and behavior of our infrastructure
WHAT WE EXPECT
- Linux proficiency (most of us use Linux on our machines)
- Experience with the Kubernetes ecosystem (e.g. Helm, Argo, Nginx, Rancher)
- Experience with CI/CD tools (e.g. Gitlab, Jenkins, etc)
- Experience with SQL & NoSQL database architecture and administration (e.g. PostgreSQL, Redis, MongoDB, Clickhouse)
- Experience with Python 3.x (bonus points for Django experience)
- Understanding of networking, load balancing, and high availability (e.g. Nginx, Cloudfront, Cloudflare, WAF)
- Security conscientiousness, with an emphasis on designing for security best practices
- Oral and written fluency in English
- Ability to work autonomously and independently
- An insatiable desire to learn and grow, personally and professionally.
WHAT WE VALUE
- Health Insurance
- Training: individual budget for training + training plan for department + knowledge sharing + TechTalks + Language lessons (English and Spanish)
- A challenging work (comes with free coffee & fruit!)
- Career Evolution (technical and leadership path)
- Mentoring, Coaching and Talent Development Programs
- Soccer, Padel and Volleyball Teams, Yoga and Chinese Boxing Class
- Power-ups in Coverflex Card
- Annual Nerf War
- Proximity is one of our values - you will have several events like team buildings, christmas party, company birthday party and many more events. Basically we like to be together
- Our culture and co-workers are the best thing about working at Fabamaq and we are proud of that
WHAT WE OFFER