Python Job: Data Engineer

Job added on

Company

Samotics B.V.

Location

Amsterdam - Netherlands

Job type

Full-Time

Python Job Details

“We are a fast-growing, not-too-large company. Each software engineer is end-to-end responsible for key components of our infrastructure or solution, rather than for individual pieces of the puzzle. To me, this is what makes it engineering: it’s about architecture, design and implementation.” - Bart Römgens, one of Samotics’ lead software engineers


At Samotics, we develop AI-based predictive analytics for industrial machinery.
The Infrastructure team is looking for a high-performing data engineer to help us scale our fault detection analysis to many thousands of sensors, building core infrastructure that supports our machine learning models.


For whom?

For this role, we are looking for a senior data engineer—we typically find that 5 years of full-time experience are a minimum to hit the ground running. If you have less experience but are a high-performing and high-potential data or software engineer, we would still love to get to know you and discuss a junior role on the team. If you have significantly more experience than we're looking for in this role, we are equally excited to meet you and will be able to discuss a more senior role. In both cases, please apply for our general software engineer vacancy (junior or senior).


The full list of requirements for this role can be found below.


About Samotics

Passionate about solving the problem of unplanned downtime, Samotics uses AI-driven electrical signature analysis to create smart predictive maintenance solutions that increase productivity and reduce energy waste while saving customers’ time and money.


We serve customers such as Total, Schiphol, Nouryon and ArcelorMittal. Our product, SAM4, leverages machine learning algorithms and IoT sensors to detect upcoming failures in critical industrial assets up to months in advance. But SAM4 does more than just prevent unplanned downtime. It also enables the world's largest and most successful industrial companies to significantly reduce energy waste.


We are a growing, energetic company where professionalism, a great atmosphere, and cutting-edge technology go hand in hand. Our team in Leiden, the Netherlands currently has 50 people and continues to grow. Some of these employees work remotely.


What you will do

As a data engineer, you will apply your skills at the forefront of data processing, using cutting-edge tools and technologies to help us scale up our fault detection analysis to many thousands of sensors. Your key responsibility is to build core infrastructure that supports running machine learning models that do near-real-time prediction.


You will be part of the data infrastructure team, and will be working on a daily basis with both data scientists and software engineers. Some of your daily tasks may include:


  • Support and improve data science pipelines

  • Work closely with the data science and analyst teams to build the best infrastructure for their needs

  • Build CI/CD infrastructure for our data architecture components that includes code linting, tests, monitoring and deployment

  • Debug core performance problems around databases, data pipeline efficiency and code complexity in both production & research environments

You will be monitoring the data flow, databases and other systems related to our data science pipeline, where you are again focused on robustness and efficiency.


How we work

The development team uses its own simple, practical, and functional methodology, allowing you to fully focus on software engineering without having to switch tasks. You will work based on the principle of freedom and responsibility. We take pride in combining professionalism, trust, and very few meetings.


The development team

The software engineering team currently consists of 9 experienced developers working on a large architecture covering a wide domain, from signal processing on our edge devices through dashboarding for different user groups. Continuous improvement and daily deployments are at the heart of the high level of quality we deliver. The data infrastructure team sits within the software engineering department and plays a key role there, while also interacting daily with the data science team and the head of product.


Tech stack

Since we work on cutting-edge technology, we use an up-to-date stack:

  • Python to orchestrate our infrastructure

  • AWS—we’re using a broad range of AWS services (20+)

  • Terraform

  • The team prefers to use Linux distributions for their development environments


What we offer

  • A competitive salary

  • Ability to work 100% remotely

  • Ability to work part time (4 days)

  • Access to a wide range of learning programs

  • Generous parental leave

  • A brand-new, custom-designed office (with 90% recycled materials) at a three-minute walk from Leiden Central Station

  • Daily fresh lunches and healthy snacks when working in the office

  • The chance to work on cutting-edge technology that affects industry worldwide

  • A high-performing software engineering team

  • A fun and professional company culture based on the principle of freedom and responsibility


Please only apply if you recognize yourself in all of the below requirements on experience and behavior.


Experience

  • Computer science background or similar, and a good understanding of data engineering

  • A good understanding of / experience with modern databases, both SQL and NoSQL

  • Experience with designing and running cloud infrastructure, including distributed systems and clusters


Behavior

  • Demonstrated ability to design excellent solutions, by using available technology in a creative and elegant way

  • Ability to work in fast-paced environment where production deployments take place multiple times a day

  • Keen on experimenting with and implementing new technologies

  • Truly owning the job, and the responsibilities that come with it—taking initiative and demonstrating sound judgment