Support and improve data science pipelines
Work closely with the data science and analyst teams to build the best infrastructure for their needs
Build CI/CD infrastructure for our data architecture components that includes code linting, tests, monitoring and deployment
Debug core performance problems around databases, data pipeline efficiency and code complexity in both production & research environments
Python to orchestrate our infrastructure
AWS—we’re using a broad range of AWS services (20+)
The team prefers to use Linux distributions for their development environments
A competitive salary
Ability to work 100% remotely
Ability to work part time (4 days)
Access to a wide range of learning programs
Generous parental leave
A brand-new, custom-designed office (with 90% recycled materials) at a three-minute walk from Leiden Central Station
Daily fresh lunches and healthy snacks when working in the office
The chance to work on cutting-edge technology that affects industry worldwide
A high-performing software engineering team
A fun and professional company culture based on the principle of freedom and responsibility
Computer science background or similar, and a good understanding of data engineering
A good understanding of / experience with modern databases, both SQL and NoSQL
Experience with designing and running cloud infrastructure, including distributed systems and clusters
Demonstrated ability to design excellent solutions, by using available technology in a creative and elegant way
Ability to work in fast-paced environment where production deployments take place multiple times a day
Keen on experimenting with and implementing new technologies
Truly owning the job, and the responsibilities that come with it—taking initiative and demonstrating sound judgment
Amsterdam - Netherlands
Python Job Details
“We are a fast-growing, not-too-large company. Each software engineer is end-to-end responsible for key components of our infrastructure or solution, rather than for individual pieces of the puzzle. To me, this is what makes it engineering: it’s about architecture, design and implementation.” - Bart Römgens, one of Samotics’ lead software engineers
At Samotics, we develop AI-based predictive analytics for industrial machinery. The Infrastructure team is looking for a high-performing data engineer to help us scale our fault detection analysis to many thousands of sensors, building core infrastructure that supports our machine learning models.
For this role, we are looking for a senior data engineer—we typically find that 5 years of full-time experience are a minimum to hit the ground running. If you have less experience but are a high-performing and high-potential data or software engineer, we would still love to get to know you and discuss a junior role on the team. If you have significantly more experience than we're looking for in this role, we are equally excited to meet you and will be able to discuss a more senior role. In both cases, please apply for our general software engineer vacancy (junior or senior).
The full list of requirements for this role can be found below.
Passionate about solving the problem of unplanned downtime, Samotics uses AI-driven electrical signature analysis to create smart predictive maintenance solutions that increase productivity and reduce energy waste while saving customers’ time and money.
We serve customers such as Total, Schiphol, Nouryon and ArcelorMittal. Our product, SAM4, leverages machine learning algorithms and IoT sensors to detect upcoming failures in critical industrial assets up to months in advance. But SAM4 does more than just prevent unplanned downtime. It also enables the world's largest and most successful industrial companies to significantly reduce energy waste.
We are a growing, energetic company where professionalism, a great atmosphere, and cutting-edge technology go hand in hand. Our team in Leiden, the Netherlands currently has 50 people and continues to grow. Some of these employees work remotely.
What you will do
As a data engineer, you will apply your skills at the forefront of data processing, using cutting-edge tools and technologies to help us scale up our fault detection analysis to many thousands of sensors. Your key responsibility is to build core infrastructure that supports running machine learning models that do near-real-time prediction.
You will be part of the data infrastructure team, and will be working on a daily basis with both data scientists and software engineers. Some of your daily tasks may include:
You will be monitoring the data flow, databases and other systems related to our data science pipeline, where you are again focused on robustness and efficiency.
How we work
The development team uses its own simple, practical, and functional methodology, allowing you to fully focus on software engineering without having to switch tasks. You will work based on the principle of freedom and responsibility. We take pride in combining professionalism, trust, and very few meetings.
The development team
The software engineering team currently consists of 9 experienced developers working on a large architecture covering a wide domain, from signal processing on our edge devices through dashboarding for different user groups. Continuous improvement and daily deployments are at the heart of the high level of quality we deliver. The data infrastructure team sits within the software engineering department and plays a key role there, while also interacting daily with the data science team and the head of product.
Since we work on cutting-edge technology, we use an up-to-date stack:
What we offer
Please only apply if you recognize yourself in all of the below requirements on experience and behavior.