**Project** Description**:**Responsibilities**:- As an Engineer working on an industry-leading Data Platform, you will collaborate with other experienced software engineers to drive improvements to our technology, design and develop new services and software solutions, and build and track metrics to ensure high-quality results. You will have the chance to work with business partners and leaders to influence and drive product vision and lead the design of our systems.- We love contributing our work to open source communities, and have successfully done so with tools like Circus Train and Waggle Dance.- We guarantee you'll learn a lot and won't be bored!- Write clear, efficient, and well-tested code.- Be part of an agile team that is continuously learning and improving.- Develop scalable and highly performant distributed systems with everything this entails (availability, monitoring, resiliency).- Help us shape the future of Data Lakes.- Take architectural ownership for various critical components and systems.- Evolve development standards and design patterns.- Communicate and document solutions and design decisions.**Skills**:Must have- 4+ years of core and server-side Java programming (Spring, streams, lambdas).- To have all the attitude to learn about data pipeline and data engineering dutiesNice to have- Knowledge of the Hadoop ecosystem (Spark, Hadoop, Hive).- Experience with cloud computing platforms (AWS, EMR, S3, Kubernetes, Docker).- Experience with microservice architecture, design, and standard methodologies with an eye towards scale.- Experience working in an Agile way (scrum, code reviews, pair programming).- Experience with performance and scalability tuning, algorithms, and computational complexity.- Passionate about open source**Languages**:English: C1 Advanced**Seniority**:Senior**Relocation package**:If needed, we can help you with relocation process.Vacancy SpecializationBigData DevelopmentRef NumberVR-78014