Department Overview

The world is changing rapidly, and our customers are inventing new ways to meet that change confidently, with our help. We transform engineering and technical workflows with solutions that provide the right data at the right time. Our unparalleled combination of expansive technical knowledge and AI-based technology helps leaders build the right connections across their teams and workflows, so they can focus on designing for a world that runs faster, cleaner, safer, and smarter for everyone.

Our development team architects and designs high-availability, scalable, and fault-tolerant systems that are decoupled and easy to maintain. A core part of our development philosophy revolves around microservices and the DevOps model. All our new products are developed using a microservice architecture, are containerized, and are then deployed on container management systems such as Kubernetes. The developers on our teams subscribe to a DevOps model in which time-to-market is a vital measure of our performance, productivity, and success. We are committed to staying ahead of the curve, and we are always looking at new technologies that can enhance our product offerings.

Position Summary

Are you passionate about the latest technologies in Data Engineering that are critical to the success of Data Science and Machine Learning projects in the Natural Language Processing domain? Come and be part of S&P Global's Artificial Intelligence team! We are building deep-learning-based natural language processing, information retrieval, document understanding, data mining, and knowledge engineering solutions into S&P Global intelligent products that serve all major industries and markets.

S&P Global is looking for a Data Engineer/Python (NLP, Knowledge Management and Information Retrieval domain) to join our AI Research & Development department.
In this role, you will be responsible for the data engineering aspects of data-driven projects, building robust data processing pipelines, and handling all questions related to the data lifecycle.

**Job Responsibilities**:
- Own data engineering in projects involving Machine Learning, Natural Language Processing, and Information Retrieval
- Work in a team with data scientists, ML engineers, and developers to build intelligent capabilities into company products
- Ensure dataset quality and suitability for ML projects (automation of labeling, inspection and cleaning, normalization, augmentation)
- Discover new data (finding and obtaining the raw data necessary for experimental setups, e.g. finding and downloading available data from the internet with focused crawling)
- Develop data processing and transformation pipelines (designing ETL systems for ML/DL projects, designing online learning loops, embedding active learning algorithms into the data annotation toolset, etc