**Trinity Industries, Inc.** is seeking a **Data Engineer** to join our Queretaro, Mexico Office!
**What you'll do**:
- Design data models and data storage solutions in the cloud (e.g., AWS, Azure, GCP with strong preference for Azure Cloud)
- Proficiency in programming languages commonly used in data engineering, such as Python/PySpark, SQL/Spark-SQL, Java, and/or Scala
- Experience with data storage and processing technologies (e.g., Hadoop, Spark, Kafka, SQL databases)
- Build processes supporting query optimization, data transformation, data structures, metadata, dependency, and workload management
- Experience with API based data acquisition and management
- Work with data scientists to facilitate technical design of complex data sourcing, transformation, and aggregation logic, ensuring business analytics requirements are met
- Works with big data pipelines to develop streaming analytics
- Utilize business intelligence data visualization tools and techniques to translate business analytics into solutions
- Experience and familiarity with parallel processing / multithreading and compute optimization
- Must possess effective communication skills, both verbal and written
- Strong organizational, time management and multi-tasking skills
**What you'll need**:
- Bachelor's (preferably master's in computer science, business analytics, mathematics, and/or quantitative fields) or equivalent with a minimum of 5 years of relevant experience
- Proficiency in programming languages relevant to data engineering, including Python, Java, or Scala
- Deep understanding of data modeling and data pipelines
- Strong experience with cloud-based data solutions, especially AWS, Azure, and GCP
- Familiarity with parallel processing, multithreading, and compute optimization
- Must be fully fluent in both English and Spanish/conversationally and written
**Nice to have**:
- Expertise in designing cloud solutions across platforms like Azure, AWS, and Google (preferably Azure)
- Background in leveraging semi-structured and unstructured data for NLP pipelines and image classification pipelines