Data Engineer [Multiple Positions Available]
DESCRIPTION:
Duties: Identify, analyze, and interpret trends and patterns in complex data sets acquired from various data sources, and extract, transform, and load data. Enforce guidelines to ensure consistency, quality, and completeness of data assets. Work on the migration of infrastructure, data, and applications out of legacy data centers into cloud and hybrid environments. Design and implement data pipelines. Develop and maintain data processing and transformation scripts. Build and maintain data warehouses and data lakes. Deliver work products on time and on budget while applying quality assurance best practices.
QUALIFICATIONS:
Minimum education and experience required: Master's degree in Management Information Systems, Computer Science, Data Analytics, Business Analytics, or related field of study plus 3 years (36 months) of experience in the job offered or as Data Engineer, Systems Engineer, Software Engineer, Data Analyst, or related occupation.
Skills Required: This position requires experience with the following: conducting data engineering, automation, orchestration, and full-stack integration across on-premises and cloud environments; designing, developing, and maintaining scalable ETL and data-pipeline frameworks using AWS services (including Glue, EMR, Lambda, S3, Redshift, and CloudFormation) and Databricks for large-scale analytical and streaming workloads; leveraging Apache Spark, Kafka, Hadoop, Hive, and Airflow for batch and real-time data ingestion, transformation, and scheduling; using Python, SQL, Java, PL/SQL, and Unix shell scripting to build reusable frameworks for data validation, testing, automation, and performance tuning; provisioning infrastructure and managing configuration using Ansible, and performing continuous integration/deployment through Jenkins, Spinnaker, Jules, and Terraform (TFE); implementing data masking, lineage, and entitlement governance via Immuta; orchestrating workloads with Kubernetes; integrating datasets through RESTful APIs to downstream systems and applications; supporting the development of automation dashboards, monitoring utilities, and data-driven microservices using ReactJS, Redux, Node.js, and JavaScript frameworks; ensuring data reliability and SLA compliance through production job monitoring, incident resolution, and ticket management using ServiceNow and JIRA; preparing curated datasets for reporting and visualization; optimizing data models across Redshift, Oracle, PostgreSQL, MySQL, and Cassandra; and operating under Agile SDLC methodology while ensuring automation, scalability, governance, and continuous improvement of data engineering processes and infrastructure.
Job Location: 8181 Communications Parkway, Plano, TX 75024.
Full-Time.