We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions.
With over 8,000 CI&Ters around the world, we’ve built partnerships with more than 1,000 clients over our 30-year history. Artificial Intelligence is our reality.
We are looking for a Data Architect responsible for automated data pipelines, infrastructure as code, deployment patterns, and architecture governance for an ML-based demand forecasting system in the public transportation sector.
Key Responsibilities:
Design and Architecture:
- Design and implement AWS data pipelines using AWS Glue, Step Functions, and Amazon S3.
- Develop architecture for analytics workloads, including data lake patterns and event-driven pipelines.
- Establish a security architecture covering IAM policies, encryption, and least-privilege access.
Infrastructure as Code:
- Utilize AWS CloudFormation and/or AWS CDK (TypeScript or Python) for infrastructure automation.
- Deliver Infrastructure as Code (IaC) solutions in production environments.
Machine Learning Deployment:
- Design and implement ML model deployment patterns using Amazon SageMaker, including endpoints, batch transforms, and inference pipelines.
- Manage the CI/CD processes for ML pipelines and infrastructure deployments.
Monitoring and Observability:
- Set up monitoring solutions using CloudWatch, including alarms and health dashboards for data pipelines.
Collaboration and Documentation:
- Engage with customer development teams for knowledge transfer and documentation.
- Contribute to project management in Firm Fixed Price or milestone-based delivery engagements.
Required Skills and Qualifications:
Knowledge & Skills:
- Proficient in AWS data pipeline design, including:
  - AWS Glue (ETL jobs, crawlers, catalog)
  - AWS Step Functions
  - Amazon S3
- Expertise in Infrastructure as Code (IaC) using AWS CloudFormation and/or AWS CDK (TypeScript or Python).
- Experience deploying SageMaker models in sandbox and production environments.
- Experience designing and deploying data/ML architectures on AWS.
- Experience in architecture design for analytics workloads, including RDS-to-S3 ingestion.
- Strong understanding of security architecture, including IAM policies and VPC design.
- Knowledge of monitoring tools such as CloudWatch and pipeline health dashboards.
- Understanding of hybrid connectivity solutions (VPN, Direct Connect) and familiarity with Oracle RDS.
Expected Certifications:
• AWS Certified Solutions Architect – Associate
• AWS Certified Data Engineer – Associate or AWS Certified DevOps Engineer – Professional