About us:
Working at Target means helping all families discover the joy of everyday life. We bring that vision to life through our values and culture. Learn more about Target here.
As a Senior Engineer, you serve as a specialist in the engineering team that supports the product. You help develop and gain insight in the application architecture. You can distill an abstract architecture into concrete design and influence the implementation. You show expertize in applying the appropriate software engineering patterns to build robust and scalable systems. You are an expert in programming and apply your skills in developing the product. You have the skills to design and implement the architecture on your own, but choose to influence your fellow engineers by proposing software designs, providing feedback on software designs and/or implementation. You show good problem solving skills and can help the team in triaging operational issues. You leverage your expertise in eliminating repeat occurrences.
Role Overview
We are seeking a Senior Engineer with strong expertise in distributed storage systems and full-stack application development. This role requires deep ownership of production systems, automation maturity, and architectural leadership. You will be responsible for maintaining high-performing storage clusters while contributing to scalable application development and infrastructure automation initiatives.
Core Responsibilities
Storage Infrastructure & Reliability
Maintain, enhance, and performance-tune distributed storage clusters.
Drive improvements in availability, scalability, and capacity planning.
Strengthen observability, monitoring, and alerting capabilities.
Lead root cause analysis and remediation of production incidents.
Improve system resilience and operational reliability.
Automation & CI/CD
Design and implement infrastructure automation solutions.
Enhance CI/CD pipelines for cluster provisioning, maintenance, and application deployments.
Reduce manual operational overhead through robust tooling and Infrastructure as Code practices.
Production Support & Troubleshooting
Troubleshoot complex production issues across storage and application layers.
Perform deep debugging in distributed environments.
Implement preventive measures to avoid incident recurrence.
Application Development
Design, develop, and maintain scalable full-stack applications.
Build clean, extensible, and well-tested code aligned with engineering best practices.
Optimize applications for performance, reliability, and security.
Architecture & Engineering Excellence
Contribute to architectural decisions with a focus on:
Performance
Reliability
Security
Maintainability
Uphold engineering standards, code quality, and operational readiness.
Required Qualifications
Strong experience maintaining and performance tuning distributed storage clusters.
Proficiency in Go and/or Kotlin.
Experience with infrastructure automation tools (e.g., Ansible, Terraform, Chef, or similar).
Hands-on experience building and maintaining CI/CD pipelines.
Strong understanding of distributed systems and production-grade environments.
Demonstrated ability to troubleshoot complex system issues.
Experience designing scalable and resilient systems.
Preferred Qualifications
Experience with Ansible for configuration management.
Experience operating large-scale production environments.
Familiarity with container orchestration platforms (e.g., Kubernetes).
Knowledge of observability tooling (metrics, logging, tracing).
Understanding of security best practices in distributed systems.