Department of Education

IT Spec (ENTARCH) "Platform/site Reliability Engineer", GS-2210-14 FPL GS-14 (DE)

San Francisco, California, Denver, Colorado, District of Columbia, District of Columbia, Atlanta, Ge Full time

IT Spec (ENTARCH) "Platform/site Reliability Engineer", GS-2210-14 FPL GS-14 (DE)

Department: Department of Education

Location(s): San Francisco, California, Denver, Colorado, District of Columbia, District of Columbia, Atlanta, Georgia, Chicago, Illinois, Boston, Massachusetts, Kansas City, Missouri, New York, New York, Philadelphia, Pennsylvania, Dallas, Texas, Seattle, Washington

Salary Range: $127829 - $197200 Per Year

Job Summary: These positions located in the Federal Student Aid (FSA), Chief Technology Office Technology Operations Division. FSA is modernizing the systems that serve over 17 million students and power more than $120 billion in financial aid each year. We are building a team of senior platform and reliability engineers to strengthen the technical foundation of one of the federal government's highest-impact digital ecosystems.

Major Duties:

  • APPLICATION LIMIT: This vacancy announcement is limited to the first 250 applications received and will close at 11:59PM Eastern Time on the day that we receive the 250th application, or at 11:59PM Eastern Time on the listed closing date, whichever occurs first. We encourage you to read this entire vacancy announcement prior to submitting your application. As a Platform/Site Reliability Engineer, you will lead the design, development, and evolution of the cloud platforms, automation, and reliability systems that power FSA's applications. You will develop infrastructure, tooling, and observability capabilities that enable teams to deliver secure, reliable, and high-performing services at scale. You'll collaborate with cross-functional partners to standardize cloud architectures, improve system reliability, and modernize FSA into a platform-driven, engineering-centric organization. This role blends the mission of public service with the complexity of major commercial cloud and SRE organizations. Your job is to lead the creation of the platforms, guardrails, and reliability practices that let teams ship changes safely and confidently. If you enjoy designing scalable infrastructure, optimizing system reliability, and enabling engineers to move faster, this is the role. As a Platform/Site Reliability Engineer, GS-2210-14, you will be responsible for: Serving as an advisor to the IOG Director and Chief of the Network Support Division, acting as a network architect and engineer to develop and implement solutions across cloud and on-premises environments, while designing reusable platform services, container environments, identity integrations, networking patterns, and infrastructure components. Provide input to design and technical documentation, review final deliverables, and ensure adherence to the enterprise network operations engineering framework through leadership, while serving as a principal-level expert in platform engineering, cloud architecture, Site Reliability Engineering (SRE) practices, and infrastructure automation. Engage with technology leaders, business partners, and contractors to ensure operational requirements and needs are met, while clearly communicating technical concepts to non-technical stakeholders and producing platform standards, design documents, and technical evaluations. Evaluate system security plans and procedures, manage and direct office support contractors, address IT compliance issues, and oversee project planning and updates, while designing and maintaining continuous improvement/continuous Development (CI/CD) pipelines to support automated testing, deployment, change control, and compliance validation. Drive network engineering direction and response for CISA Binding Operational Directives (BODs) impacting data center operations, developing plans and processes to strengthen security, while implementing secure cloud configurations, identity and access management (IAM) models, encryption, and zero-trust architectural patterns.

Qualifications: Selective Placement Factor You must meet the following selective placement factor: Candidates must possess any combination of the following certifications from a recognized professional organization at the time of hire and acceptance of the position: IT Information Library v4 (ITIL) Project Management Professional (PMP) AWS Certified Advanced Networking Certified Information Systems Security Professional (CISSP) Certified Cloud Security Professional (CCSP) F5 Networks Certified Technology Specialist Applicants who do not meet the Selective Placement Factor will be rated as not qualified and will not receive further consideration for this position. Minimum Qualification Requirements You may meet the minimum qualifications for the GS-14, if you possess the specialize experience. Specialized Experience for the SUPVY IT Specialist (Project Manager), GS-2210-14 One year of experience in either federal or non-federal service that is equivalent to at least a GS-13 performing two (2) out of three (3) of the following duties or work assignments: 1. Direct technical experience designing, deploying, and operating scalable cloud platforms using Infrastructure as Code (IaC), CI/CD, containers, and automated security controls to accelerate engineering delivery and ensure compliance. 2. Direct technical experience enhancing reliability and observability for distributed systems, including use of tracing/metrics/logs for observability, SLO/SLA development, incident response and analysis, and/or measurable performance/automation improvements. 3. Experience translating platform and reliability engineering concepts into clear documentation, technical standards, and architecture guidance for non-technical audiences, and influencing engineering practices across multiple teams. Basic Experience Requirements You must possess IT related experience (paid or unpaid experience and/or completion of specific, intensive training (e.g., IT certification), as appropriate) demonstrating each of the four competencies listed below. 1. Attention to Detail - Is thorough when performing work and conscientious about attending to detail. 2. Customer Service - Works with clients and customers (i.e., any individuals who use or receive the services or products that your work unit produces, including the general public, individuals who work in the agency, other agencies, or organizations outside the Government) to assess their needs, provide information or assistance, resolve their problems, or satisfy their expectations; knows about available products and services; is committed to providing quality products and services. 3. Oral Communication - Expresses information (e.g., ideas or facts) to individuals or groups effectively, taking into account the audience and nature of the information (e.g., technical, sensitive, controversial); makes clear and convincing oral presentations; listens to others, attends to nonverbal cues, and responds appropriately. 4. Problem Solving - Identifies problems; determines accuracy and relevance of information; uses sound judgment to generate and evaluate alternatives, and to make recommendations. Knowledge, Skills, and Abilities (KSAs) The quality of your experience will be measured by the extent to which you possess the following knowledge, skills and abilities (KSAs). You do not need to provide separate narrative responses to these KSAs, as they will be measured by your responses to the occupational questionnaire (you may preview the occupational questionnaire by clicking the link at the end of the Evaluations section of this vacancy announcement). 1. Skill in designing and implementing cloud and hybrid network solutions, including reusable platform services, container environments, identity integrations, and infrastructure components. 2. Skill in applying systems engineering and Site Reliability Engineering (SRE) concepts to ensure reliability, performance, scalability, security, and maintainability across complex, multi-cloud environments. 3. Knowledge of platform and reliability engineering principles and the ability to apply them through real-world implementation, debugging, optimization, and modernization of cloud environments. 4. Skill in computer engineering cloud automation, observability tooling, testing frameworks, and Continuous improvement/Continuous development (CI/CD) pipelines, including telemetry, logging, alerting, and distributed tracing. 5. Ability to leverage modern cloud, data, and security technologies to design, test, and deploy resilient platform and reliability systems that support mission-critical applications.

How to Apply: Step 1: Create a USAJOBS account (if you do not already have one) at www.usajobs.gov. Step 2: Create a resume using the USAJOBS resume builder or upload a resume into your USAJOBS account. Ensure that your resume demonstrates your education, experience, training, and accomplishments as it relates to the qualifications for this position and substantiates your responses to the occupational questionnaire. Step 3: Upload any required documents into your USAJOBS account (must be less than 3MB and in one of the following document formats: GIF, JPG, JPEG, PNG, RTF, PDF, or Word (DOC or DOCX)). Step 4: Click "Apply Online" and follow the prompts to complete the occupational questionnaire and attach any required documents. Verify that uploaded documents from USAJOBS transfer into the agency's hiring system. You will have the opportunity to upload any additional required documents in the agency's hiring system. Click “Finish” to submit your application. NOTE: You may update your application or required documents at any time while the announcement is open by logging into your USAJOBS account, clicking on "Application Status," clicking on the position title, clicking "Update Application,” and following the prompts. In order to receive consideration for this position, you must submit your complete application, including all required documents, by 11:59 PM Eastern Time on the closing date of the vacancy announcement. If the vacancy announcement has an application limit, we recommend that you submit your complete application at the time of initial application. We will not accept any required documentation after the closing date of the vacancy announcement. If you have any questions regarding submitting your application, please contact the HR Specialist listed under the Agency Contact Information.

Application Deadline: 2026-05-14