Argonne

Postdoctoral Appointee – Profiling AI Workflows on Supercomputers

Lemont, IL USA Full time

The Argonne Leadership Computing Facility’s (ALCF) mission is to accelerate major scientific discoveries and engineering breakthroughs for humanity by designing and providing world-leading computing facilities in partnership with the computational science community. We help researchers solve some of the world’s largest and most complex problems with our unique combination of supercomputing resources and computational science expertise.

The ALCF Performance Engineering Group invites applications for a postdoctoral appointee to develop and scale profiling capabilities for large, heterogeneous HPC workflows that combine AI and traditional modeling and simulation (ModSim). You will work with cutting-edge exascale systems and novel AI hardware, collaborating closely with science application teams, academia, industry partners, and other national laboratories.

Objective:

  • Enhance THAPI: Extend and optimize the THAPI profiler (https://github.com/argonne-lcf/THAPI) to concurrently profile AI/ML and ModSim components at scale.
  • API & Tracing Integration: Design and implement new tracing API layers to capture fine-grained performance data across diverse runtime environments. The main targets will be tracing additional communication layers (NCCL, libfabric) and Python-based applications (either via Python internals or via native Python libraries supports such as PyTorch).

Position Requirements

Required skills, knowledge and abilities:

    • Ph.D. (completed within the last 0-5 years) or equivalent experience in a computational science discipline, computer science, or in a related field.
    • Hands-on experience with performance profiling and tracing tools (LTTng, Babeltrace, perf, ftrace, etc.).
    • Strong C (and/or C++) system-programming skills and familiarity with dynamic linking (e.g., ldd).
    • Experience developing and optimizing scientific workflows, ideally combining AI and traditional simulations.
    • Experience with scientific computing and software development on HPC systems.
    • Ability to conduct independent research and demonstrated publication record in peer-reviewed journals and conferences.
    • The successful candidate will be expected to work with and contribute to open-source projects and community-driven initiatives within computational science.
    • Effective communication skills, both verbal and written, for effective collaboration with interdisciplinary teams and clear presentation of complex technical information.
    • Ability to model Argonne’s core values of impact, safety, respect, integrity and teamwork.

    Preferred skills, knowledge and abilities:

    • Proficiency in additional programming languages (e.g., C++, Ruby) and metaprogramming technique.
    • Experience with HPC programming models (MPI, OpenMP, SYCL, Cuda).
    • Experience in writing technical papers and presentations.

    Job Family

    Postdoctoral

    Job Profile

    Postdoctoral Appointee

    Worker Type

    Long-Term (Fixed Term)

    Time Type

    Full time

    The expected hiring range for this position is $72,879.00-$121,465.00.

    Please note that the pay range information is a general guideline only. The pay offered to a selected candidate will be determined based on factors such as, but not limited to, the scope and responsibilities of the position, the qualifications of the selected candidate, business considerations, internal equity, and external market pay for comparable jobs. Additionally, comprehensive benefits are part of the total rewards package.

    Click here to view Argonne employee benefits!

    As an equal employment opportunity employer, and in accordance with our core values of impact, safety, respect, integrity and teamwork, Argonne National Laboratory is committed to a safe and welcoming workplace that fosters collaborative scientific discovery and innovation. Argonne encourages everyone to apply for employment. Argonne is committed to nondiscrimination and considers all qualified applicants for employment without regard to any characteristic protected by law.

    Argonne employees, and certain guest researchers and contractors, are subject to particular restrictions related to participation in Foreign Government Sponsored or Affiliated Activities, as defined and detailed in United States Department of Energy Order 486.1A. You will be asked to disclose any such participation in the application phase for review by Argonne's Legal Department.  

    All Argonne offers of employment are contingent upon a background check that includes an assessment of criminal conviction history conducted on an individualized and case-by-case basis.  Please be advised that Argonne positions require upon hire (or may require in the future) for the individual be to obtain a government access authorization that involves additional background check requirements.  Failure to obtain or maintain such government access authorization could result in the withdrawal of a job offer or future termination of employment.