Xai

Audio English/Multilingual Tutor

Palo Alto, CA Part Time

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI's mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts. Your work will focus on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI's handling of multilingual audio nuances.

Responsibilities

You will use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages. You must support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards. In this role, you will collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing. You’ll also work with technical staff to improve annotation tools for efficient audio workflows.

Required Qualifications

  • Native or near-native fluency in English and at least one additional language, with clear, natural vocal delivery and pronunciation suitable for audio recording purposes.
  • Strong auditory perception to identify nuances in speech, accents, pronunciation, intonation, and audio quality across languages.
  • Demonstrated ability to handle multilingual audio content, including evaluating speech accuracy, cultural vocal expressions, and contextual interpretation in spoken form.
  • Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.
  • Strong comprehension skills and the ability to make independent judgments on ambiguous or varied audio material, including noisy or accented speech.
  • Strong communication, interpersonal, analytical, detail-oriented, and organizational skills, with the ability to articulate audio-related feedback effectively.
  • Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Preferred Qualifications

  • Fluency in multiple languages beyond English, with exposure to diverse accents, dialects, or regional variations.
  • Experience in voice recording, dubbing, podcasting, audiobooks, speech data annotation, translation with audio components, linguistics (especially phonetics/phonology), or roles involving multilingual audio review and optimization.
  • Portfolio of voice samples or audio work, such as narrated recordings, spoken contributions, or multilingual speech projects.
  • Familiarity with basic audio recording tools, software, or concepts related to sound quality in spoken content.

Location & Other Expectations

  • This position is based in Palo Alto, CA, or fully remote.
  • The Palo Alto option is an in-office role requiring 5 days per week; remote positions require strong self-motivation.
  • If you are based in the US, please note we are unable to hire in the states of Wyoming and Illinois at this time.
  • We are unable to provide visa sponsorship.
  • Team members are expected to work from 9:00am - 5:30pm PST for the first two weeks of training and 9:00am - 5:30pm in their own timezone thereafter.
  • For those who will be working from a personal device, please note your computer must be a Chromebook, Mac with MacOS 11.0 or later, or Windows 10 or later.

Compensation

$30/hour - $75/hour

The posted pay range is intended for U.S.-based candidates and depends on factors including relevant experience, skills, education, geographic location, and qualifications. For international candidates, our recruiting team can provide an estimated pay range for your location.

Benefits:

Hourly pay is just one part of our total rewards package at xAI. Specific benefits vary by country, depending on your country of residence you may have access to medical benefits. We do not offer benefits for part-time roles.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.