Algorithm Engineer - Audio Understanding - Start 2025

TIKTOK PTE. LTD.
Posted date: 3 days ago
Minimum level: N/A
Job category: Engineering
About TikTok
TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.
Why Join Us
Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.
We strive to do great things with great people. We lead with curiosity, humility, and a desire to make an impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
About the Team
The speech team's mission is to empower content understanding, interaction and creation across TikTok and other products using speech and audio technologies. We focus on cutting-edge R&D in areas like speech & audio, music processing, natural language understanding and multimodal deep learning. We are looking for top talent to work on these exciting technologies, integrate them into various TikTok and other products and ultimately bring joy to our global user base!
We are looking for talented individuals to join us in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.
Candidates can apply to a maximum of two positions and will be considered for jobs in the order in which they apply. The application limit applies to jobs at ByteDance and its affiliates globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.
Responsibilities
- Conduct cutting-edge research and development in speech/audio foundation models
- Contribute to the advancement of audio understanding, including multilingual speech recognition, speech translation, and multimodal understanding
- Drive the practical application of relevant technologies in business scenarios, including but not limited to closed captions, voice dubbing, and video understanding
Qualifications
Minimum Qualifications
- Final-year Ph.D. students or recent Ph.D. graduates in Computer Science, Engineering, or another quantitative field
- Experience in one or more areas of machine learning and deep learning, including but not limited to:
  - Automatic Speech Recognition
  - Automatic Speech Translation
  - Speech/audio self-supervised learning and foundation models
Preferred Qualifications
- Publications in top-tier ML/DL venues such as NeurIPS, ICLR, ICML, AAAI and speech venues such as ICASSP, ASRU, Interspeech
- Deep understanding of large language models
- Familiar with distributed computing and large-scale model training
- Familiar with deep learning frameworks such as TensorFlow and PyTorch
- Familiar with engineering principles and best practices
- Highly competent in algorithms and programming; strong coding skills in C/C++ and Python
- Ability to work collaboratively in a fast-paced, multi-functional environment
JOB SUMMARY
Location: Singapore
Employment type: Full-time