Oddin Himalayas · Posted 2mo ago

Research Scientist, Text-to-Speech

Netherlands USD Contractor

Continue to application Add your email once, then Caio opens the original posting.

Indexed description

About Valka.aiValka, a visionary spin-off from the Realms Group (the parent company of Oddin.gg), is on a mission to revolutionize the way people create and experience digital content.Our team believes that content shouldn’t just be consumed; it should be co-created in real time, blurring the lines between imagination and reality. By harnessing the power of cutting-edge AI, we aim to build an interactive human-digital platform where virtual characters respond dynamically to each user’s voice, text, gestures, and more.This is your chance to join a diverse group of innovators who are driven to redefine what’s possible in generative content. Together, we’re changing the paradigm from passive viewing to active participation, unlocking new creative frontiers across gaming, entertainment, education, and beyond.

What you will be doing

Research and train fast and quality SOTA TTS models for realistic and emotional voice generation for entertainment and education applications.
You will be experimenting with different architectures / data to improve the quality and speed of the TTS model(s) and put the best results to production.
Staying up to date with current research and coming up with new ideas / what to improve is very important for us!
You will be in immediate collaboration with a team of 3 researchers specializing in TTS, and the product is supported by engineering and hardware stuff to ensure deployment

Skills you need

Experience with training some text-to-speech / voice cloning models
Solid knowledge of transformers, diffusion models, GANs
Understanding of human speech and audio processing (sampling, spectrograms, vocoders)
Proficiency in Python and key libraries (e.g., PyTorch, Hugging Face Transformers).
Ability to keep up to date with research, understand papers, implement approaches; strong ML fundamentals and critical thinking

Nice-to-have:

Familiarity with modern speech synthesis models (GPT-based, flow matching… such as Vevo, StyleTTS, IndexTTS, Maskgct etc.)
Contributions to open-source AI tools or research publications in Speech processing field
Familiarity with AWS / similar clusters

Originally posted on Himalayas

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search

Want help applying to roles like this? Search Caio for free. If repetitive applications get heavy, Managed Job Search adds supervised execution for $99/month.

View Managed Job Search

Oddin Company profile preview

Source: Himalayas
Location: Netherlands
Compensation: USD
Open on Caio: 10 roles

Salary insight

USD

Caio highlights salary ranges whenever the original posting exposes them. Compare similar roles as the index fills in.

Similar role details

Contractor roles Location flexible matches Himalayas postings

Company stats

Current index details for Oddin, based on roles Caio has indexed from public sources.

10open roles 2sources 1markets Posted 3d agolatest role