Job Description
ABOUT US
Base.vn is a product-first technology company building a SaaS platform that serves more than 11,000 businesses across Vietnam and beyond.
As AI becomes a core capability across our product ecosystem, we are expanding into voice AI technologies to create more natural and intelligent user experiences.
We are building AI-powered voice solutions that enable users to interact seamlessly through speech, from capturing voice on mobile devices to transcribing, understanding, and generating natural conversations.
We are looking for a passionate AI Engineer (Speech AI) to help build production-grade speech systems and transform cutting-edge AI technologies into products used by thousands of businesses every day.
WHAT YOU DO
As an AI Engineer, you will design and build the core AI capabilities powering our voice platform.
Build Speech AI Systems
Develop and optimize intelligent voice technologies, including:
- Automatic Speech Recognition (Speech-to-Text)
- Text-to-Speech (Speech Synthesis)
- Audio processing and speech enhancement
- Speech understanding and voice interaction pipelines
Build End-to-End Audio AI Pipelines
Design and optimize production-ready audio processing pipelines, including:
- Audio preprocessing and feature extraction
- Noise suppression and speech enhancement
- Real-time audio streaming
- Model optimization for production deployment
Create AI-powered Voice Experiences
Collaborate closely with Product, Mobile, and Backend Engineers to deliver features such as:
- Voice recording and transcription
- AI-powered voice assistants
- Voice-driven workflows
- Natural conversational experiences
Optimize AI Models for Production
Continuously improve:
- Speech recognition accuracy
- Speech synthesis quality
- Inference latency
- Scalability across cloud and mobile environments
Research & Apply State-of-the-Art Technologies
Evaluate and integrate modern Speech AI technologies, including:
- Whisper
- wav2vec2
- NVIDIA NeMo
- XTTS / VITS
- Other open-source or commercial Speech Foundation Models
WHO YOU ARE
We are looking for engineers who enjoy solving challenging AI problems and building products that create real business impact.
- 3+ years of experience building AI/ML systems or AI-powered products.
- Strong Python programming skills.
- Experience with deep learning frameworks such as PyTorch or TensorFlow.
- Hands-on experience deploying AI models into production.
- Solid understanding of speech processing and audio machine learning.
- Experience working on one or more of the following:
  - Automatic Speech Recognition (ASR)
  - Text-to-Speech (TTS)
  - Audio AI or Voice AI applications
- Ability to design and implement end-to-end AI workflows.
- Strong problem-solving skills with a product-oriented mindset.
Nice to Have: Experience with one or more of the following technologies:
- Whisper
- wav2vec2
- NVIDIA NeMo
- Coqui TTS
- XTTS
- VITS
- SpeechBrain
- Kaldi
Additional experience in the following areas is a strong advantage:
- AI deployment on mobile devices
- Real-time audio streaming
- Speaker recognition and speaker diarization
- Voice cloning
- Audio signal processing
- LLM integration or conversational AI systems
WHAT YOU CAN GET
Benefits
- Competitive salary package with performance-based bonuses.
- Flexible salary reviews (1–2 times per year) based on performance.
- Opportunity to receive ESOP (Employee Stock Ownership Plan) for outstanding contributors.
- High-performance workstation and external monitor.
- Annual company trips, team-building activities, and engagement events.
- Full statutory benefits in accordance with Vietnamese Labor Law.
Career Development
Build AI Products with Real Impact: Develop production-grade AI systems that power real customer experiences, not just research prototypes.
Accelerate Your AI Career: Own the complete lifecycle of AI features, from research and model development to deployment and continuous optimization.
Learn from Exceptional Builders: Work alongside experienced AI Engineers, Software Engineers, and Product Builders to create scalable AI products serving thousands of businesses.
Create Visible Impact: Every model you build directly enhances how users interact with our products through intelligent voice experiences.
GET READY TO BE OUR AI ENGINEER!
Apply & Job Discussion with TA (Talent Acquisition)
- Round 1: Technical & Culture Interview
- Round 2: Final Interview with CEO
Office Address: 16th floor, HUD Building, 37 Le Van Luong, Thanh Xuan, Hanoi.