
Brief Description:
- In this role, you will be responsible for designing, developing, and optimizing speech and audio recognition and related algorithms for our cutting-edge products. This position demands a deep understanding of machine learning and speech and audio technology, coupled with a strong commitment to innovation and product development, to achieve short-term and long-term product and business goals.
Responsibilities:
- Work with marketing and product management to understand key business drivers, challenges, and future opportunities, and solidify the business plan for the assigned new business launch area;
- Work as an engineer to build and iterate on early experiments for new products and business models;
- Develop and deploy advanced speech recognition (ASR) and audio algorithms, including but not limited to noise reduction, feature extraction, and acoustic modeling;
- Optimize end-to-end voice interaction systems to achieve industry-leading performance, accuracy, and resource efficiency across various platforms;
- Analyze and enhance the efficiency, stability, and scalability of system resources to improve algorithm performance;
- Stay abreast of the latest advancements in speech technology, machine learning and AI;
- Conduct exploratory research to identify new algorithms, models, and techniques that can enhance our products;
- Perform rigorous testing and validation of speech algorithms under diverse conditions;
- Lead projects from concept to deployment, ensuring alignment with business objectives and technical standards and execution on schedule and within budget;
- Work with business development and sales teams to convince potential customers of the new product's value proposition, and to source its concepts of operations and use cases;
- Participate in system architecture and requirement definition, market development and bid support;
- Participate as a key reviewer in technical reviews and provide technical guidance and mentoring to the team;
- Other tasks as assigned.
Qualifications Required:
- Bachelor's Degree in Computer Science, Electrical Engineering, or related field;
- Minimum 5 years’ experience in speech and audio algorithm research and development;
- Proficiency in Python, PyTorch, C/C++ and/or other relevant programming languages;
- Proficiency in applying and fine-tuning mainstream ASR/TTS models (e.g., Whisper, SenseVoice, CosyVoice);
- Hands-on experience with developing real-time speech and audio processing software;
- Strong knowledge of machine learning frameworks;
- Familiarity with speech processing tools and libraries;
- Strong Chinese (required) and English (desired) oral and written communication skills;
- Strong interpersonal and collaboration skills, with the ability to work effectively in a team environment;
- Ability to organize information and make timely and sound decisions;
- Ability to consistently produce high-quality work in a production environment.
Qualifications Desired:
- Experience with Generative Machine Learning;
- Experience with multimodal AI systems, combining speech with vision processing;
- Familiarity with cloud-based AI services and microservices architecture;
- Patents in speech technology;
- Demonstrated leadership in advancing the state of the art in a technical specialty;
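We are committed to providing the industry's best talent with a diverse culture of growth and innovation, as well as excellent career development opportunities.
Send your application to recruiting@aviagesystems.com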