AI Specialist (AI Engineering)
Quick Summary
Compress and optimize large language and vision models for on-device inference. Develop pipelines for model distillation and hardware-specific compilation.
Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques. Hands-on experience with TensorRT, ONNX Runtime, and edge deployment. Strong C++ and Python skills.
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities
~1 min read- →Compress and optimize large language and vision models for on-device inference.
- →Develop pipelines for model distillation and hardware-specific compilation.
- →Benchmark performance across various NPU/GPU architectures.
Requirements
~1 min read- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Location & Eligibility
Listing Details
- Posted
- April 24, 2026
- First seen
- April 24, 2026
- Last seen
- May 4, 2026
Posting Health
- Days active
- 10
- Repost count
- 0
- Trust Level
- 35%
- Scored at
- May 5, 2026
Signal breakdown

Web3 and AI talent recruitment agency based in Hong Kong with 700+ placements globally
Please let Hyphenconnect know you found this job on Jobera.
4 other jobs at Hyphenconnect
View all →Explore open roles at Hyphenconnect.
Browse Similar Jobs
Stay ahead of the market
Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.
No spam. Unsubscribe at any time.