MLOps Engineer

August 29, 2025
Open
Location
Anywhere
Occupation
Full-time
Experience level
Senior
AI Summary

This position at Bjak lets you work full-time and remotely in Vietnam, building and developing artificial intelligence applications at global scale. Key responsibilities include optimizing the operation of open-source AI models and collaborating with other engineering teams to deploy stable, efficient model-serving systems on GPU/CPU resources. The company offers attractive compensation and bonuses, health and travel insurance, housing and meal benefits, and flexible time off.

Candidates need experience with vLLM and HuggingFace TGI and proficiency in Kubernetes, Ray, and Modal, along with the ability to set up and manage inference endpoints and to monitor cost, latency, and performance. The work environment is flexible, with a culture that values ownership and personal growth.


Transform Language Models into Real-World Applications

We’re building AI systems for a global audience. In this era of AI transition, this new project team will focus on building applications that deliver real-world impact and reach the widest possible usage. This is a global role with a remote work arrangement. You’ll work closely with regional teams across product, engineering, operations, infrastructure, and data to build and scale impactful AI solutions.

Why This Role Matters

You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.

What You’ll Do

  • Run and manage open-source models efficiently, optimizing for cost and reliability

  • Ensure high performance and stability across GPU, CPU, and memory resources

  • Monitor and troubleshoot model inference to maintain low latency and high throughput

  • Collaborate with engineers to implement scalable and reliable model serving solutions

What Is It Like

  • Like ownership and independence.

  • Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.

  • Stay calm and effective in startup chaos - shifting priorities and building from zero don’t faze you.

  • Have a bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.

  • See feedback and failure as part of growth - you’re here to level up.

  • Possess humility, hunger, and hustle, and lift others up as you go.

Requirements

  • Experience with model serving platforms such as vLLM or HuggingFace TGI

  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, or LambdaLabs

  • Ability to monitor latency and costs, and to scale systems efficiently with traffic demands

  • Experience setting up inference endpoints for backend engineers

What You’ll Get

  • Flat structure & real ownership

  • Full involvement in direction and consensus decision making

  • Flexibility in work arrangement

  • High-impact role with visibility across product, data, and engineering

  • Top-of-market compensation and performance-based bonuses

  • Global exposure to product development

  • Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals

  • Health, dental & vision insurance

  • Global travel insurance (for you & your dependents)

  • Unlimited, flexible time off

Our Team & Culture

We’re a dense, high-performance team focused on high-quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.

About Bjak

BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems. If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.

Bjak
Compare car and motorcycle insurance, and renew road tax online at BJAK. Check prices now. Become a BJAK VIP and get unlimited towing service. Our mission is to develop technology-based solutions to improve financial inclusion. We develop new and innovative platforms and services globally. For example, we are the first platform to simplify and digitise comprehensive life and medical insurance, supported by an AI agent. BJAK is the largest insurance platform in Southeast Asia. If you enjoy building cutting-edge platform ecosystems that give everyone equal access to financial services at scale, join us.
Company type
Scale-up
Domain
Information Technology & Services