وظيفة مهندس منصة AI والاستدلال (مستوى Staff/Senior) لدى Qualcomm في الرياض

AI Platform & Inference Suite Engineer (Staff/Senior Staff level) - Riyadh, KSA

🏢 Qualcomm

🕒 نُشرت: 12 يونيو 2026 (منذ شهر) 📍 الرياض وظائف الهندسة والتقنية

تفاصيل الوظيفة

تعلن شركة Qualcomm عبر فرعها في الرياض عن توفر وظيفة مهندس منصة واستدلال للذكاء الاصطناعي (مستوى Staff/Senior Staff) في المملكة العربية السعودية.

نبذة عن الوظيفة

وظيفة هندسية في مجموعة البرمجيات، تركّز على تمكين أحمال العمل العميقة على مستوى الرفوف باستخدام مسرّعات استدلال الذكاء الاصطناعي المتقدمة من Qualcomm.
دور تقني مواجه للعملاء، يتضمن نقل وتحسين والتحقق من نماذج التعلم العميق على أنظمة الإنتاج، وتمكين الشركاء من تطوير تطبيقات تعلم آلي متقدمة تشمل الرؤية الحاسوبية، الكلام، الذكاء الاصطناعي التوليدي، والنماذج متعددة الوسائط باستخدام PyTorch وTensorFlow وONNX.

المهام والمسؤوليات

نقل وتحسين نماذج التعلم العميق إلى منصات مراكز البيانات المعتمدة على المسرّعات، بما في ذلك سير عمل تحويل النماذج، تقنيات التكميم (INT8/دقة مختلطة)، وتكامل بيئة التشغيل وتحسينها.
دمج نماذج التعلم الآلي في حزمة Qualcomm Cloud AI ML من أطر مثل PyTorch وTensorFlow وONNX.
دفع التحسينات في إنتاجية النموذج وزمن الاستجابة والدقة مع تحليل واضح للمقايضات.
بناء واختبار ونشر خطوط أنابيب استدلال قابلة للتوسع باستخدام أطر خدمة مثل vLLM وTGI وTriton.
تحسين أحمال العمل لنماذج LLM وGenAI عبر بنى متعددة الـSoC ومتعددة البطاقات.
التعاون مع فرق الهندسة لتحليل وتحسين التدريب والاستدلال لتطبيقات التعلم العميق المتقدمة.
تحديد الاختناقات عبر الحوسبة والذاكرة وبيئة التشغيل وتوجيه استراتيجيات التحسين.
المساهمة في مستودع Qualcomm Cloud AI على GitHub ووثائق المطورين ومشاركة أفضل الممارسات.
تطوير ودمج خطوط أنابيب تطبيقات تعلم آلي شاملة مع أطر ومكتبات العملاء.
العمل كمستشار تقني موثوق للعملاء الذين ينشرون أحمال عمل الذكاء الاصطناعي، والمشاركة في مناقشات حجم الأجهزة وهندستها.
تقديم إرشادات فنية حول اختيار نموذج الذكاء الاصطناعي، جدوى النشر، هندسة النظام، وتوقعات الأداء.
تقييم متطلبات نموذج الذكاء الاصطناعي وربطها بقيود الذاكرة وهندسة المسرّع وحدود التوسع.
تقييم خصائص أداء النماذج في سيناريوهات الإنتاج (الإنتاجية، زمن الاستجابة، السلوك المتزامن).
توجيه قرارات الهندسة حول استراتيجيات التوسع الأفقي/العمودي وحجم نشر الأجهزة.
قيادة مناقشات حول خطوط أنابيب الذكاء الاصطناعي الشاملة (سير عمل متعدد النماذج، معالجة مسبقة ولاحقة للبيانات) مع خيارات مثل FFMPEG وGStreamer.
إبراز وشرح المقايضات بين الدقة والتوافق وجودة النموذج وجدوى النشر.
دعم أو قيادة التحقق من قدرات النموذج في بيئات الن deployment.

الشروط والمتطلبات

درجة البكالوريوس في علوم الحاسب، هندسة الحاسب، الهندسة الكهربائية أو مجال ذي صلة (أو خبرة معادلة).
10-15+ سنة من الخبرة في تطوير نماذج التعلم العميق أو نشرها على CPUs/GPUs/ASICs، وأنظمة الاستدلال وتحسينها، ومنصات الذكاء الاصطناعي في مراكز البيانات أو الحافة.
خبرة قوية في تقنيات تكميم النماذج وتحسينها، وأطر نماذج الذكاء الاصطناعي (مثل PyTorch، TensorFlow)، وخطوط أنابيب نشر النماذج.
إتقان ممتاز في برمجة C/C++/Python ومهارات تصميم البرمجيات بما في ذلك التصحيح وتحليل الأداء.
خبرة عملية مع أنظمة Linux والبرمجيات منخفضة المستوى والسائقين وتشغيل النظام.
قدرة مثبتة على تحليل وتحسين أداء النموذج في بيئات الإنتاج.
فهم متين لقيود أجهزة استدلال الذكاء الاصطناعي واختناقات الأداء على مستوى النظام.
مهارات اتصال قوية وخبرة في أدوار تقنية موجهة للعملاء.
الاستعداد للسفر لحضور لقاءات العملاء والمراجعات الاستراتيجية.

المهارات المطلوبة

خبرة في نشر النماذج على منصات تستخدم مسرّعات أجهزة للاستدلال.
إدارة سير عمل متعدد النماذج وبناء خطوط أنابيب استدلال في الوقت الفعلي.
معرفة بأطر خدمة الاستدلال مثل vLLM وTGI وTriton.
فهم تقنيات التكميم (INT8/دقة مختلطة).
إجادة تحليل مقايضات النماذج (الدقة مقابل التوافق، الجودة مقابل الجدوى).
خبرة في تحجيم أحمال العمل عبر بنى متعددة البطاقات.

عرض النص الأصلي للإعلان

Company

Qualcomm Middle East Information Technology Company LLC

Job Area

Engineering Group, Engineering Group > Software Engineering

General Summary

About Us

Qualcomm is enabling a world where everyone and everything can be intelligently connected. You interact with products and technologies made possible by Qualcomm every day, including intelligent edge devices, next-generation computing platforms, and advanced AI solutions. Qualcomm’s leadership in AI, high‑performance compute, and connectivity is driving innovation across cloud, edge, and data center environments - delivering scalable, power‑efficient platforms that power the next generation of intelligent infrastructure.

About The Role

Qualcomm is seeking Machine Learning Applications Engineer - AI Inference & Model Optimization to support the enablement of rack-scale deep learning workloads on advanced Qualcomm AI inference accelerators. These accelerators utilize Qualcomm's expertise in hardware-accelerated AI to deliver high-performance, energy-efficient generative AI and computer vision inference solutions for modern data centers.

This is a customer‑facing, highly technical role focused on porting, optimizing, and validating deep learning AI models on production systems, and enabling Qualcomm’s partners to develop and deploy advanced machine learning applications - including computer vision, speech, generative AI and state of the art multimodal reasoning models - using popular frameworks such as PyTorch, TensorFlow, and ONNX on Qualcomm Cloud AI accelerators. Key responsibilities include evaluating models for throughput, latency, and accuracy; profiling and optimizing model performance; building robust application pipelines; integrating customer frameworks; and contributing to documentation, training, and demonstrations.

The role requires strong expertise in AI models, quantization, performance optimization, and deployment, plus the ability to shape architecture, workload sizing, and system design. It also requires experience with deep learning model development across hardware platforms, solid programming skills, collaboration with cross-functional teams, and proficiency in machine learning frameworks, Linux, and container orchestration tools.

The ideal candidate can effectively bridge AI model requirements ↔ hardware capabilities ↔ customer expectations, guiding customers from model selection → hardware sizing → deployment decisions → production readiness.

What You’ll Do

AI Model Porting & Optimization
Deploy, optimize and scale deep learning AI models onto accelerator‑based data center platforms, including:
Model conversion workflows
Quantization techniques (INT8 / mixed precision)
Runtime integration and optimization
Integrate ML models onto Qualcomm’s Cloud AI ML stack from frameworks such as PyTorch, TensorFlow, and ONNX.
Drive improvements in model throughput, latency, and accuracy, with clear trade‑off analysis.
Build, test, and deploy scalable inference pipelines using serving frameworks such as vLLM, TGI, and Triton.
Optimize workloads for LLM and GenAI models across both multi-SoC and multi-card architectures.
Collaborate with engineering teams to analyze and refine training and inference for advanced deep learning applications.
Identify bottlenecks across compute, memory, and runtime, and guide optimization strategies.
Contribute to Qualcomm’s Cloud AI GitHub repository and developer documentation, sharing technical best practices and solutions.
Develop and integrate end-to-end ML application pipelines with customer frameworks and libraries.
Customer‑Facing Technical Engagement
Act as a trusted technical advisor for customers deploying AI workloads.
Engage in hardware sizing and architecture discussions, aligning model requirements with infrastructure capabilities.
Provide technical guidance on:
AI model selection
Deployment feasibility
System architecture and performance expectations
Lead discussions on model capabilities and limitations based on real customer use cases.
Model-Infrastructure Alignment
Assess and evaluate AI model requirements and recommend alternative model approaches when necessary.
Align model characteristics (latency, throughput, accuracy) with accelerator and system capabilities.
Connect model requirements with:
Memory constraints
Accelerator architecture
Scaling limitations
Support customers in defining model selection strategies based on deployment realities.
Performance & Scalability Engineering
Evaluate performance characteristics of AI models in production scenarios, including:
Throughput expectations
Latency targets
Concurrency behavior
Guide architecture decisions around:
Scaling strategies (horizontal vs vertical)
Hardware deployment sizing
Contribute to discussions on:
Workload scalability limits
Impact of model selection on system performance and efficiency
Provide insights into capacity planning and infrastructure optimization.
End‑to‑End AI Pipeline Design
Drive discussions around end‑to‑end AI pipelines, including:
Multi‑model workflows (e.g., detection + tracking + recognition)
Data preprocessing and post‑processing stages
Guide decisions on video and data processing stacks, including:
Video pipeline choices (e.g., FFMPEG vs GStreamer)
Integration into inference pipelines
Ensure pipelines are aligned with:
Performance requirements
Hardware capabilities
Real‑time constraints
Model Trade‑off Analysis & Validation
Highlight and explain trade‑offs between:
Accuracy vs compatibility
Model quality vs deployment feasibility
Support decision‑making on:
Model simplification vs performance gains
Precision vs efficiency trade‑offs
Lead or support model capability validation in deployment environments.
Collaborate with customers to define:
Inference assumptions
Model sizing strategies for large‑scale workloads

Required Qualifications

Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
10-15+ years of experience in:
Deep learning model development or deployment experience on CPUs/GPUs/ASICs.
Inference systems and optimization
Data center or edge AI platforms
Strong experience with:
Model quantization and optimization techniques
AI model frameworks (e.g., PyTorch, TensorFlow)
Model deployment pipelines
Excellent C/C++/Python programming and software design skills, including debugging, and performance analysis.
Hands on expertise with Linux-based systems, low level software, drivers, and system bring up.
Proven ability to analyze and optimize model performance in production environments.
Solid understanding of:
AI inference hardware constraints
System level performance bottlenecks
Strong communication skills and experience in customer facing technical roles.
Willingness to travel for customer engagements and strategic reviews.

Preferred Qualifications

Skilled in deploying models on platforms that use hardware accelerators for inference.
Experienced with managing multi-model workflows and building real-time AI systems, including computer vision, video, and analytics projects.
Knowledgeable about distributed inference methods and handling large-scale model deployments.
Proficient in developing and maintaining video processing workflows and using relevant software frameworks.
Deep understanding of how system-level decisions affect performance in actual deployment environments.
Capable of simplifying complex technical ideas into straightforward, useful advice for clients.
Hands-on experience running deep learning models on popular ML frameworks such as PyTorch, TensorFlow, ONNX
Experience developing software solutions that run in Linux environments with containers and orchestration
Experience with Source code and configuration management tools, Git knowledge is required.
Customer-facing experience translating customer requirements into technical solutions (discovery, scoping, success criteria, and execution plans).
Proven ability to build and deliver technical demos, proofs-of-concept, and reference applications for ML/GenAI workloads.
Strong technical writing skills to produce customer-ready documentation (getting started guides, deployment runbooks, troubleshooting guides) and deliver partner training sessions.
Experience driving issue triage and technical escalations with customers, coordinating across product, hardware, and software engineering teams to resolution.
Excellent stakeholder management and communication skills: present complex technical concepts clearly to both engineering and non-engineering audiences.

Why Join Qualcomm

At Qualcomm, you’ll work at the intersection of AI silicon, system architecture, and real‑world deployment. You will engage directly with strategic customers, influence next‑generation AI data center platforms, and help define scalable, power‑efficient infrastructure for the AI era. This role provides a unique opportunity to shape both technology direction and customer outcomes, while working with world‑class engineering and product teams.

What's On Offer

Apart from working with great people, we offer the below:

Salary including housing & transport allowance
Stock (RSU's) and performance related bonus
16 weeks fully paid Maternity Leave
6 weeks fully paid Paternity Leave
Employee stock purchase scheme
Child Education Allowance
Relocation and immigration support (if needed)
Life and Medical Insurance
Live+ Well Reimbursement for health and recreational membership fees

Minimum Qualifications

Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 6+ years of Software Engineering or related work experience.

OR

Master's degree in Engineering, Information Systems, Computer Science, or related field and 5+ years of Software Engineering or related work experience.

OR

PhD in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.

3+ years of work experience with Programming Language such as C, C++, Java, Python, etc.
References to a particular number of years experience are for indicative purposes only. Applications from candidates with equivalent experience will be considered, provided that the candidate can demonstrate an ability to fulfill the principal duties of the role and possesses the required competencies.

Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.

المصدر: LinkedIn - أُضيفت للموقع في 12 يونيو 2026

وظيفة مهندس منصة AI والاستدلال (مستوى Staff/Senior) لدى Qualcomm في الرياض

تفاصيل الوظيفة

نبذة عن الوظيفة

المهام والمسؤوليات

الشروط والمتطلبات

المهارات المطلوبة

وظائف أخرى لدى Qualcomm

وظيفة مهندس أنظمة الذكاء الاصطناعي لمراكز البيانات (تحليل تنافسي) لدى Qualcomm في الرياض

وظيفة Data Centre Software Engineer - Rack Management لدى Qualcomm في الرياض

Qualcomm تعلن عن وظيفة Senior Staff Cyber Security Engineer في الرياض

وظيفة مهندس تعلم آلة سحابي لدى Qualcomm بالرياض

Qualcomm تعلن عن وظيفة مهندس حلول الذكاء الاصطناعي لمراكز البيانات في الرياض

Qualcomm تعلن عن وظيفة مهندس بنية تحتية وموثوقية المواقع - مركز بيانات AI في الرياض