LOG IN
SIGN UP
Tech Job Finder - Find Software, Technology Sales and Product Manager Jobs.
Sign In
OR continue with e-mail and password
E-mail address
Password
Don't have an account?
Reset password
Join Tech Job Finder
OR continue with e-mail and password
E-mail address
First name
Last name
Username
Password
Confirm Password
How did you hear about us?
By signing up, you agree to our Terms & Conditions and Privacy Policy.

System Performance Engineer (CPU, GPU, AI/ML) for Automotive platforms

at Qualcomm

Back to all C/C++ jobs
Q
Industry not specified

System Performance Engineer (CPU, GPU, AI/ML) for Automotive platforms

at Qualcomm

Tech LeadNo visa sponsorshipC/C++/C#

Posted 7 hours ago

No clicks

Compensation
Not specified

Currency: Not specified

City
Not specified
Country
India

Join Qualcomm's System Performance team to profile and optimize system performance on Snapdragon Automotive chipsets, focusing on CPU, GPU, memory (DDR) and AI/ML workloads. You will drive silicon performance analysis using benchmarks and performance tools, identify bottlenecks, and collaborate with cross-functional teams for next-generation chipsets. The role requires deep expertise in CPU/GPU architectures, memory systems, OS-level performance, and profiling techniques to optimize performance across varying temperatures and workloads.


Company:

Qualcomm India Private Limited

Job Area:

Engineering Group, Engineering Group > Systems Engineering

General Summary:

You will be part of System Performance team that is responsible for profiling and optimizing the System Performance on Snapdragon Automotive chipsets.

Skills Required:

  • A deep understanding of CPU, GPU and DDR architecture internals like

    • CPU caches (L1, L2, L3), instruction pipelines, branch prediction, memory hierarchy (including register, cache, and main memory) and multi-core and multi-threaded processing. A good understanding of performance impact with memory access, instruction dependencies, and contention for shared resources.

    • GPU: Understanding of parallel computing concepts, Knowledge of modern GPU architecture, including their memory hierarchies and performance bottlenecks.

    • DDR: Understanding of JEDEC specifications LPDDR4, LPDDR5, LPDDR6, including command and timing parameters (e.g., 𝑑𝐢𝐿, 𝑑𝑅𝐢𝐷, tRP), memory organization (rows, columns, banks), and basic view of training and initialization sequences, how a memory controller works and its specific features like command queue, port arbitration, and various control schemes.

  • Expertise how operating systems manage processes, threads, memory, MMU, and interrupt handling. This knowledge is crucial in understanding software for the kernel scheduler and system-level bottlenecks.

  • Good understanding of Benchmarks CPU (like GeekBench, SpecInt, CoreMark etc), GPU (like Manhattan3.1, Aztec Ruins (Vulcan/OpenGL), Car chase, 3Dmark etc.) and DDR (like lat_mem, stream, bw_mem etc.) and how they exercise the underlying CPU/GPU/DDR architecture.

  • Proficiency in reading and, in some cases, writing assembly code to understand the precise instructions a program executes and identify inefficiencies that compilers may miss.

  • GPU-specific languages like CUDA or OpenCL are important for understanding GPU workloads. Good understanding C/C++/Python is added advantage.

  • Experience with a variety of performance monitoring tools like Intel VTune, Linux perf, and Utilities like top, vmstat, iostat, and netstat to monitor system resources like CPU, memory, and I/O. Experience with software tools to monitor system hotspots, command bus utilization, and identify memory traffic patterns is critical. This includes validating that the traffic generated by software is as expected.

  • Good understanding of memory allocation policies, prefetching, and caching to minimize latency and maximize bandwidth. Understanding how an application accesses memory is vital. Skills in profiling code to analyze memory access patterns and then optimizing the code for better data locality.

  • Analyzing large datasets from performance tests requires strong statistical skills. This can involve creating histograms of transaction latencies and deriving performance metrics to understand a system's behavior.

Responsibilities include:

  • Drive Performance analysis on silicon using various System and Cores (i.e. CPU, GPU, Memory and AI/ML) benchmarks like Dhrystone, GeekBench, SPECInt, CNN/GenAI ML networks etc.

  • Use of Performance tools to analyze the load patterns across IPs and identify any performance bottlenecks in system.

  • Analyzing Perf KPIs of SoC subsystems like CPU, GPU, NSP, Memory, and corelate performance with projection

  • Evaluate and characterize performance at various junction temperatures and optimize running at high ambient temperatures.

  • Analyze and optimize the System performance parameters of SoC infrastructure like NoC, LP5 DDR, etc.

  • Collaborate with cross-functional global teams to plan and execute performance activities on chipsets as well as make recommendations for next generation chipsets.

Minimum Qualifications

7+ years of industry experience in the following:

  • Experience working on above listed required skills on any ARM/x86 based platforms, mobile/automotive operating systems and/or performance profiling tools.

  • Experience in application or driver development in Linux\QNX and ability to create/customize make files with various compiler options is a plus.

  • Must be quick learner and should be able to adapt to new technologies.

  • Must have excellent communication skills.

Preferred Qualifications

Additional skills in the following areas are preferred:

  • Knowledge of Computer architecture, LP5 DDR, Bus/NOC profiling is a big plus.

  • Fundamentals on any operating system like Linux/QNX/Hypervisor & experience working on any Automative applications.

  • Experience in creating professional quality reports and slides using MS Office or any advanced visualization tools.

  • Experience in PoC development and competitive analysis Knowledge on Voltage/Power/ Thermal domain is plus.

Education

Required: Bachelor's, Computer Engineering and/or Electrical Engineering

Preferred: Master's, Computer Engineering and/or Electrical Engineering

Key Skills: CPU Architecture and Performance, MMU, MPAM, Prefetchers, PnP, Multicore, Benchmarks, Operating systems, Linux, QNX, Hypervisor, DDR Performance

Minimum Qualifications:

β€’ Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience.
OR
Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience.

Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.

System Performance Engineer (CPU, GPU, AI/ML) for Automotive platforms

at Qualcomm

Back to all C/C++ jobs
Q
Industry not specified

System Performance Engineer (CPU, GPU, AI/ML) for Automotive platforms

at Qualcomm

Tech LeadNo visa sponsorshipC/C++/C#

Posted 7 hours ago

No clicks

Compensation
Not specified

Currency: Not specified

City
Not specified
Country
India

Join Qualcomm's System Performance team to profile and optimize system performance on Snapdragon Automotive chipsets, focusing on CPU, GPU, memory (DDR) and AI/ML workloads. You will drive silicon performance analysis using benchmarks and performance tools, identify bottlenecks, and collaborate with cross-functional teams for next-generation chipsets. The role requires deep expertise in CPU/GPU architectures, memory systems, OS-level performance, and profiling techniques to optimize performance across varying temperatures and workloads.


Company:

Qualcomm India Private Limited

Job Area:

Engineering Group, Engineering Group > Systems Engineering

General Summary:

You will be part of System Performance team that is responsible for profiling and optimizing the System Performance on Snapdragon Automotive chipsets.

Skills Required:

  • A deep understanding of CPU, GPU and DDR architecture internals like

    • CPU caches (L1, L2, L3), instruction pipelines, branch prediction, memory hierarchy (including register, cache, and main memory) and multi-core and multi-threaded processing. A good understanding of performance impact with memory access, instruction dependencies, and contention for shared resources.

    • GPU: Understanding of parallel computing concepts, Knowledge of modern GPU architecture, including their memory hierarchies and performance bottlenecks.

    • DDR: Understanding of JEDEC specifications LPDDR4, LPDDR5, LPDDR6, including command and timing parameters (e.g., 𝑑𝐢𝐿, 𝑑𝑅𝐢𝐷, tRP), memory organization (rows, columns, banks), and basic view of training and initialization sequences, how a memory controller works and its specific features like command queue, port arbitration, and various control schemes.

  • Expertise how operating systems manage processes, threads, memory, MMU, and interrupt handling. This knowledge is crucial in understanding software for the kernel scheduler and system-level bottlenecks.

  • Good understanding of Benchmarks CPU (like GeekBench, SpecInt, CoreMark etc), GPU (like Manhattan3.1, Aztec Ruins (Vulcan/OpenGL), Car chase, 3Dmark etc.) and DDR (like lat_mem, stream, bw_mem etc.) and how they exercise the underlying CPU/GPU/DDR architecture.

  • Proficiency in reading and, in some cases, writing assembly code to understand the precise instructions a program executes and identify inefficiencies that compilers may miss.

  • GPU-specific languages like CUDA or OpenCL are important for understanding GPU workloads. Good understanding C/C++/Python is added advantage.

  • Experience with a variety of performance monitoring tools like Intel VTune, Linux perf, and Utilities like top, vmstat, iostat, and netstat to monitor system resources like CPU, memory, and I/O. Experience with software tools to monitor system hotspots, command bus utilization, and identify memory traffic patterns is critical. This includes validating that the traffic generated by software is as expected.

  • Good understanding of memory allocation policies, prefetching, and caching to minimize latency and maximize bandwidth. Understanding how an application accesses memory is vital. Skills in profiling code to analyze memory access patterns and then optimizing the code for better data locality.

  • Analyzing large datasets from performance tests requires strong statistical skills. This can involve creating histograms of transaction latencies and deriving performance metrics to understand a system's behavior.

Responsibilities include:

  • Drive Performance analysis on silicon using various System and Cores (i.e. CPU, GPU, Memory and AI/ML) benchmarks like Dhrystone, GeekBench, SPECInt, CNN/GenAI ML networks etc.

  • Use of Performance tools to analyze the load patterns across IPs and identify any performance bottlenecks in system.

  • Analyzing Perf KPIs of SoC subsystems like CPU, GPU, NSP, Memory, and corelate performance with projection

  • Evaluate and characterize performance at various junction temperatures and optimize running at high ambient temperatures.

  • Analyze and optimize the System performance parameters of SoC infrastructure like NoC, LP5 DDR, etc.

  • Collaborate with cross-functional global teams to plan and execute performance activities on chipsets as well as make recommendations for next generation chipsets.

Minimum Qualifications

7+ years of industry experience in the following:

  • Experience working on above listed required skills on any ARM/x86 based platforms, mobile/automotive operating systems and/or performance profiling tools.

  • Experience in application or driver development in Linux\QNX and ability to create/customize make files with various compiler options is a plus.

  • Must be quick learner and should be able to adapt to new technologies.

  • Must have excellent communication skills.

Preferred Qualifications

Additional skills in the following areas are preferred:

  • Knowledge of Computer architecture, LP5 DDR, Bus/NOC profiling is a big plus.

  • Fundamentals on any operating system like Linux/QNX/Hypervisor & experience working on any Automative applications.

  • Experience in creating professional quality reports and slides using MS Office or any advanced visualization tools.

  • Experience in PoC development and competitive analysis Knowledge on Voltage/Power/ Thermal domain is plus.

Education

Required: Bachelor's, Computer Engineering and/or Electrical Engineering

Preferred: Master's, Computer Engineering and/or Electrical Engineering

Key Skills: CPU Architecture and Performance, MMU, MPAM, Prefetchers, PnP, Multicore, Benchmarks, Operating systems, Linux, QNX, Hypervisor, DDR Performance

Minimum Qualifications:

β€’ Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience.
OR
Master's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field and 1+ year of Systems Engineering or related work experience.

Applicants: Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail disability-accomodations@qualcomm.com or call Qualcomm's toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.

If you would like more information about this role, please contact Qualcomm Careers.

SIMILAR OPPORTUNITIES

No similar jobs available at the moment.