We are seeking a skilled and innovative Senior Kernel Engineer to help build and optimize our clients cuttingedge Software Development Kit (SDK). Our clients SDK is designed to deliver industryleading performance and power efficiency for our clients unique atmemory AI inference chips. By enabling customers to compile AI models directly for their architecture and providing advanced tools for performance analysis and optimization were setting a new standard in AI inference technology.
At the core of our SDK is the kernel library which maximizes hardware efficiency through fast computations flexible parallel workloads and highly configurable dataflow options. This role requires a deep understanding of our architectures strengths and constraints and how the kernel libraries features enable seamless and performant compilations.
Key Responsibilities:
- Develop Neural Network Compute Kernels: Design and implement efficient and flexible kernels tailored for our chip families.
- Enhance Kernel Abstractions: Define and improve abstractions in our kernel library to streamline development and accelerate kernel performance.
- Optimize Performance: Conduct indepth analysis and optimization of individual kernel and fullnetwork implementations.
- Collaborate CrossFunctionally: Work closely with the compiler and physical allocation teams to deliver efficient pushbutton compilation workflows for AI models.
Required Qualifications:
- Bachelors or advanced degree in Computer Science Engineering Mathematics Physics or a related field.
- Proficiency in Python C/C and software design principles.
- Proven ability to work independently on complex tightly constrained technical challenges.
- Demonstrated experience leading technical initiatives and collaborating with crossfunctional teams.
- Strong interest and ability to balance highlevel architecture considerations with detailed lowlevel technical work.
- Expertise in lowlevel and parallel optimization techniques including assembly language GPU shaders SIMD CUDA or AI inference accelerator kernels.
Preferred Qualifications:
- Familiarity with spatial architectures and atmemory compute technologies.
- Knowledge of AI algorithms and neural network principles.
- Advanced mathematical skills and a passion for solving complex problems.
- Enjoy tackling challenging intellectual problems such as tricky math puzzles or logicbased tests.
If youre excited about pushing the boundaries of performance and innovation in AI inference wed love to hear from you!