Role and Responsibilities

  • Optimize deep learning models for deployment using PyTorch, ONNX, TensorRT, and other relevant frameworks.
  • Develop and implement techniques for model quantization and compression to reduce memory footprint and increase inference speed.
  • Develop and implement techniques for model obfuscation and secure deployments.
  • Collaborate with AI researchers and developers to integrate advanced performance optimization techniques into our production systems.
  • Analyze and improve existing model architectures for better efficiency and performance.
  • Interface with the production engineering team for assistance with on-prem deployments.

About You

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field
  • Experience implementing modern deep learning architectures (transformers, CNNs, etc.)
  • Experience compiling model inference code for deployment
  • Strong software development skills
  • Strong familiarity with deep learning frameworks and tooling such as PyTorch, ONNX, and TensorRT
  • 2+ years industry experience preparing ML models for production
⚠️ This job was posted almost 2 years ago and may no longer be active β€” explore recent jobs.
Reality Defender
