Deep-learning inference models: the future of optimized deployment

Intel_logo_(2006-2020).svg

The Problem

Deep-learning models – whether for natural language processing, image processing, or machine vision – are becoming larger and more complex. Using these models in production requires significant computing resources even after traditional optimization techniques such as hyperparameter tuning, pruning, and quantization are applied. As a result, both cloud and edge deployments of deep-learning are challenged in two important dimensions:

  • Performance: Applications like autonomous driving, real-time video processing, and interactive voice response are time-sensitive and need to deliver results within strict time constraints. Failure to meet performance targets can limit overall product success.
  • Cost: In the cloud, longer processing times and increased memory requirements translate into higher costs. At the edge, increasing model complexity requires larger and more expensive CPUs or GPUs. Sometimes the processing unit in a particular edge platform is difficult to change, and thus designers struggle to add functionality and intelligence while staying within the available memory and computing limits.

    Does your company develop, train, and deploy cloud-based AI deep learning workloads on a regular basis?*
    Yes
    No

    Are you in Cloud?*
    Yes
    No

    Does your company have a dedicated infrastructure sw team for running the data/AI/ML pipeline in production?*
    Yes
    No

    When do you expect to deploy a solution that accelerates the performance or saves costs of products using cloud-based deep learning neural networks?*
    Within the next 3 months
    Within the next 6 months
    Within the next year
    Beyond mid 2022

    All information that you supply is protected by our Privacy Policy.

    In order to provide you with this free service, we may share your business information with companies whose content you choose to view on this website.

    By submitting your information you agree to our Terms of Use.

    Third party cookies may be placed, to serve more relevant ads when you browse the web.

    You can learn more about those ads here.