Intel NNP-i 1150
Logs
typeType
Coprocessor
interfaceInterface
connectorSlot connector
M.2 (M Key)
signalSignaling
PCIe 1.0 PCIe 2.0 PCIe 3.0
datePCI ID
45C6
ramRAM type
LPDDR4X
ramRAM size
16GB
dimDimensions
22 mm (width) x 110mm (length) x 24mm (height)
chip Chips and specs
Compute Architecture
CPU Instruction Sets AVX-512, AVX-VNNI
Integrated CPU Cores 2x Intel Sunny Cove Cores
Primary Compute Units 12 Inference Compute Engines (ICE)
ICE Compute Capability 4K MAC/cycle per ICE (4D Grid: 32x32x4)
Integrated Vector Processor Customized Tensilica Vision P6 DSP (512-bit vector width)
System Interface and Power
Host Interface PCIe 3.0 x8 (M.2 form factor only exposes x4)
Power Management Fully Integrated Voltage Regulation (FIVR), DVFS
External Memory Controller Quad-channel LPDDR4X-4266
Thermal Design Power (TDP) 12 W (configurable up to ~25 W)
External Memory ECC Support Yes (Not Used)
On-Board Memory
Brand Micron
Model MT53D1024M32D4DT-046
Total RAM 16GB
Configuration 4x 32bit LPDDR4x
Architecture Identity
Model Intel Nervana NNP-I 1150
Codename Spring Hill
Process Technology Intel 10nm FinFET CMOS
Primary Market Segment Deep learning inference acceleration for data center and edge
On-Chip Memory Hierarchy
Deep SRAM 48 MiB Total (4 MiB per ICE)
On-Chip Communication Integrated ring bus architecture
Last Level Cache (LLC) 24 MiB Total (8 cache slices x 3 MiB)
Tightly-Coupled Memory (TCM) 3 MiB Total (256 KiB per ICE)
Feature and Performance Summary
Peak Performance 50 TOPS (at INT8 precision)
Supported Data Types INT8, FP16, 4-bit, 2-bit, 1-bit
Software Compatibility TensorFlow, PyTorch, ONNX via nGraph
portintI/O ports
Empty
powerPower connectors
Empty
notes Notes

The Intel Nervana NNP-I 1150 (Spring Hill, SRK1U) is an M.2 Key M form factor inference accelerator. It integrates two Intel Sunny Cove cores and twelve Inference Compute Engines (ICEs) on a 10nm SoC. The chip includes 24 MB of shared LLC, 3 MB of tightly coupled memory (256 KB per ICE), and 48 MB of deep SRAM (4 MB per ICE). It supports LPDDR4x-4266 memory with 16 or 32 GB capacity and 67.2 GB/s bandwidth.

The card connects via M.2 PCIe Gen3 x4 and has a typical TDP of 12W. It is designed to accelerate low-latency AI inference workloads and operates independently with an embedded Linux-based OS loaded by the host driver at runtime.

Last updated 2025-06-11T07:32:02Z
bios 2 card BIOS files available
Version
Note
File
Logs
Version
Unknown
Note
Intel Engineering BIOS
File
Logs
Version
Unknown
Note
Altera MAX 10Q16M FPGA data
File
Logs
drv No drivers available
doc No card documents available

Disclaimer

The info found in this page might not be entirely correct. Check out this guide to learn how you can improve it.