Loading…
OCP US Summit 2016 has ended

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Hardware Management [clear filter]
Thursday, March 10
 

9:00am

Multi-Node Management Specification
Live Stream - http://www.youtube.com/watch?v=NOS4Hira8mY

Presentation the latest Multi-Node Management specification. The aim from this presentation is to get & collect more feedback before we present the spec for voting.

Speakers
avatar for Bryan Kelly

Bryan Kelly

Principal Firmware Engineering Manager, Microsoft
Bryan Kelly is a Principle Firmware Engineering Manager for Microsoft’s Azure Cloud Server Infrastructure team. Bryan supports Microsoft’s cloud services by designing and developing firmware that enables hardware solutions in Microsoft’s next generation cloud platforms. During... Read More →


Thursday March 10, 2016 9:00am - 9:30am
Lower Level: LL21C

9:30am

Towards a Firmware Update Standard
Live Stream - http://www.youtube.com/watch?v=3yGbwUwwjxc

Discuss proposal to streamline system and device FW updates as well as personality management.


Thursday March 10, 2016 9:30am - 10:00am
Lower Level: LL21C

10:00am

OpenBMC Update
Live Stream - http://www.youtube.com/watch?v=KkHfz4GMiGA

OpenBMC was introduced in OCP summit-2015 and this talk is to present various enhancements done during last year. The updates include support for Multi Node Server platform like 'Yosemite', addition of various ssh-based utilities, and RESTful API implementation.

Speakers
avatar for Tian Fang

Tian Fang

Software Engineer, Facebook
Tian Fang is a software engineer at Facebook, working on the software stack (FBOSS) running on the Facebook network hardware. In the past two years, his focus has been on the OpenBMC project to bring up different Facebook hardware boards. Prior to Facebook, Tian worked at Cisco on... Read More →
JM

Jubin Mehta

Storage SW Engineer, Facebook


Thursday March 10, 2016 10:00am - 10:30am
Lower Level: LL21C

10:30am

Out of Band Management for OCP Server PCIe add-on-cards and SSDs
Live Stream - http://www.youtube.com/watch?v=_alt6Vx0n0I

The I2C sideband management feature developed for PCIe add-on-cards (AOC) by the OCP community can be a very useful tool for running datacenters more efficiently and offers potential for advanced server management of SSDs. This session will look at how the development of active monitoring of high performance PCIe SSDs by the BMC in OCP servers resulted in significant operational cost savings and SSD endurance management for large scale PCIe flash deployments.

The Add-on-Card Thermal Interface Spec for Intel Motherboard V3.0 specification defines an I2C/SMBus for the SSDs/PCIe AOCs for the BMC in OCP servers. With this the BMC can make decisions to keep server working under most efficient operating range lowering cost of operation. One such use case is temperature monitoring of these devices by BMC so it can dynamically control fan speed to provide adequate cooling. The SSD/PCIe AOC presents itself as an emulated temperature sensor so BMC can get temperature reading as if it is getting directly from a temperature sensor. SSDs draw power based on the type of workload that is presented to them and this power draw can vary over a surprising large range. Heavy writes (the worst workload) draw the maximum power and make them very hot whereas other workload draws significantly different power. These workloads do not necessarily stress other parts of the server, so in the absence of temperature monitoring of the SSDs and dynamic fan control, the server has to assume the worst thermal profile and run fans at the maximum speed. SSD endurance is limited to a certain number of petabytes written so it is impractical that this worst case workload will be running for more than a small fraction of a SSDs life. Additionally, the temperature of the NAND on an SSD is an important input into the endurance that and SSD can deliver, particularly lower cost NAND options. This combination of operational cost savings and lower cost NAND make this a particularly valuable feature that is provided to the OCP community. This mechanism of out-of-band management can be further extended to monitor other aspect of SSDs/PCIe AOCs. The session will cover the potential to more advanced monitoring such as:
• Power: Power consumption, power state transitions etc.
• SSD life cycle metrics like percentage drive life used, SMART data, Throttling information, FW slot information. Basically some of the monitoring already covered as part of NVMe standard.
• Errors logs and crash logs to provide adequate debugging information of failures.

Speakers

Thursday March 10, 2016 10:30am - 11:00am
Lower Level: LL21C

11:00am

Redfish and OCP topics
Live Stream - http://www.youtube.com/watch?v=cLlLf5S2xSs

Update on the DMTF Redfish Specification, including new work in progress, and a discussion of an OCP conformance checklist project to provide a standardized implementation target for Redfish support on OCP platforms. New work under development to align Redfish with UEFI and other OCP or industry specifications will also be discussed.

Speakers

Thursday March 10, 2016 11:00am - 11:30am
Lower Level: LL21C

11:30am

OpenDCRE - Next Generation Data Center Management
Live Stream - http://www.youtube.com/watch?v=83RA1myoWNs

OpenDCRE, the Open Data Center Runtime, is a combination of open-source hardware and software for out-of-band management and monitoring of data centers. OpenDCRE is built on a RESTful API that runs on a Raspberry Pi, or similar footprint, providing rack-level management and sensorfication via power line communications, or via a built-in IPMI bridge. This talk describes the hardware, software and API that comprise OpenDCRE, and demonstrates some of its capabilities.


Thursday March 10, 2016 11:30am - 12:00pm
Lower Level: LL21C

1:00pm

Server Management direction for OCP
Live Stream - http://www.youtube.com/watch?v=sg2ESDh99kk

Speakers
avatar for Badriddine Khessib

Badriddine Khessib

Sr. Director, Microsoft
avatar for Mark Shaw

Mark Shaw

Principal Hardware Engineer Manager, Microsoft
Standardized, scalable hardware for the Cloud! Open CloudServers!Mark A. Shaw is a Principal Hardware Engineering Manager for Microsoft’s Cloud+Enterprise Division. Mark and his team are responsible for development of Compute, Storage, and Data Center support systems utilized in... Read More →


Thursday March 10, 2016 1:00pm - 2:00pm
Lower Level: LL20D