Loading…
3-4 June, 2025
Bengaluru, India
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for OpenSearchCon India 2025 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in India Standard Time (IST | GMT+5:30). To see the schedule in your preferred timezone, please select from the drop-down located at the bottom of the menu to the right.

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Tuesday June 3, 2025 12:00pm - 12:40pm IST
HPCM (HPE Performance Cluster Manager) is a suite for monitoring HPC systems, consisting of rack, chassis, compute nodes(GPU, CPU), power,cooling components. OpenSearch, deployed in a distributed manner, enables efficient indexing and querying of large volumes of data. For real-time monitoring, events across the system is collected and streamed to Kafka, then persisted in OpenSearch via Logstash, allowing for efficient log management. OpenSearch Dashboards, integrated with Fluent Bit, provide real-time log observability, enabling users to visualize and analyze system logs for insightful cluster monitoring. We have developed a user-friendly wrapper around OpenSearch Alerting that uses custom YAML-based configurations to define alerting rules, categorize alerts by severity and group, and trigger notifications to endpoints like email,alertmanager. Argonne National Laboratory's Aurora, the third-fastest supercomputer in the world, uses 21,248 CPUs and 63,744 GPUs across 166 racks. HPCM, along with its 10-instance OpenSearch database deployed on 10 nodes, successfully manages and monitors events and logs, with OpenSearch Alerting effectively deployed to track critical system issues.
Speakers
avatar for Sinchana Karnik

Sinchana Karnik

HPC Software Engineer, Hewlett Packard Enterprise (HPE)
Sinchana Karnik, Hewlett Packard Enterprise, excels in developing monitoring solutions with HPCM. With deep expertise in creating tools for HPC system management and monitoring, she also specializes in deploying alerting solutions for HPE’s largest supercomputers. Her skills include... Read More →
avatar for Raghul Vasudevan

Raghul Vasudevan

HPC Senior System Software Engineer, Hewlett Packard Enterprise
Raghul Vasudevan is a subject matter expert at HPE with over six years of experience developing monitoring solutions for HPCM-managed supercomputers. He specializes in system monitoring, real-time telemetry, and performance optimization for large-scale HPC environments. He has played... Read More →
avatar for Ambresh Gupta

Ambresh Gupta

Senior System Software Engineer, Hewlett Packard Enterprise
I am work as a Senior System Software Engineer in HPE. I am member of team developing and supporting management software for some of the largest HPC (High Performance Clusters) in the world, with a main focus on delivering the monitoring features of the HPE HPCM product. I work on... Read More →
Tuesday June 3, 2025 12:00pm - 12:40pm IST
Ceres

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link