Do I just post about cool events? Maybe? But that's good, right? Now here's another!
Scaling Intelligence: Accelerating HPC and Inference Workflows.
If youβre building or scaling Generative AI infrastructure, you already know the stakes. Balancing the massive compute demands of LLMs with strict production latency and data security requirements is a massive architectural hurdle.
Whether you're tackling real-time streaming analytics, automated compliance pipelines, or complex risk modeling, your underlying infrastructure shouldn't be your bottleneck.
On Thursday, May 28th, weβre hosting an exclusive, hands-on workshop at Google NYC (111 8th Ave) designed specifically for engineers, architects, and tech leaders: Scaling Intelligence: Accelerating HPC and Inference Workflows.
π οΈ The Tech Breakdown
This isn't a high-level pitch; we're diving into the actual plumbing required for breakthrough performance:
- Next-Gen Compute Architectures: Blueprinting high-throughput infrastructure built to handle concurrent, low-latency inference workloads at scale.
- The Hardware & Software Stack: Get a closer look at optimizing workloads using Google Cloudβs new G4 VMs (powered by the massive NVIDIA RTX Pro 6000 Blackwell architecture) alongside TensorRT for maximum throughput.
- Hands-on Labs: Bring your laptop. You'll get practical experience deploying and optimizing state-of-the-art open-source models like Gemma and Llama 3, with live guidance from Google Cloud and NVIDIA AI experts.
π₯ Bring the Whole Squad
Infrastructure decisions don't happen in a vacuum. To get the most out of the hands-on labs and architecture deep-dives, we highly encourage bringing a cross-functional team (2β4 people) across:
- AI/ML Architecture & Engineering
- Platform Engineering / DevSecOps
- IT & Infrastructure Leadership
Aligning your data scientists with your infrastructure engineers is the fastest way to unblock your production roadmap.
π** Logistics & Details**
Where: Google NYC (111 8th Ave)
When: Thursday, May 28 | 12:00 PM β 4:00 PM (Stick around for the networking reception right after!)
Note on Availability: Spaces are strictly limited to ensure high-quality, hands-on coaching and meaningful architectural reviews during the labs.
Top comments (0)