Amazon Bedrock launches Session Management APIs for generative AI applications (Preview)

Amazon Bedrock announces the preview launch of Session Management APIs, a new capability that enables developers to simplify state and context management for generative AI applications built with popular open source frameworks such as LangGraph and LlamaIndex. Session Management APIs provide an out-of-the-box solution that enables developers to securely manage state and conversation context across […]

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

Deploying models efficiently, reliably, and cost-effectively is a critical challenge for organizations of all sizes. As organizations increasingly deploy foundation models (FMs) and other machine learning (ML) models to production, they face challenges related to resource utilization, cost-efficiency, and maintaining high availability during updates. Amazon SageMaker AI introduced inference component functionality that can help organizations […]

Evaluate and improve performance of Amazon Bedrock Knowledge Bases

Amazon Bedrock Knowledge Bases is a fully managed capability that helps implement entire Retrieval Augmented Generation (RAG) workflows from ingestion to retrieval and prompt augmentation without having to build custom integrations to data sources and manage data flows. There is no single way to optimize knowledge base performance: each use case is impacted differently by […]

Enhance enterprise productivity for your LLM solution by becoming an Amazon Q Business data accessor

Since Amazon Q Business became generally available in 2024, customers have used this fully managed, generative AI-powered assistant to enhance their productivity and efficiency. The assistant enables users to answer questions, generate summaries, create content, and complete tasks using enterprise data. Today’s workforce faces significant application overload. According to Gartner, the average desk worker now […]

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to tackle “stop-and-go” waves, those frustrating slowdowns and speedups that usually have no clear cause but lead to congestion and significant energy waste. To train efficient […]