Artificial Intelligence

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

Dan Ferguson

10d ago

Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.

aimachine-learningtechnology

How to build self-driving AI operations on Amazon Bedrock at scale

Sushovan Basak

11d ago

In this post, we introduce Amazon Bedrock Ops Alert, a three-layer automated monitoring solution that proactively detects operational issues, dynamically adjusts alarm thresholds, classifies alarms by category, automatically creates context-aware support cases, helps prevent duplicate cases when an unresolved case of the same alarm category is already active, and delivers contextualized notificat…

aiautomationmachine-learning

Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart

Vivek Gangasani

11d ago

In this post, we show you how to get started with NEXUS on Amazon SageMaker JumpStart, walk through the deployment process, and demonstrate how to run predictions against your enterprise datasets.

aimachine-learning

Reducing container cold start times using SOCI index on DLAMI and DLC

Ohad Katz

11d ago

In this post, we look at how to use SOCI on publicly available Deep Learning AMIs and Containers, when to use the various SOCI modes provided by the tool, and how to quickly and efficiently use this tool in your workloads today.

Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

Amin Dashti

11d ago

In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SLM). The example uses Amazon SageMaker AI training jobs, so you can focus on training code instead of managing your own training infrastructure. You also learn how to evaluate tool-calling accuracy and compare a base mod…

aimachine-learningsupervised-learning

The art and science of hyperparameter optimization on Amazon Nova Forge

Nishant Dhiman

12d ago

Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that balance right is harder than it looks. This post walks through how to navigate that balance, from selecting the right customization strategy for your data and task, to configuring the training parameters that most influence outcomes, like learning rate…

aimachine-learningoptimization

Object detection with Amazon Nova 2 Lite

Robert Stolz

12d ago

In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application using Amazon Bedrock, AWS Lambda, and Amazon API Gateway. You'll also learn how to craft effective prompts, process structured JSON output, and visualize results. We explore practical applications across manufacturing, agriculture, and logistics.

aicomputer-vision

How Baz improved its AI Agent Code Review accuracy using Amazon Bedrock AgentCore

Itay Atas

12d ago

This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by leveraging these AWS services to automate their code review process

aiautomationmachine-learning

Building a secure auth code flow setup using AgentCore Gateway with MCP clients

Swagat Kulkarni

13d ago

This post demonstrates how to implement Open Authorization (OAuth) Code flow as an inbound authorization mechanism for MCP servers hosted on Amazon Bedrock AgentCore Gateway. By the end of this guide, you will have a production-ready setup where each AI assistant request is authenticated with a valid user identity token issued from your organization’s identity provider.

aimachine-learningsecurity

Reference your own AWS Secrets Manager secrets in Amazon Bedrock AgentCore Identity

Swara Gandhi

13d ago

Today, we’re excited to announce the ability to reference a secret in AWS Secrets Manager for AgentCore Identity, so you can reference your own preconfigured secret from Secrets Manager and retain full control over how it is managed. With this ability, you can extend your organization’s existing secrets governance processes to AgentCore. You can provide an existing, preconfigured AWS Secrets Mana…

Transforming rare cancer research with Amazon Quick: Integrating biomedical databases for breakthrough discoveries

Anu Kaggadasapura Nagaraja

13d ago

In this post, we walk through how to use Amazon Quick Research to integrate biomedical data sources for rare cancer research. The walkthrough uses pediatric sarcoma as the research domain and draws on publicly available datasets from PubMed and other open biomedical repositories. It covers the end-to-end workflow: defining a research objective, configuring data sources, reviewing the AI-generated…

biochemistrybiology

OpenAI models and Codex on Amazon Bedrock are now generally available

Bharat Sandhu

13d ago

GPT-5.5, GPT-5.4, and Codex are now generally available on Amazon Bedrock. Deploy them in production applications and agents today, on Bedrock’s high performance inference engine. 

aimachine-learningtechnology

Extending MCP support for Amazon Bedrock AgentCore Gateway

Anagh Agrawal

13d ago

While deploying Model Context Protocol (MCP) servers in production, enterprises need fine-grained access control across servers, observability into which teams use which tools, security guarantees against data exfiltration, and centralized credential management, all at scale. Amazon Bedrock AgentCore Gateway sits between MCP servers and the clients that consume them, centralizing credential manag…

aicomputer-science

Secure AI agents with Policy and Lambda interceptors in Amazon Bedrock AgentCore gateway

Bharathi Srinivasan

13d ago

In this post, we use a lakehouse data agent to demonstrate how you can use Policy for deterministic access control and Lambda interceptors for dynamic validation. We then show how to combine Lambda interceptors and Policy to implement a geography-based access control which requires both dynamic validation and deterministic access control.

aimachine-learning

Enable safe agentic payments with built-in guardrails using Amazon Bedrock AgentCore payments

Joshua Smith

13d ago

In this post, we address several key risks that surface when designing an agentic payment system, and how to address them with the capabilities of AgentCore payments.

AgentOps: Operationalize agentic AI at scale with Amazon Bedrock AgentCore

Anastasia Tzeveleka

13d ago

When you build agentic AI solutions, you face unique operational challenges. Agents make unpredictable decisions, costs spiral unexpectedly, and debugging non-deterministic failures seems impossible. Agentic AI applications don't just execute predetermined workflows. They reason, adapt, and make autonomous decisions, and DevOps practices need to be adapted. That's where AgentOps comes in, the ope…

aiautonomous-systemsmachine-learning

Accelerate LLM model loading and increase context windows with GPUDirect on Amazon FSx for Lustre and TurboQuant

Randy Seamans

13d ago

If you’re iterating on deploying large language models (LLMs) on AWS GPU instances, you’ve probably noticed the larger the model to be loaded into GPU High Bandwidth Memory (HBM), the longer the painful wait until the GPUs are ready for inference. As models grow to hundreds of billions of parameters and GPU environments grow ever […]

aimachine-learning

Amazon Quick integration with time-series databases for market intelligence using MCP

Abhishek Sharma

13d ago

In this post, we walk through a practical implementation using KDB-X MCP server integration with Amazon Quick, demonstrating how traders and analysts can ask questions using conversational language and receive actionable insights from datasets. You can apply this same integration pattern across various domains, from financial market analysis to IoT sensor monitoring to DevOps performance dashboar…

aimachine-learning

Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality

Sandeep Raveesh-Babu

16d ago

This post demonstrates a comprehensive observability solution using Amazon Managed Grafana dashboards that provides a holistic view of both quality and quantity for LLMs served on Amazon SageMaker AI endpoints with inference components.

aimachine-learning

Training Azerbaijani language models on Amazon SageMaker AI

Aleksei Iancheruk

17d ago

Azercell Telecom LLC, Azerbaijan's leading telecommunications provider, wanted to build an Azerbaijani large language model (LLM) on Amazon SageMaker AI for telecom use cases and a customer-facing chatbot. The challenge: adapting foundation models (FMs) to a morphologically rich language with limited training data and no existing blueprint for efficient LLM training in Azerbaijani. In a six-week …

aimachine-learningnlp

research.io

Sign up to keep scrolling

Create your feed subscriptions, save articles, keep scrolling.

Already have an account?