sample-agents-with-nova-act-and-mcp

MCP.Pizza Chef: aws-samples

The sample-agents-with-nova-act-and-mcp MCP client demonstrates building intelligent web automation agents by combining Amazon Nova Act's web interaction capabilities with the Model Context Protocol (MCP). It enables seamless integration of AI models with web automation tools, facilitating real-time, context-aware actions on websites. This client supports the MCP architecture by managing connections between AI hosts and the Nova Act server, leveraging Amazon Bedrock foundation models for enhanced agent intelligence.

Use This MCP client To

Create AI agents that automate web interactions using Amazon Nova Act Integrate LLMs with web automation for dynamic task execution Build context-aware web automation workflows with MCP protocol Connect AI hosts like Claude Desktop to web automation tools Leverage Amazon Bedrock models for intelligent agent decision-making

README

Building Agents with Amazon Nova Act and MCP

This repository demonstrates how to build intelligent web automation agents using Amazon Nova Act integrated with MCP (Model Context Protocol). MCP provides a standardized way to connect AI models to different data sources and tools - think of it like a "USB-C port for AI applications."

Amazon Nova Act

Overview

This project showcases the integration between:

  • Amazon Nova Act: A powerful web automation tool for creating agents that interact with web pages
  • MCP (Model Context Protocol): An open protocol that standardizes how applications provide context to LLMs
  • Amazon Bedrock: Managed service that provides foundation models through a unified API

Architecture

The project follows MCP's client-server architecture:

  • MCP Host: Claude Desktop or other AI tools that want to access Nova Act capabilities
  • MCP Client: Protocol clients maintaining 1:1 connections with the Nova Act server
  • MCP Server: A Nova Act server exposing web automation capabilities through the standardized protocol
  • Remote Services: Web services that Nova Act interacts with through browser automation

Key Features

  • Standardized LLM integration through MCP
  • Automated web navigation and interaction via Nova Act
  • Parallel execution of multiple browser tasks
  • Data extraction and processing
  • Screenshot capture and visual verification
  • Session management and browser control
  • Seamless integration with Claude Desktop
  • Access to foundation models via Amazon Bedrock

Prerequisites

Project Structure

The repository contains two main example directories:

1. Streamlit Examples (streamlit_examples/)

  • video_game_research_st.py: A comprehensive tool that:
    • Finds top games for any selected gaming system
    • Searches Amazon in parallel for pricing and details
    • Creates interactive result tables
    • Saves research data for future reference

2. MCP Examples (mcp_examples/)

  • nova_act_mcp_server.py: MCP-compliant server implementation exposing Nova Act capabilities
  • nova_act_mcp_client.py: Example MCP client implementation
  • Demonstrates integration with Claude Desktop and other MCP hosts

3. Nova Act Examples (nova_act_examples/)

  • get_coffee_maker.py: Simple example demonstrating how to:
    • Initialize Nova Act with a starting web page
    • Perform basic web search and selection actions
    • Extract information from product pages
  • multi_monitor.py: Advanced example showing:
    • Parallel execution of web tasks using ThreadPoolExecutor
    • Searching and comparing multiple products simultaneously
    • Structured data extraction and comparison
    • Error handling for robust web automation

Getting Started

  1. Clone this repository

  2. Install dependencies:

    pip install -r requirements.txt
  3. Set your Nova Act API key:

    export NOVA_ACT_API_KEY="your_api_key"
  4. Make sure your environment has AWS configured correctly - https://docs.aws.amazon.com/cli/latest/userguide/getting-started-quickstart.html

Usage

Running Streamlit Examples

cd streamlit_examples
streamlit run video_game_research_st.py

Running the MCP Server and Client

cd mcp_examples
python nova_act_mcp_client.py nova_act_mcp_server.py

The command above will:

  1. Start the MCP server that exposes Nova Act capabilities
  2. Launch the MCP client that connects to the server
  3. Enable communication between Claude and the Nova Act browser automation

Claude Desktop Integration

For setting up and using this server with Claude Desktop, please follow the official Claude Desktop MCP Setup Guide. The guide covers:

  • Installing and configuring Claude Desktop
  • Setting up MCP servers in Claude Desktop
  • Troubleshooting common issues
  • Example usage and best practices

Best Practices

  1. Follow MCP's standardized approach for exposing capabilities
  2. Always close browser sessions when done
  3. Use headless mode for automated tasks not requiring visual feedback
  4. Break down complex actions into smaller, specific instructions
  5. Use schemas when expecting structured data responses
  6. Save important results to files for persistence
  7. Handle errors appropriately in your code

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.

Additional Resources

sample-agents-with-nova-act-and-mcp FAQ

How does this MCP client connect AI models to web automation?
It acts as a protocol client managing connections between AI hosts and Amazon Nova Act, enabling models to perform web actions.
Can this client work with multiple AI hosts?
It maintains 1:1 connections per client instance but can be instantiated multiple times for different hosts.
What role does Amazon Bedrock play in this integration?
Amazon Bedrock provides foundation models accessed via a unified API to power the intelligence of the web automation agents.
Is this client limited to Amazon Nova Act only?
Yes, this client specifically integrates MCP with Amazon Nova Act for web automation tasks.
What kind of web actions can agents perform using this client?
Agents can navigate pages, fill forms, click buttons, and perform complex scripted interactions.
How does MCP enhance the capabilities of Amazon Nova Act?
MCP standardizes context sharing and tool integration, allowing AI models to interact with web automation tools more effectively.
Is this client open source and customizable?
Yes, the repository provides code and examples to build and extend intelligent web automation agents.
What AI models are compatible with this client?
It supports models accessible via MCP, including OpenAI, Claude, and Gemini foundation models.