Skip to content

This is just a tool to use a chrome driver that exposes a MCP server to be easily to use with Cursor IDE

Notifications You must be signed in to change notification settings

ravikumar2026/simple-mcp-selenium

 
 

Repository files navigation

MCP Selenium Server

Powerful browser automation for Cursor IDE with multi-browser instance support

Documentation License GitHub

🚀 What is this?

MCP Selenium Server lets you control browsers using natural language in Cursor IDE. Perfect for testing, automation, and multi-user scenarios.

Example: Just tell Cursor "Open a browser, go to Google, search for 'hello world', and take a screenshot" - and it happens!

⚡ Quick Start

1. Install (Docker - Recommended)

git clone https://github.com/brutalzinn/simple-mcp-selenium.git
cd simple-mcp-selenium
./scripts/install-simple.sh

2. Configure Cursor IDE

Add to Cursor settings:

{
  "mcp.servers": {
    "selenium-browser": {
      "command": "docker",
      "args": [
        "exec",
        "-i",
        "mcp-selenium-server",
        "node",
        "/app/dist/index.js"
      ]
    }
  }
}

3. Use it!

Restart Cursor and start talking to your browser:

  • "Open a browser and go to example.com"
  • "Click the login button"
  • "Type '[email protected]' in the email field"
  • "Take a screenshot"

🎯 Key Features

  • 🤖 Natural Language Control - Tell Cursor what to do in plain English
  • 🌐 Multi-Browser Support - Manage multiple browser instances with unique IDs
  • 🔌 Plugin System - Extend functionality with custom plugins
  • 📸 Screenshots - Capture page states automatically
  • 🔄 Drag & Drop - Full interaction support
  • ⚡ Fast - Optimized for speed (3-second timeouts)

📚 Documentation

Complete documentation: https://brutalzinn.github.io/simple-mcp-selenium/

🛠️ Available Tools

Tool What it does Example
open_browser Start browser "Open a browser"
navigate_to Go to URL "Go to google.com"
click_element Click things "Click the login button"
type_text Type text "Type 'hello' in the search box"
drag_and_drop Move elements "Drag the menu to the workflow area"
take_screenshot Capture page "Take a screenshot"
execute_script Run JavaScript "Execute some JavaScript"

🌐 Multi-Browser Mode

Perfect for testing multiple users or scenarios:

# User 1
"Open a browser with ID 'user1' and go to site1.com"

# User 2
"Open a browser with ID 'user2' and go to site2.com"

# Use specific browsers
"Click the button in browser 'user1'"
"Fill the form in browser 'user2'"

🔌 Plugins

Extend functionality with plugins:

View all plugins →

🐳 Docker Commands

# Start
docker-compose up -d

# Stop
docker-compose down

# Restart
docker-compose restart

# View logs
docker-compose logs -f

🤝 Contributing

  1. Fork the repo
  2. Create your feature branch
  3. Make your changes
  4. Submit a pull request

📄 License

MIT License - see LICENSE for details.

🆘 Need Help?


Made with ❤️ for the Cursor IDE community

Made with 😠 for other alternatives that uses internal proxies to somewhere i dont know

About

This is just a tool to use a chrome driver that exposes a MCP server to be easily to use with Cursor IDE

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 61.9%
  • TypeScript 27.5%
  • Makefile 6.1%
  • Shell 3.5%
  • Dockerfile 1.0%