An MCP server that helps AI assistants access text content from websites that implement bot detection, bridging the gap between what you can see in your browser and what the AI can access.
Intended Use
This tool is optimized for low-volume retrieval of documentation and reference materials (text/HTML only) from websites that implement bot detection. It has not been designed or tested for general-purpose site scraping or data harvesting.
Note: This project was developed in collaboration with Claude Sonnet 3.7, using LLM Context.
Installation
Requirements:
Install dependencies and the tool:
Setup with Claude
Add this configuration to your Claude client's MCP server configuration:
Available Tools
This package provides two distinct tools:
s-fetch-page: Retrieves complete web pages with pagination support
s-fetch-pattern: Extracts content matching regex patterns with surrounding context
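To make the difference between the two tools concrete, here is a minimal Python sketch of the two ideas involved: returning one window of a large page at a time, and returning regex matches together with some surrounding text. It is not the server's implementation. The function names `paginate` and `find_with_context` and the parameters `offset`, `max_length`, and `context_chars` are assumptions chosen for illustration.

```python
import re


def paginate(text: str, offset: int = 0, max_length: int = 1000) -> dict:
    """Return one window of `text` plus the offset needed to request the next one."""
    chunk = text[offset:offset + max_length]
    next_offset = offset + len(chunk)
    return {
        "content": chunk,
        # None signals that the end of the document has been reached.
        "next_offset": next_offset if next_offset < len(text) else None,
    }


def find_with_context(text: str, pattern: str, context_chars: int = 40) -> list[str]:
    """Return each regex match with up to `context_chars` of text on either side."""
    snippets = []
    for match in re.finditer(pattern, text):
        start = max(0, match.start() - context_chars)
        end = min(len(text), match.end() + context_chars)
        snippets.append(text[start:end])
    return snippets


# Made-up page text standing in for a fetched document.
page = "Intro. " * 50 + "Install with pip install example-tool. " + "Outro. " * 50
first = paginate(page, offset=0, max_length=100)
hits = find_with_context(page, r"pip install \S+", context_chars=13)
```

Pattern matching of this kind keeps the returned text small when only one detail on a long page matters, while pagination suits cases where the whole document is needed in order.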
Example Usage
Fetching a Complete Page
Extracting Specific Content with Pattern Matching
Functionality Options
Protection Levels:
Content Targeting Options:
Tips for Best Results
Start with basic mode and only escalate to higher protection levels if needed
For large documents, use the pagination parameters with s-fetch-page
Use s-fetch-pattern when looking for specific information on large pages
The AI will automatically adjust its approach based on the site's protection level
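The first tip amounts to a simple escalation loop, which can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the server's code: the level names other than `basic`, the `fetch` callable, and the `BlockedError` exception are all hypothetical.

```python
from typing import Callable

# Hypothetical level names, ordered from lightest to heaviest protection handling.
LEVELS = ["basic", "intermediate", "maximum"]


class BlockedError(Exception):
    """Raised by the hypothetical fetcher when a site rejects the request."""


def fetch_with_escalation(url: str, fetch: Callable[[str, str], str]) -> tuple[str, str]:
    """Try each level in order; return (level_used, content) for the first success."""
    last_error = None
    for level in LEVELS:
        try:
            return level, fetch(url, level)
        except BlockedError as err:
            last_error = err  # remember the failure and try the next level
    raise RuntimeError(f"all protection levels failed for {url}") from last_error


def fake_fetch(url: str, level: str) -> str:
    """Stand-in fetcher: pretends the site blocks requests made at the 'basic' level."""
    if level == "basic":
        raise BlockedError("blocked")
    return f"<html>content of {url}</html>"


level_used, content = fetch_with_escalation("https://example.com/docs", fake_fetch)
```

Starting at the lightest level keeps requests fast for sites that do not need more, and only pays the cost of heavier handling when a lighter attempt has actually failed.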
Limitations
Designed only for text content, specifically documentation, articles, and reference materials
Not designed for high-volume scraping or data harvesting