diff --git a/README.md b/README.md index 69e4a958..5ad5c103 100644 --- a/README.md +++ b/README.md @@ -11,9 +11,6 @@
✨ Turn any website into clean, contextualized data pipelines for your AI applications ✨
-
-Maxun is the easiest way to extract web data with no code.
-
Go To App • @@ -37,55 +34,43 @@ Maxun helps you transform websites into structured APIs, clean markdown for AI w * **[AI Mode](https://docs.maxun.dev/robot/extract/llm-extraction)** - Describe what you want in natural language and let LLM-powered extraction do the rest. 2. **[Scrape](https://docs.maxun.dev/robot/scrape/scrape-robots)** – Convert full webpages into clean Markdown or HTML and capture screenshots. Ideal for AI workflows, agents, and document processing. No code required. - -3. **[SDK](https://docs.maxun.dev/sdk/sdk-overview)** – A complete developer toolkit for scraping, extraction, scheduling, and end-to-end data automation. - -Whether you prefer browsing through a website or integrating automation into your codebase, Maxun adapts to your workflow. - +3. **[Crawl](https://docs.maxun.dev/robot/crawl/crawl-introduction)** - Crawl entire websites and extract content from every relevant page, with full control over scope and discovery. +4. **[Search](https://docs.maxun.dev/robot/search/search-introduction)** - Run automated web searches to discover or scrape results, with support for time-based filters. +5. **[SDK](https://docs.maxun.dev/sdk/sdk-overview)** – A complete developer toolkit for scraping, extraction, scheduling, and end-to-end data automation. ## How Does It Work? -Maxun uses web robots to power everything you can do on the platform. There are two types of robots, each designed for a different job. +Maxun robots are automated tools that help you collect data from websites without writing any code. Think of them as your personal web assistants that can navigate websites, extract information, and organize data just like you would manually - but faster and more efficiently. -### 1. Extract Robots -**Extract robots emulate real user behavior and capture structured data.** - -Choose how to build them -### a. Recorder Mode: Record your actions as you browse -- Build robots visually by browsing like a human. -- Perfect for structured, deterministic data extraction. +There are four types of robots, each designed for a different job. +### 1. Extract +Extract emulates real user behavior and captures structured data. +- Recorder Mode - Record your actions as you browse; Maxun turns them into a reusable extraction robot. ### Example: Extract 10 Property Listings from Airbnb [https://github.com/user-attachments/assets/recorder-mode-demo-video](https://github.com/user-attachments/assets/c6baa75f-b950-482c-8d26-8a8b6c5382c3) - -### b. LLM Extraction (Beta): Describe what you want in plain language -- Use natural language to define extraction patterns. -- Works with closed source & open source LLMs. - -Get Started with LLM Extraction: https://docs.maxun.dev/robot/extract/llm-extraction - +- AI Mode - Describe what you want in natural language and let LLM-powered extraction do the rest. ### Example: Extract Names, Rating & Duration of Top 50 Movies from IMDb https://github.com/user-attachments/assets/f714e860-58d6-44ed-bbcd-c9374b629384 -### Core capabilities -- Extract from any website, including behind logins -- Convert sites into APIs, spreadsheets, and workflows -- Scale extractions and run on schedules or via API -- Handle infinite scrolling and pagination -- Auto-adapt to website layout & structural changes +Learn more here. -### 2. Scrape Robots -**Built for clean content and AI workflows.** -- Get clean HTML and LLM-ready Markdown from any website -- Remove scripts, styling, ads, and clutter automatically -- Perfect for RAG systems, AI summarization, embeddings, and content pipelines -- Ideal for feeding clean data to LLMs +### 2. Scrape +Scrape converts full webpages into clean Markdown, HTML and can capture screenshots. Ideal for AI workflows, agents, and document processing. -### Example: Scrape GitHub Trending Repositories in clean Markdown format +Learn more here. -https://github.com/user-attachments/assets/c774cbd4-5a85-45b7-b41f-128ee570eae6 +### 3. Crawl +Crawl entire websites and extract content from every relevant page, with full control over scope and discovery. + +Learn more here. + +### 4. Search +Run automated web searches to discover or scrape results, with support for time-based filters. + +Learn more here. ## Quick Start