<h2 align="center">
<div>
<a href="https://www.Dorod Parser.dev/?ref=ghread">
<img src="/src/assets/Dorod Parserlogo.png" width="70" />
<br>
Dorod Parser
</a>
</div>
Turn Any Website Into A Structured API
<br>
</h2>
<p align="center">
✨ The unified open-source no-code platform for real-time web scraping, crawling, search and AI data extraction ✨
<p align="center">
<a href="https://app.Dorod Parser.dev/?ref=ghread"><b>Go To App</b></a> •
<a href="https://docs.Dorod Parser.dev/?ref=ghread"><b>Documentation</b></a> •
<a href="https://www.Dorod Parser.dev/?ref=ghread"><b>Website</b></a> •
<a href="https://discord.gg/5GbPjBUkws"><b>Discord</b></a> •
<a href="https://www.youtube.com/@Dorod ParserOSS?ref=ghread"><b>Watch Tutorials</b></a>
<br />
<br />
<a href="https://trendshift.io/repositories/12113" target="_blank"><img src="https://trendshift.io/api/badge/repositories/12113" alt="getDorod Parser%2FDorod Parser | Trendshift" style="width: 250px; height: 55px; margin-top: 10px;" width="250" height="55"/></a>
</p>
## What is Dorod Parser?
Dorod Parser is an open-source no-code web data platform for turning the web into structured, reliable data.
It supports extraction, crawling, scraping, and search — designed to scale from simple use cases to complex, automated workflows.
### Ecosystem
1. **[Extract](https://docs.Dorod Parser.dev/category/extract)** – Emulate real user behavior and collect structured data from any website.
   * **[Recorder Mode](https://docs.Dorod Parser.dev/robot/extract/robot-actions)** – Record your actions as you browse; Dorod Parser turns them into a reusable extraction robot.
   * **[AI Mode](https://docs.Dorod Parser.dev/robot/extract/llm-extraction)** – Describe what you want in natural language and let LLM-powered extraction do the rest.
2. **[Scrape](https://docs.Dorod Parser.dev/robot/scrape/scrape-robots)** – Convert full webpages into clean Markdown or HTML and capture screenshots.
3. **[Crawl](https://docs.Dorod Parser.dev/robot/crawl/crawl-introduction)** – Crawl entire websites and extract content from every relevant page, with full control over scope and discovery.
4. **[Search](https://docs.Dorod Parser.dev/robot/search/search-introduction)** – Run automated web searches to discover or scrape results, with support for time-based filters.
5. **[SDK](https://docs.Dorod Parser.dev/sdk/sdk-overview)** – A complete developer toolkit for scraping, extraction, scheduling, and end-to-end data automation.
## How Does It Work?
Dorod Parser robots are automated tools that collect data from websites without requiring you to write any code. Think of them as personal web assistants that navigate websites, extract information, and organize data just as you would manually, only faster.

There are four types of robots, each designed for a different job.
### 1. Extract
Extract emulates real user behavior and captures structured data.
- <a href="https://docs.Dorod Parser.dev/robot/extract/robot-actions">Recorder Mode</a> - Record your actions as you browse; Dorod Parser turns them into a reusable extraction robot.
#### Example: Extract 10 Property Listings from Airbnb
[https://github.com/user-attachments/assets/recorder-mode-demo-video](https://github.com/user-attachments/assets/c6baa75f-b950-482c-8d26-8a8b6c5382c3)
- <a href="https://docs.Dorod Parser.dev/robot/extract/llm-extraction">AI Mode</a> - Describe what you want in natural language and let LLM-powered extraction do the rest.
#### Example: Extract Names, Ratings & Durations of the Top 50 Movies from IMDb
https://github.com/user-attachments/assets/f714e860-58d6-44ed-bbcd-c9374b629384

Learn more <a href="https://docs.Dorod Parser.dev/category/extract">here</a>.
### 2. Scrape
Scrape converts full webpages into clean Markdown or HTML and can also capture screenshots. It is ideal for AI workflows, agents, and document processing.

Learn more <a href="https://docs.Dorod Parser.dev/robot/scrape/scrape-robots">here</a>.
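
To give a feel for what "clean Markdown" output means, here is a toy TypeScript sketch that converts a few simple HTML tags to Markdown. It is not Dorod Parser's implementation and does not use its SDK; the `htmlToMarkdown` function and the `sampleHtml` input are hypothetical names made up for this illustration, and the converter only handles a handful of non-nested tags.

```typescript
// Toy converter for illustration only. This is NOT how Dorod Parser works
// internally; it handles just a few simple, non-nested tags.
function htmlToMarkdown(html: string): string {
  return html
    // <h1>..<h3> become "#", "##", "###" headings.
    .replace(
      /<h([1-3])[^>]*>([\s\S]*?)<\/h\1>/gi,
      (_match: string, level: string, text: string) =>
        "#".repeat(Number(level)) + " " + text + "\n\n"
    )
    // <strong> and <b> become **bold**.
    .replace(/<(strong|b)>([\s\S]*?)<\/\1>/gi, "**$2**")
    // <a href="..."> becomes [text](url).
    .replace(/<a\s+href="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi, "[$2]($1)")
    // Paragraphs become text followed by a blank line.
    .replace(/<p[^>]*>([\s\S]*?)<\/p>/gi, "$1\n\n")
    // Any remaining tags are dropped, keeping their inner text.
    .replace(/<[^>]+>/g, "")
    .trim();
}

// Hypothetical input page fragment.
const sampleHtml =
  '<h1>Pricing</h1><p>See the <a href="/plans">plans</a> for <strong>details</strong>.</p>';

console.log(htmlToMarkdown(sampleHtml));
```

A real scraper has to deal with nested markup, scripts, navigation chrome, and encoding issues, which is why a dedicated Scrape robot is preferable to hand-rolled conversion like this.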
### 3. Crawl
Crawl entire websites and extract content from every relevant page, with full control over scope and discovery.

Learn more <a href="https://docs.Dorod Parser.dev/robot/crawl/crawl-introduction">here</a>.
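
As a rough mental model of "scope and discovery", the sketch below runs a breadth-first crawl over a small in-memory link graph. It is a conceptual illustration under stated assumptions, not Dorod Parser's crawler or SDK: the `crawl` function, the `scopePrefix` and `maxDepth` options, and the sample `site` are all hypothetical.

```typescript
// Hypothetical in-memory "site": each page path maps to the links found on it.
type LinkGraph = Record<string, string[]>;

interface CrawlOptions {
  scopePrefix: string; // only follow links under this path prefix (assumed option)
  maxDepth: number; // maximum number of link hops from the start page (assumed option)
}

// Breadth-first crawl that returns the visited pages in discovery order.
function crawl(site: LinkGraph, start: string, options: CrawlOptions): string[] {
  const seen = new Set<string>([start]);
  const order: string[] = [];
  let frontier: string[] = [start];

  for (let depth = 0; depth <= options.maxDepth && frontier.length > 0; depth++) {
    const next: string[] = [];
    for (const page of frontier) {
      order.push(page);
      for (const link of site[page] || []) {
        // Discovery: queue a link only if it is in scope and not seen before.
        if (link.startsWith(options.scopePrefix) && !seen.has(link)) {
          seen.add(link);
          next.push(link);
        }
      }
    }
    frontier = next;
  }
  return order;
}

const site: LinkGraph = {
  "/docs": ["/docs/install", "/blog/hello", "/docs/sdk"],
  "/docs/install": ["/docs/install/docker", "/docs"],
  "/docs/sdk": [],
  "/docs/install/docker": [],
  "/blog/hello": ["/docs/sdk"],
};

// Stay under /docs and follow at most one hop from the start page.
const pages = crawl(site, "/docs", { scopePrefix: "/docs", maxDepth: 1 });
console.log(pages);
```

Here the scope prefix keeps the crawl out of `/blog`, and the depth limit stops it before `/docs/install/docker`; raising `maxDepth` to 2 would include that page as well.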
### 4. Search
Run automated web searches to discover or scrape results, with support for time-based filters.

Learn more <a href="https://docs.Dorod Parser.dev/robot/search/search-introduction">here</a>.
## Quick Start
### Getting Started
The simplest and fastest way to get started is to use the hosted version: https://app.Dorod Parser.dev. You can also self-host if you prefer.
### Installation
Dorod Parser can run locally with or without Docker:
1. [Setup with Docker Compose](https://docs.Dorod Parser.dev/installation/docker)
2. [Setup without Docker](https://docs.Dorod Parser.dev/installation/local)
3. [Environment Variables](https://docs.Dorod Parser.dev/installation/environment_variables)
4. [SDK](https://github.com/getDorod Parser/node-sdk)
### Upgrading & Self Hosting
1. [Self Host Dorod Parser With Docker & Portainer](https://docs.Dorod Parser.dev/self-host)
2. [Upgrade Dorod Parser With Docker Compose Setup](https://docs.Dorod Parser.dev/installation/upgrade#upgrading-with-docker-compose)
3. [Upgrade Dorod Parser Without Docker Compose Setup](https://docs.Dorod Parser.dev/installation/upgrade#upgrading-with-local-setup)
## Sponsors
<table>
<tr>
<td width="229">
<br/>
<a href="https://www.testmuai.com/?utm_medium=sponsor&utm_source=Dorod Parser" target="_blank">
<img src="https://github.com/user-attachments/assets/6c96005b-85df-43e0-9b63-96aaca676c11" /><br/><br/>
<b>TestMu AI</b>
</a>
<br/>
<sub>The Native AI-Agentic Cloud Platform to Supercharge Quality Engineering. Test Intelligently and Ship Faster.
</sub>
</td>
</tr>
</table>
## Features
- ✨ **Extract Data With No-Code** – Point and click interface
- ✨ **LLM-Powered Extraction** – Describe what you want; use LLMs to scrape structured data
- ✨ **Developer SDK** – Programmatic extraction, scheduling, and robot management
- ✨ **Handle Pagination & Scrolling** – Automatic navigation
- ✨ **Run Robots On Schedules** – Set it and forget it
- ✨ **Turn Websites to APIs** – RESTful endpoints from any site
- ✨ **Turn Websites to Spreadsheets** – Direct data export to Google Sheets & Airtable
- ✨ **Adapt To Website Layout Changes** – Auto-recovery from site updates
- ✨ **Extract Behind Login** – Handle authentication seamlessly
- ✨ **Integrations** – Connect with your favorite tools
- ✨ **MCP Support** – Model Context Protocol integration
- ✨ **LLM-Ready Data** – Clean Markdown for AI applications
- ✨ **Self-Hostable** – Full control over your infrastructure
- ✨ **Open Source** – Transparent and community-driven
## Demos
Dorod Parser supports a range of use cases, including lead generation, market research, and content aggregation.
View demos here: https://www.Dorod Parser.dev/usecases
## Note
This project is in the early stages of development. Your feedback is very important to us, and we're actively working on improvements.
## License
<p>
This project is licensed under <a href="./LICENSE">AGPLv3</a>.
</p>
## Project Values
We believe in fair and responsible use of open source.
If you rely on this project commercially, please consider contributing back
or supporting its development.
## Support Us
Star the repository, contribute if you love what we’re building, or [sponsor us](https://github.com/sponsors/amhsirak).
## Contributors
Thank you to the combined efforts of everyone who contributes!
<a href="https://github.com/getDorod Parser/Dorod Parser/graphs/contributors">
<img src="https://contrib.rocks/image?repo=getDorod Parser/Dorod Parser" />
</a>