Commit Graph

258 Commits

Author SHA1 Message Date
LawyZheng
6e4e73cc54 improve interactable parsing (#4355) 2025-12-23 01:44:35 +08:00
LawyZheng
ce717146f3 reenbale the download action (#4299) 2025-12-15 14:30:32 +08:00
LawyZheng
196ead43e6 increase incremental detecting logic (#4280) 2025-12-12 13:52:01 +08:00
LawyZheng
2de27637db fix hover action (#4245) 2025-12-10 02:39:17 +08:00
Mohamed Khalil
f49b07f30d feat: add hover action support (#3994)
Co-authored-by: LawyZheng <lawyzheng1106@gmail.com>
2025-12-09 23:27:26 +08:00
Stanislav Novosad
f754272f9c Extract BrowserState.scrape_website (#4184) 2025-12-03 15:08:32 -07:00
Stanislav Novosad
19d2deb859 Split browser_state/manager protocol and implementation (#4165) 2025-12-02 11:08:38 -07:00
LawyZheng
e0d537a099 fix ng selectable detect (#4104) 2025-11-26 23:02:57 +08:00
Shuchang Zheng
2b1b28e4d7 improve caching reliability by add retries for scrape_page (#4058) 2025-11-21 00:08:13 -08:00
LawyZheng
7f9916d3ca optimize bool type value parsing in domutils (#4046) 2025-11-20 15:16:24 +08:00
LawyZheng
31a79a1313 use value of the optiona when the content is empty (#4035) 2025-11-20 00:04:39 +08:00
LawyZheng
4cf6757d05 add remove all unique ids (#3964) 2025-11-12 02:41:55 +08:00
LawyZheng
dcbd2f7a26 support angular click event (#3960) 2025-11-11 11:26:15 +08:00
LawyZheng
f6145665c8 mark readonly element as interactable (#3845) 2025-10-29 13:10:21 +08:00
pedrohsdb
5b80614aac Wait time optimization xp (#3802) 2025-10-23 16:09:42 -07:00
LawyZheng
c2d7e6f2d0 check customized element pointer style (#3772) 2025-10-20 14:20:11 +08:00
LawyZheng
f69acdaadc readonly input type dropdown (#3717) 2025-10-15 14:39:27 +08:00
LawyZheng
e08778993e abort scraping when too many elements (#3678) 2025-10-10 15:46:55 +08:00
pedrohsdb
52c38a66c1 reverting lawys PR to fix workable (#3579) 2025-10-01 11:51:07 -07:00
pedrohsdb
80c7ea2577 Revert "skip malformed css selector" (#3578) 2025-10-01 11:38:07 -07:00
pedrohsdb
0fce84a384 skip malformed css selector (#3577) 2025-10-01 11:24:43 -07:00
LawyZheng
6b90f10221 remove valid css selector check (#3524) 2025-09-25 11:06:28 +08:00
LawyZheng
55bc6bd367 helper function for wait animation (#3240) 2025-08-20 14:28:01 +08:00
LawyZheng
7823ff9c46 start build tree from HTML element (#3237) 2025-08-20 10:58:18 +08:00
LawyZheng
458b7e43ab remove hard wait time in input action (#3229) 2025-08-19 14:26:25 +08:00
LawyZheng
1588d8018b improve dynamic wait when multiple frames (#3228) 2025-08-19 14:09:03 +08:00
LawyZheng
9a359ebfde decrease parse input prompt token (#3210) 2025-08-16 10:05:38 +08:00
LawyZheng
654cdb14e4 fix wait for animation end (#3201) 2025-08-15 15:24:54 +08:00
LawyZheng
6b8d29a23d fix stop waiting bug (#3197) 2025-08-15 03:51:39 +08:00
LawyZheng
cac4792f38 remove hard waiting time in scraping (#3195) 2025-08-15 02:24:59 +08:00
LawyZheng
f971cf8e58 optimize cache element tree logic (#3194) 2025-08-15 02:06:08 +08:00
LawyZheng
81767e3189 optimize scraping part 4 (#3192) 2025-08-15 01:55:59 +08:00
LawyZheng
04fd540cd5 stop building element tree again and again when drawing boudingbox (#3191) 2025-08-15 01:40:39 +08:00
LawyZheng
2556d04e70 fix scraping edge case (#3186) 2025-08-14 15:04:15 +08:00
LawyZheng
65e9cb10e9 optimize scraping part 2 (#3185) 2025-08-14 14:51:43 +08:00
LawyZheng
30606645ea optimize scraping part 1 (#3184) 2025-08-14 14:24:21 +08:00
LawyZheng
b88cf18590 optimize scraping part 3 (#3183) 2025-08-14 14:12:16 +08:00
Shuchang Zheng
52dc5a510b fix economy element tree trimming (#3182) 2025-08-13 21:45:14 -07:00
Shuchang Zheng
434bbff459 add support_empty_page and wait_seconds to the scrape_website interface (#3181) 2025-08-13 19:22:50 -07:00
devsy-bot[bot]
e3a3309e9c fix: change scraper log level from info to debug (#3143)
Co-authored-by: devsy-bot <no-reply@devsy.ai>
Co-authored-by: Claude <noreply@anthropic.com>
2025-08-08 08:56:30 -07:00
LawyZheng
f33906509f fix dom listener bug (#3095) 2025-08-04 11:10:49 +08:00
LawyZheng
ecc0e2e17d better failure reason for blank page (#3049)
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-07-29 14:40:54 +08:00
LawyZheng
bff7544b83 fix scraping issue (#3035) 2025-07-25 12:53:22 +08:00
LawyZheng
4093a7fab0 fix style map parsing (#3029) 2025-07-25 00:50:06 +08:00
LawyZheng
fcd22017b7 make scraping timeout configurable (#2991) 2025-07-19 13:18:12 +08:00
Jonathan Dobson
f5d7639de8 allow empty urls (#2984) 2025-07-18 10:20:33 -04:00
Jonathan Dobson
c13c36f99e distinctify failed scrapes due to no url (#2977) 2025-07-17 16:19:16 -04:00
LawyZheng
5363d33dcc fix interactable detecting (#2941) 2025-07-15 03:31:34 +08:00
LawyZheng
dd9710eb9f add force textural element as interactable exp (#2936) 2025-07-14 13:09:40 +08:00
LawyZheng
95ab8295ce laminar integration (#2887) 2025-07-07 14:43:10 +08:00