Commit Graph

157 Commits

Author SHA1 Message Date
Asher Foa
effd0c4911 Add pyupgrade pre-commit hook + modernize python code (#2611) 2025-06-10 18:52:38 +00:00
Shuchang Zheng
09ca00e948 fix timeout for svg and css LLM call (#2662)
Co-authored-by: lawyzheng <lawyzheng1106@gmail.com>
2025-06-10 07:23:15 +00:00
Shuchang Zheng
3a927aa4e2 fix upload file storage class input style (#2618) 2025-06-06 01:36:46 -07:00
Asher Foa
990b1c921f Add support for s3 storage classes + added logging when we upload to s3 (#2610) 2025-06-05 18:26:52 -04:00
Shuchang Zheng
06d026a231 fallback gemini to gpt + use latest gemini flash 2.5 release (#2588) 2025-06-04 14:41:38 +08:00
Shuchang Zheng
2ddcb9eb0b log duration and prompt_name in llm failure log (#2578) 2025-06-03 10:24:47 -07:00
Shuchang Zheng
aef945cb63 fix llm key override in llm_api_handler_with_router_and_fallback (#2562) 2025-05-31 23:23:37 -07:00
Tiger Shen
73e366bbad Fix Claude 4 model names to use specific versions instead of -latest suffix (#2531) 2025-05-30 18:13:05 -07:00
Shuchang Zheng
5580548e7d support anthropic claude 4 bedrock (#2539) 2025-05-30 16:16:07 -07:00
Shuchang Zheng
ea5620acd2 weave LLMCaller.llm_key through to api handler/agent (#2524) 2025-05-29 16:49:59 -04:00
Shuchang Zheng
08bf2bc428 backend changes extracted from codex/jon/SKY-5016 (#2508) 2025-05-29 09:15:04 -04:00
Shuchang Zheng
fc71c32133 Add Anthropic Claude 4 and 3.5 Haiku models (#2477) 2025-05-26 22:18:20 -04:00
Shuchang Zheng
f49b5c4100 add gemini 2.5 flash preview 05-20 (#2434) 2025-05-22 14:12:54 -07:00
Shuchang Zheng
e5f52b8c9d add download filename rule for supabase (#2432) 2025-05-22 10:53:59 -07:00
Shuchang Zheng
3c7ac35d4f use vertex global region (#2326) 2025-05-12 10:28:10 +08:00
Shuchang Zheng
1fbaf711b1 fix anthropic llm stats (#2313) 2025-05-08 14:30:00 -07:00
Shuchang Zheng
349e506dc1 shu/remove max completion tokens for ollama (#2300) 2025-05-06 12:21:42 -07:00
Shuchang Zheng
28b1ff4872 remove dead code in aws.py - execute_with_async_client (#2287) 2025-05-05 00:41:48 -07:00
Shuchang Zheng
e2a82a75d9 fix linter errors for config registry & cli (#2292) 2025-05-05 00:12:46 -07:00
Prakash Maheshwaran
c3072d7572 Added Ollama & Openrouter & Groq & improved cdp browser (#2283) 2025-05-05 00:03:23 -07:00
Shuchang Zheng
dbcc48833e fix LiteLLMParams typing (#2285) 2025-05-04 01:02:01 -07:00
Shuchang Zheng
8b834436b5 anthropic support - dymanic window size / window popup (#2284) 2025-05-04 00:40:16 -07:00
Shuchang Zheng
38f8a21d32 anthropic CUA - limit image message count (#2265) 2025-04-30 18:03:40 -07:00
Shuchang Zheng
eacf9d5388 resize screenshots to make it compatible with anthropic CUA (#2255) 2025-04-30 01:13:36 -07:00
Shuchang Zheng
53f4c23ef9 fix bug (#2249) 2025-04-30 00:06:59 +08:00
Shuchang Zheng
f4ea546ae4 fix anthropic tool result format (#2246) 2025-04-29 04:17:50 +08:00
Shuchang Zheng
47ad15a864 add vertex ai (#2243) 2025-04-29 03:11:56 +08:00
Shuchang Zheng
7e4a193443 fix tool_call_input parsing with anthropic sdk (#2242) 2025-04-29 02:57:56 +08:00
Shuchang Zheng
1449720038 enable thinking in anthropic CUA (#2240) 2025-04-29 00:16:22 +08:00
Shuchang Zheng
8e6819c987 fix anthropic llm caller routing (#2241) 2025-04-29 00:06:37 +08:00
Shuchang Zheng
9e7906a083 fix anthropic messages image input pattern (#2239) 2025-04-28 23:12:45 +08:00
Shuchang Zheng
1530338cad Anthropic CUA fix (#2238) 2025-04-28 18:15:23 +08:00
Shuchang Zheng
a8d237deee use anthropic beta client (#2235) 2025-04-28 16:24:12 +08:00
Shuchang Zheng
0a0228b341 anthropic CUA (#2231)
Co-authored-by: ellipsis-dev[bot] <65095814+ellipsis-dev[bot]@users.noreply.github.com>
2025-04-28 09:49:44 +08:00
Shuchang Zheng
56cfb55096 add LLMCaller that supports message history (#2204) 2025-04-24 03:37:26 +08:00
Shuchang Zheng
ef36125b91 debug log: Using general model configuration for unknown LLM key (#2223) 2025-04-24 02:08:00 +08:00
Shuchang Zheng
fc72240c03 bedrock claude 3.7 sonnet (#2203) 2025-04-21 15:48:08 +08:00
Shuchang Zheng
7a15ea7405 update latest openai model configs for azure & openai (#2202) 2025-04-21 14:44:15 +08:00
Brandon Beiler
3d381a60f0 Feature: Configuring OpenAI-Compatible (LiteLLM) Models (#2166)
Co-authored-by: bbeiler <bbeiler@ridgelineintl.com>
2025-04-21 08:25:59 +08:00
Suchintan
eefddb4105 GPT-o4-mini and GPT-o3 support is here (#2170) 2025-04-16 21:34:00 -04:00
Shuchang Zheng
3154a9ce10 fix llm cost for gpt4.1 (#2172) 2025-04-16 15:32:51 -07:00
Suchintan
c829966936 GPT-4.1-support (#2159) 2025-04-16 02:12:33 -04:00
Shuchang Zheng
80cdc1e0e2 Retry url encode fix (#2136) 2025-04-12 15:18:15 -07:00
Shuchang Zheng
cb46232cff fix url encoding (#2135) 2025-04-12 13:35:43 -07:00
Shuchang Zheng
7e5e684f53 shu/gemini 2.5 pro support (#2109) 2025-04-06 00:03:24 -04:00
Shuchang Zheng
c45ff02872 fix url encoding problem when downloading file (#2103) 2025-04-04 12:16:39 -04:00
Suchintan
ff57f9977c Update docs plus init (#2073)
Co-authored-by: Shuchang Zheng <wintonzheng0325@gmail.com>
2025-04-03 00:46:57 -04:00
Shuchang Zheng
6d8a49d5b5 file upload block backend (#2000) 2025-03-23 15:37:20 -07:00
Shuchang Zheng
f924185caf fix cached_tokens & completion_tokens (#1986) 2025-03-20 19:56:02 -07:00
Shuchang Zheng
eb3eb4eede track reasoning token and cached token (#1985) 2025-03-20 16:42:57 -07:00