Commit Graph

30 Commits

Author SHA1 Message Date
Shuchang Zheng
ea5620acd2 weave LLMCaller.llm_key through to api handler/agent (#2524) 2025-05-29 16:49:59 -04:00
Shuchang Zheng
3c7ac35d4f use vertex global region (#2326) 2025-05-12 10:28:10 +08:00
Shuchang Zheng
dbcc48833e fix LiteLLMParams typing (#2285) 2025-05-04 01:02:01 -07:00
Shuchang Zheng
205e2b35c0 return dummy llm api handler which will raise exception if called (#1954) 2025-03-17 12:37:10 -07:00
Shuchang Zheng
179c12db8c add support back for old gpt versions by supporting max_tokens (#1860) 2025-03-02 00:16:00 -05:00
Shuchang Zheng
14689b53e4 task v2 refactor part 10: ObserverTask -> TaskV2 in backend code (#1839) 2025-02-27 23:19:02 -05:00
Shuchang Zheng
ffbc95e1b4 task v2 refactor part 6 - observer_cruise_id -> task_v2_id (#1817) 2025-02-23 16:03:49 -08:00
Shuchang Zheng
713005f403 task v2 refactor part 2: observer schema py -> task_v2 schema py (#1814) 2025-02-22 03:21:19 -08:00
Shuchang Zheng
59756cb1d2 O3 Mini support (#1709) 2025-02-03 16:07:10 -05:00
Shuchang Zheng
204972e225 Add step / task / workflow run / observer metrics as logs (#1698)
Co-authored-by: Suchintan <suchintan@users.noreply.github.com>
2025-02-01 14:10:38 -05:00
Shuchang Zheng
c158ad3f21 migrate observer to task v2 (#1564) 2025-01-15 09:59:18 -08:00
Shuchang Zheng
5796de73d1 Add AI suggestion endpoints (#1519) 2025-01-08 21:45:38 -08:00
Shuchang Zheng
175ce55f06 recreate index for artifacts table (#1455) 2024-12-31 01:38:07 -08:00
Shuchang Zheng
5842bfc1fd create cruise related artifact in cruise api (#1355) 2024-12-08 21:17:58 -08:00
Shuchang Zheng
bf06fcfeb7 Update max output tokens to 16K (#1110) 2024-11-02 21:46:55 -07:00
Shuchang Zheng
60069a6256 fix LiteLLMParams (#958) 2024-10-10 23:24:27 -07:00
Shuchang Zheng
b536f1d27f gpt4o mini pricing for massive (#957) 2024-10-10 22:57:54 -07:00
LawyZheng
46a139bd27 litellm cooldown config (#711) 2024-08-19 16:49:26 +08:00
LawyZheng
98e2f7f206 add gpt4o mini support (#666) 2024-08-02 19:35:52 +08:00
Kerem Yilmaz
fd9419b33a clean up gpt4 routers. add model_info base_model for azure gpt router (#620) 2024-07-18 15:37:20 -07:00
Shuchang Zheng
9e1b07451f update pre-commit (#330) 2024-05-16 17:11:49 -07:00
Kerem Yilmaz
cf01e81ba2 Force Claude 3 models to output JSON object and parse it more reliably (#293)
Co-authored-by: otmane <otmanebenazzou.pro@gmail.com>
2024-05-10 00:51:12 -07:00
Kerem Yilmaz
4459ca7d09 Integrate posthog feature flags + setup Claude 3 experiment (#251) 2024-05-02 15:06:21 -07:00
Kerem Yilmaz
3d712cb0fb llm router for gpt4 turbo (#237) 2024-04-25 01:32:22 -07:00
Kerem Yilmaz
f5125a9856 Ykeremy/test new litellm version (#223) 2024-04-23 23:48:35 -07:00
Kerem Yilmaz
3c06d44f6f Ykeremy/revert litellm upgrade (#204) 2024-04-17 11:51:14 -07:00
Kerem Yilmaz
02cf2a1e87 New LLM Router strategy (#199) 2024-04-16 14:52:15 -07:00
Kerem Yilmaz
3cc07c9d20 Ykeremy/fix router pass empty redis password (#143) 2024-04-01 16:28:46 -07:00
Kerem Yilmaz
1c397a13af Implement LLMRouter (#127) 2024-03-27 14:44:25 -07:00
Kerem Yilmaz
d1de19556e Implement LLM router (#95) 2024-03-16 23:13:18 -07:00