Dorod-Sky

Author	SHA1	Message	Date
Shuchang Zheng	ea5620acd2	weave LLMCaller.llm_key through to api handler/agent (#2524 )	2025-05-29 16:49:59 -04:00
Shuchang Zheng	3c7ac35d4f	use vertex global region (#2326 )	2025-05-12 10:28:10 +08:00
Shuchang Zheng	dbcc48833e	fix LiteLLMParams typing (#2285 )	2025-05-04 01:02:01 -07:00
Shuchang Zheng	205e2b35c0	return dummy llm api handler which will raise exception if called (#1954 )	2025-03-17 12:37:10 -07:00
Shuchang Zheng	179c12db8c	add support back for old gpt versions by supporting `max_tokens` (#1860 )	2025-03-02 00:16:00 -05:00
Shuchang Zheng	14689b53e4	task v2 refactor part 10: ObserverTask -> TaskV2 in backend code (#1839 )	2025-02-27 23:19:02 -05:00
Shuchang Zheng	ffbc95e1b4	task v2 refactor part 6 - observer_cruise_id -> task_v2_id (#1817 )	2025-02-23 16:03:49 -08:00
Shuchang Zheng	713005f403	task v2 refactor part 2: observer schema py -> task_v2 schema py (#1814 )	2025-02-22 03:21:19 -08:00
Shuchang Zheng	59756cb1d2	O3 Mini support (#1709 )	2025-02-03 16:07:10 -05:00
Shuchang Zheng	204972e225	Add step / task / workflow run / observer metrics as logs (#1698 ) Co-authored-by: Suchintan <suchintan@users.noreply.github.com>	2025-02-01 14:10:38 -05:00
Shuchang Zheng	c158ad3f21	migrate observer to task v2 (#1564 )	2025-01-15 09:59:18 -08:00
Shuchang Zheng	5796de73d1	Add AI suggestion endpoints (#1519 )	2025-01-08 21:45:38 -08:00
Shuchang Zheng	175ce55f06	recreate index for artifacts table (#1455 )	2024-12-31 01:38:07 -08:00
Shuchang Zheng	5842bfc1fd	create cruise related artifact in cruise api (#1355 )	2024-12-08 21:17:58 -08:00
Shuchang Zheng	bf06fcfeb7	Update max output tokens to 16K (#1110 )	2024-11-02 21:46:55 -07:00
Shuchang Zheng	60069a6256	fix LiteLLMParams (#958 )	2024-10-10 23:24:27 -07:00
Shuchang Zheng	b536f1d27f	gpt4o mini pricing for massive (#957 )	2024-10-10 22:57:54 -07:00
LawyZheng	46a139bd27	litellm cooldown config (#711 )	2024-08-19 16:49:26 +08:00
LawyZheng	98e2f7f206	add gpt4o mini support (#666 )	2024-08-02 19:35:52 +08:00
Kerem Yilmaz	fd9419b33a	clean up gpt4 routers. add model_info base_model for azure gpt router (#620 )	2024-07-18 15:37:20 -07:00
Shuchang Zheng	9e1b07451f	update pre-commit (#330 )	2024-05-16 17:11:49 -07:00
Kerem Yilmaz	cf01e81ba2	Force Claude 3 models to output JSON object and parse it more reliably (#293 ) Co-authored-by: otmane <otmanebenazzou.pro@gmail.com>	2024-05-10 00:51:12 -07:00
Kerem Yilmaz	4459ca7d09	Integrate posthog feature flags + setup Claude 3 experiment (#251 )	2024-05-02 15:06:21 -07:00
Kerem Yilmaz	3d712cb0fb	llm router for gpt4 turbo (#237 )	2024-04-25 01:32:22 -07:00
Kerem Yilmaz	f5125a9856	Ykeremy/test new litellm version (#223 )	2024-04-23 23:48:35 -07:00
Kerem Yilmaz	3c06d44f6f	Ykeremy/revert litellm upgrade (#204 )	2024-04-17 11:51:14 -07:00
Kerem Yilmaz	02cf2a1e87	New LLM Router strategy (#199 )	2024-04-16 14:52:15 -07:00
Kerem Yilmaz	3cc07c9d20	Ykeremy/fix router pass empty redis password (#143 )	2024-04-01 16:28:46 -07:00
Kerem Yilmaz	1c397a13af	Implement LLMRouter (#127 )	2024-03-27 14:44:25 -07:00
Kerem Yilmaz	d1de19556e	Implement LLM router (#95 )	2024-03-16 23:13:18 -07:00

30 Commits