Shuchang Zheng
|
6db01f4783
|
gemini 1.5 pro & flash (#1308)
|
2024-12-02 15:59:59 -08:00 |
|
Shuchang Zheng
|
57e6404881
|
add gemeni pro support (#1307)
|
2024-12-02 15:51:18 -08:00 |
|
Shuchang Zheng
|
7f6b2c0929
|
shu/removeSettingsManager.get_settings (#1305)
|
2024-12-02 15:01:22 -08:00 |
|
Shuchang Zheng
|
675933128c
|
support bedrock sonnet 3.5 inference profile (#1195)
|
2024-11-14 14:37:53 -08:00 |
|
Shuchang Zheng
|
281feb60e4
|
Remove Call LLM API Log (#1132)
|
2024-11-04 22:00:15 -05:00 |
|
Shuchang Zheng
|
2ac8a1a7d0
|
SKY-3915 Remove incomplete json fixing logic -- models have gotten better (#1121)
|
2024-11-04 00:56:58 -05:00 |
|
Shuchang Zheng
|
bf06fcfeb7
|
Update max output tokens to 16K (#1110)
|
2024-11-02 21:46:55 -07:00 |
|
Suchintan
|
7cba401e2e
|
Add support for the latest sonnet 3.5 (#1025)
|
2024-10-22 16:57:26 -04:00 |
|
Shuchang Zheng
|
60069a6256
|
fix LiteLLMParams (#958)
|
2024-10-10 23:24:27 -07:00 |
|
Shuchang Zheng
|
b536f1d27f
|
gpt4o mini pricing for massive (#957)
|
2024-10-10 22:57:54 -07:00 |
|
Shuchang Zheng
|
06daa9834f
|
catch litellm no deployment error (token ratelimit / ValueError) and raise LLMProviderErrorRetryableTask (#758)
|
2024-08-29 12:19:51 -07:00 |
|
Shuchang Zheng
|
b5d743d6eb
|
handle litellm APIError for the router and raise LLMProviderErrorRetryableTask (#750)
|
2024-08-28 10:23:16 -07:00 |
|
Kerem Yilmaz
|
a6a545224a
|
adding step_exception to provide more signal of why the step failed (#730)
|
2024-08-28 01:06:04 -07:00 |
|
Shuchang Zheng
|
1967e44acc
|
add support for gpt-4o-2024-08-06 (#715)
|
2024-08-22 14:08:43 -07:00 |
|
LawyZheng
|
46a139bd27
|
litellm cooldown config (#711)
|
2024-08-19 16:49:26 +08:00 |
|
LawyZheng
|
98e2f7f206
|
add gpt4o mini support (#666)
|
2024-08-02 19:35:52 +08:00 |
|
Shuchang Zheng
|
dd188b5f4d
|
fix t_llm_request is unbounded problem (#657)
|
2024-07-29 13:54:42 -07:00 |
|
Shuchang Zheng
|
c52f8f2433
|
shu/making scrolling debug logs (#635)
|
2024-07-25 12:56:10 -07:00 |
|
Kerem Yilmaz
|
20a611a935
|
Use perf-counter instead of time (#643)
|
2024-07-24 23:47:02 -07:00 |
|
Kerem Yilmaz
|
5000f3d762
|
Catch CanceledError in litellm call (#642)
|
2024-07-24 23:41:34 -07:00 |
|
Shuchang Zheng
|
ec5a0a03c0
|
add gpt4o-mini support (#625)
|
2024-07-18 22:33:19 -07:00 |
|
Kerem Yilmaz
|
fd9419b33a
|
clean up gpt4 routers. add model_info base_model for azure gpt router (#620)
|
2024-07-18 15:37:20 -07:00 |
|
Kerem Yilmaz
|
0ab41a207a
|
Start using Claude 3.5 Sonnet (#497)
|
2024-06-20 10:33:17 -07:00 |
|
Kerem Yilmaz
|
14009bae34
|
Ykeremy/gpt4o router (#431)
|
2024-06-05 21:36:32 -07:00 |
|
Kerem Yilmaz
|
04c6e55848
|
Keep track of token counts in steps table (#412)
|
2024-06-03 15:55:34 -07:00 |
|
Kerem Yilmaz
|
753bac3174
|
Set 5 min timeout for LLM calls (#358)
|
2024-05-23 18:49:49 -07:00 |
|
Kerem Yilmaz
|
df2aee3c79
|
Add logs (#357)
|
2024-05-23 12:07:49 -07:00 |
|
Kerem Yilmaz
|
e6d4302d8c
|
Auto-fix invalid JSON (#354)
|
2024-05-21 22:04:32 -07:00 |
|
Kerem Yilmaz
|
6908ca5844
|
Pass base parameters to acompletion (#343)
|
2024-05-17 16:36:15 -07:00 |
|
Suchintan
|
9a6ec13af6
|
Ruff (#339)
|
2024-05-17 12:10:54 -04:00 |
|
Divyansh Verma
|
8598f56a83
|
GPT4 integration (#324)
Co-authored-by: Suchintan <suchintan@users.noreply.github.com>
|
2024-05-17 11:25:03 -04:00 |
|
Hasnain
|
45a2840538
|
gpt-4o integration (#333)
|
2024-05-17 11:22:08 -04:00 |
|
Shuchang Zheng
|
2466897158
|
adopt ruff as the replacement for python black (#332)
|
2024-05-16 18:20:11 -07:00 |
|
Shuchang Zheng
|
9e1b07451f
|
update pre-commit (#330)
|
2024-05-16 17:11:49 -07:00 |
|
OB42
|
72389b217c
|
add missing check for ENABLE_BEDROCK in LLMConfigRegistry (#298)
Co-authored-by: Kerem Yilmaz <kerem@skyvern.com>
|
2024-05-14 10:07:47 -07:00 |
|
Kerem Yilmaz
|
cf01e81ba2
|
Force Claude 3 models to output JSON object and parse it more reliably (#293)
Co-authored-by: otmane <otmanebenazzou.pro@gmail.com>
|
2024-05-10 00:51:12 -07:00 |
|
Kerem Yilmaz
|
bf55bf8314
|
Default the old anthropic model name for backwards compatibility (#285)
|
2024-05-08 23:43:52 -04:00 |
|
Kerem Yilmaz
|
e5d094493e
|
Define haiku + prompt engine takes a directory arg (#279)
|
2024-05-08 02:07:18 -07:00 |
|
Kerem Yilmaz
|
4459ca7d09
|
Integrate posthog feature flags + setup Claude 3 experiment (#251)
|
2024-05-02 15:06:21 -07:00 |
|
Shuchang Zheng
|
a1fa5ee4e0
|
log main_model_group when router.acompletion has an unexpected error (#249)
|
2024-05-02 14:20:49 -04:00 |
|
Kerem Yilmaz
|
3d712cb0fb
|
llm router for gpt4 turbo (#237)
|
2024-04-25 01:32:22 -07:00 |
|
Suchintan
|
fdef12dac3
|
GPT-4-Turbo support vision=true (#234)
|
2024-04-24 15:42:39 -04:00 |
|
Suchintan
|
2c4c309523
|
GPT-4-Turbo w/ Vision is GA now (#230)
|
2024-04-24 13:56:52 -04:00 |
|
Kerem Yilmaz
|
f5125a9856
|
Ykeremy/test new litellm version (#223)
|
2024-04-23 23:48:35 -07:00 |
|
Kerem Yilmaz
|
3c06d44f6f
|
Ykeremy/revert litellm upgrade (#204)
|
2024-04-17 11:51:14 -07:00 |
|
Kerem Yilmaz
|
02cf2a1e87
|
New LLM Router strategy (#199)
|
2024-04-16 14:52:15 -07:00 |
|
Kerem Yilmaz
|
39d7d91938
|
Enable precall checks for LiteLLM Router (#174)
|
2024-04-09 23:46:12 -07:00 |
|
Kerem Yilmaz
|
3cc07c9d20
|
Ykeremy/fix router pass empty redis password (#143)
|
2024-04-01 16:28:46 -07:00 |
|
Kerem Yilmaz
|
1c397a13af
|
Implement LLMRouter (#127)
|
2024-03-27 14:44:25 -07:00 |
|
Kerem Yilmaz
|
d1de19556e
|
Implement LLM router (#95)
|
2024-03-16 23:13:18 -07:00 |
|