Reduce size of swarms: tiktoken using torch #705

We currently seem to have a dependency in the API server on tiktoken that uses torch; this adds many GBs to the Docker image. Can we please remove or refactor that?
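One way to confirm what is actually pulling torch into the image is a quick dependency scan inside the container. A minimal sketch using only the standard library (nothing here is specific to swarms):

```python
import re
from importlib.metadata import distributions

# Diagnostic sketch: list every installed distribution whose declared
# requirements name torch, to see what is really pulling it into the image.
for dist in distributions():
    for req in dist.requires or []:
        match = re.match(r"[A-Za-z0-9_.-]+", req)
        if match and match.group(0).lower() == "torch":
            print(f"{dist.metadata['Name']} -> {req}")
```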
Comments
@jmikedupont2 this will be difficult because we need to count the tokens the agents use.
@jmikedupont2 the solution I can think of is: set up a tiktoken API that counts the number of tokens in each request and returns the count. But this will take some time. How much size is it adding to the server?
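A minimal sketch of that idea, assuming FastAPI and the standalone tiktoken package; the endpoint name, request shape, and encoding choice are illustrative, not from the repo. For what it's worth, tiktoken itself does not depend on torch:

```python
import tiktoken
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
# cl100k_base is the encoding used by gpt-3.5/gpt-4 models.
encoder = tiktoken.get_encoding("cl100k_base")

class CountRequest(BaseModel):
    text: str

@app.post("/count_tokens")
def count_tokens(req: CountRequest) -> dict:
    # Encode the text and return only the token count.
    return {"tokens": len(encoder.encode(req.text))}
```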
@jmikedupont2 I'm removing 2 more packages to slim down the size ;)
What if we move the token counting to a separate litellm proxy server?
That could work well. Would we need an API for it?
A litellm proxy server would run in the same environment as the swarms API server and would wrap its incoming and outgoing API calls:

user -> request -> litellm server: supabase(user_usage += len(request)) -> swarms API call -> litellm server: supabase(user_usage += len(answer)) -> answer -> user

The instructions for setting up litellm are at https://docs.litellm.ai/docs/proxy/deploy
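A rough sketch of that flow, under stated assumptions: it uses litellm's documented custom success callbacks together with supabase-py, and the user_usage table and its columns are made-up names for illustration:

```python
import os
import litellm
from supabase import create_client

# Assumes SUPABASE_URL / SUPABASE_KEY are set in the proxy's environment.
supabase = create_client(os.environ["SUPABASE_URL"], os.environ["SUPABASE_KEY"])

def log_usage(kwargs, completion_response, start_time, end_time):
    # litellm reports prompt + completion tokens in the response usage block,
    # so no separate tiktoken pass is needed for proxied calls.
    tokens = completion_response["usage"]["total_tokens"]
    user_id = kwargs.get("user", "anonymous")  # set per request by the caller
    supabase.table("user_usage").insert(
        {"user_id": user_id, "tokens": tokens}
    ).execute()

# Register the callback; it fires after every successful completion.
litellm.success_callback = [log_usage]

# Any litellm call made on behalf of the swarms API server now logs usage:
response = litellm.completion(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "hello"}],
    user="user-123",
)
```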
I have a fork of litellm at https://github.com/jmikedupont2/openlightllm that I would like to continue working on; it removes all the non-open-source mess.
I might have some Terraform for that somewhere as well.