Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document design of dial assistant context management #22

Open
avavilau opened this issue Nov 15, 2023 · 1 comment
Open

Document design of dial assistant context management #22

avavilau opened this issue Nov 15, 2023 · 1 comment
Assignees
Labels
documentation Improvements or additions to documentation

Comments

@avavilau
Copy link

avavilau commented Nov 15, 2023

  • Target state (design)
  • API Spec for Assistant limits
  • Smaller task list with ETAs
@avavilau avavilau added the documentation Improvements or additions to documentation label Nov 20, 2023
@avavilau avavilau moved this to In Testing in AI DIAL Nov 20, 2023
@Oleksii-Klimov Oleksii-Klimov modified the milestone: 2023-11-29 Nov 23, 2023
@avavilau avavilau moved this from In Testing to Code Review in AI DIAL Nov 27, 2023
@Oleksii-Klimov
Copy link
Collaborator

Oleksii-Klimov commented Dec 11, 2023

Target state:

Usage scenario:

  1. Tokenize the completion request using the tokenization endpoint.
  2. If the request fits then go to step #⁠4.
  3. If the request does not fit, reduce the number of messages or message sizes then repeat step #⁠1.
  4. Send the request to the chat completion endpoint.
    The truncation endpoint aids in managing history size by allowing for the discarding of old messages.

@Oleksii-Klimov Oleksii-Klimov moved this from Code Review to Done in AI DIAL Mar 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
Status: Done
Development

No branches or pull requests

2 participants