Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

Open
hurenjun opened this issue Aug 12, 2024 · 8 comments
Open

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

hurenjun opened this issue Aug 12, 2024 · 8 comments
Labels
enhancement New feature or request

Comments

@hurenjun
Copy link

Hi there,

Thank you for bringing the elegant RAG Assessment framework to the community.

I am an AI engineer from Alibaba Cloud, and our team has been fine-tuning LLM-as-a-Judge models based on the Qwen foundation LLMs. Through extensive optimizations, our latest model has achieved GPT-4 level alignment with human preferences (indeed, it's performing approximately 5% better on our benchmarks) and it is particularly optimized for Chinese language support.

We are very interested in integrating our model as an evaluation LLM within RAGAS. Additionally, we would be happy to support the use of LLM hosted on Alibaba Cloud's LLM serving platform, EAS, as extension to the current support of AWA, Azure, and Google Vertex AI.

Please let me know if these contributions could be included in RAGAS.

I look forward to your response.

Best regards,
Renjun

@hurenjun hurenjun added the enhancement New feature or request label Aug 12, 2024
@jjmachan
Copy link
Member

@hurenjun would love to explore this further too? what would be the best way forward you have in mind?

would the models be open-sourced by the way? I think it would be really helpful for the chinese userbase because we had a few model that were not supported/working as expected.

@hurenjun hurenjun reopened this Aug 14, 2024
@hurenjun
Copy link
Author

Hi @jjmachan, we could offer two methods, following the existing practices of ragas, for integrating our models:

  1. We have supported users in accessing our judge models via an API key. We can integrate this approach into RAGAS to facilitate quick access for users, similar to the current OpenAI/Together model.

  2. We will enable users to deploy our latest judge models on their own resources within Alibaba Cloud. We can assist in implementing support for Alibaba Cloud's EAS as an addition to the current integrated LLM services from Azure, AWS, and Vertex.

In addition, we will continue to iterate on our models and will open-source them as part of our contribution to the community. Currently, we have a paper under double-blind review. I will figure out the best way to do this without violating the anonymity requirement.

@hurenjun
Copy link
Author

@jjmachan Hi, is there any feedback on the above proposal?

@landhu
Copy link

landhu commented Aug 21, 2024

@hurenjun I think you can check https://docs.ragas.io/en/stable/howtos/customisations/bring-your-own-llm-or-embs.html
Also, aliyun API compatible with Openai. you could replace base_url.

@hurenjun
Copy link
Author

@landhu Thank you for your comment.
That should work. And I am considering that ragas could maintain some third-party LLMs for evaluation and other purposes, like those on huggingface, in addition to the OpenAI models.

@jjmachan
Copy link
Member

jjmachan commented Sep 5, 2024

hey @hurenjun sorry about the delay but we are working on #1237 which will have tabs for popular LLM providers in the getting started page. This should make it easier to use.

In the mean time we can maybe write a specific notebook in the how to guides as well if that would help users too - what do you think?

@hurenjun
Copy link
Author

hurenjun commented Sep 6, 2024

@jjmachan That's gonna be very helpful, especially for developers in regions without directyt access to openai models.
Looking forward to the new feature.

@jjmachan
Copy link
Member

jjmachan commented Sep 6, 2024

will keep you posted 🙂

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants