Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

hurenjun · 2024-08-12T09:16:12Z

Hi there,

Thank you for bringing the elegant RAG Assessment framework to the community.

I am an AI engineer from Alibaba Cloud, and our team has been fine-tuning LLM-as-a-Judge models based on the Qwen foundation LLMs. Through extensive optimizations, our latest model has achieved GPT-4 level alignment with human preferences (indeed, it's performing approximately 5% better on our benchmarks) and it is particularly optimized for Chinese language support.

We are very interested in integrating our model as an evaluation LLM within RAGAS. Additionally, we would be happy to support the use of LLM hosted on Alibaba Cloud's LLM serving platform, EAS, as extension to the current support of AWA, Azure, and Google Vertex AI.

Please let me know if these contributions could be included in RAGAS.

I look forward to your response.

Best regards,
Renjun

jjmachan · 2024-08-13T15:14:00Z

@hurenjun would love to explore this further too? what would be the best way forward you have in mind?

would the models be open-sourced by the way? I think it would be really helpful for the chinese userbase because we had a few model that were not supported/working as expected.

hurenjun · 2024-08-14T07:23:23Z

Hi @jjmachan, we could offer two methods, following the existing practices of ragas, for integrating our models:

We have supported users in accessing our judge models via an API key. We can integrate this approach into RAGAS to facilitate quick access for users, similar to the current OpenAI/Together model.
We will enable users to deploy our latest judge models on their own resources within Alibaba Cloud. We can assist in implementing support for Alibaba Cloud's EAS as an addition to the current integrated LLM services from Azure, AWS, and Vertex.

In addition, we will continue to iterate on our models and will open-source them as part of our contribution to the community. Currently, we have a paper under double-blind review. I will figure out the best way to do this without violating the anonymity requirement.

hurenjun · 2024-08-20T02:10:47Z

@jjmachan Hi, is there any feedback on the above proposal?

landhu · 2024-08-21T07:45:37Z

@hurenjun I think you can check https://docs.ragas.io/en/stable/howtos/customisations/bring-your-own-llm-or-embs.html
Also, aliyun API compatible with Openai. you could replace base_url.

hurenjun · 2024-08-22T07:26:33Z

@landhu Thank you for your comment.
That should work. And I am considering that ragas could maintain some third-party LLMs for evaluation and other purposes, like those on huggingface, in addition to the OpenAI models.

jjmachan · 2024-09-05T04:39:12Z

hey @hurenjun sorry about the delay but we are working on #1237 which will have tabs for popular LLM providers in the getting started page. This should make it easier to use.

In the mean time we can maybe write a specific notebook in the how to guides as well if that would help users too - what do you think?

hurenjun · 2024-09-06T04:27:44Z

@jjmachan That's gonna be very helpful, especially for developers in regions without directyt access to openai models.
Looking forward to the new feature.

jjmachan · 2024-09-06T04:42:20Z

will keep you posted 🙂

hurenjun added the enhancement New feature or request label Aug 12, 2024

hurenjun closed this as completed Aug 14, 2024

hurenjun reopened this Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

hurenjun commented Aug 12, 2024

jjmachan commented Aug 13, 2024

hurenjun commented Aug 14, 2024

hurenjun commented Aug 20, 2024

landhu commented Aug 21, 2024

hurenjun commented Aug 22, 2024

jjmachan commented Sep 5, 2024

hurenjun commented Sep 6, 2024

jjmachan commented Sep 6, 2024

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

Integrating third-party LLMs for Evaluating Chinese-native RAGs #1188

Comments

hurenjun commented Aug 12, 2024

jjmachan commented Aug 13, 2024

hurenjun commented Aug 14, 2024

hurenjun commented Aug 20, 2024

landhu commented Aug 21, 2024

hurenjun commented Aug 22, 2024

jjmachan commented Sep 5, 2024

hurenjun commented Sep 6, 2024

jjmachan commented Sep 6, 2024