Issues: intel/llm-on-ray
[Benchmark] Load config from YAML and output results in multiple formats (#82, opened Jan 24, 2024 by xwu99)
[DOC] Add a coding style guide and code-formatting instructions to CONTRIBUTING.md (#46, opened Jan 10, 2024 by xwu99)
[Serving] Add a table of models and corresponding supported parameters (#51, opened Jan 11, 2024 by KepingYan)
Unable to run the inference server for the Mistral-7B and MPT-7B models on Ray (#65, opened Jan 18, 2024 by dkiran1)
Error when running query_openai_sdk.py to test inference (#66, opened Jan 18, 2024 by dkiran1)
[Quantization] Support loading AWQ, GPTQ, and GGUF/GGML quantized models (#85, opened Jan 26, 2024 by xwu99)
Support and validate the Mixtral-8x7B model (#119, enhancement, opened Feb 23, 2024 by carsonwang)
Support functions/tools in the OpenAI API (#121, enhancement, opened Feb 23, 2024 by carsonwang)
Add an ipex extra to pyproject.toml to use the restricted transformers version (#127, opened Feb 29, 2024 by jiafuzha)
OpenAI API does not allow temperature=0.0 for llama-2-7b-chat-hf (#139, opened Mar 12, 2024 by yutianchen666)
Calculate the correct input length for every prompt in a single batch (#222, opened May 14, 2024 by kira-lin)