This makes it easy to pass requests straight through to the underlying LLM API. For example, to route to vLLM's `/classify` endpoint:
## SDK (Basic)

```python
import litellm

response = litellm.llm_passthrough_route(
    model="hosted_vllm/papluca/xlm-roberta-base-language-detection",
    method="POST",
    endpoint="classify",  # forwarded as {api_base}/classify
    api_base="http://localhost:8090",
    api_key=None,
    json={
        "model": "swapped-for-litellm-model",  # placeholder; LiteLLM swaps in the actual model name
        "input": "Hello, world!",
    },
)
print(response)
```
## SDK (Router)

```python
import asyncio

from litellm import Router

router = Router(
    model_list=[
        {
            "model_name": "roberta-base-language-detection",
            "litellm_params": {
                "model": "hosted_vllm/papluca/xlm-roberta-base-language-detection",
                "api_base": "http://localhost:8090",
            },
        }
    ]
)

request_data = {
    "model": "roberta-base-language-detection",  # the model_name from model_list
    "method": "POST",
    "endpoint": "classify",
    "api_base": "http://localhost:8090",
    "api_key": None,
    "json": {
        "model": "roberta-base-language-detection",
        "input": "Hello, world!",
    },
}

async def main():
    response = await router.allm_passthrough_route(**request_data)
    print(response)

if __name__ == "__main__":
    asyncio.run(main())
```
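The Router is most useful when several deployments sit behind one `model_name`, since it load-balances requests across them. A minimal sketch, assuming a second vLLM server on port 8091 (that second endpoint is hypothetical):

```python
from litellm import Router

# Two deployments share one model_name; the Router picks one per request.
router = Router(
    model_list=[
        {
            "model_name": "roberta-base-language-detection",
            "litellm_params": {
                "model": "hosted_vllm/papluca/xlm-roberta-base-language-detection",
                "api_base": "http://localhost:8090",
            },
        },
        {
            "model_name": "roberta-base-language-detection",
            "litellm_params": {
                "model": "hosted_vllm/papluca/xlm-roberta-base-language-detection",
                "api_base": "http://localhost:8091",  # hypothetical second deployment
            },
        },
    ]
)
```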
## PROXY

1. Set up config.yaml

```yaml
model_list:
  - model_name: roberta-base-language-detection
    litellm_params:
      model: hosted_vllm/papluca/xlm-roberta-base-language-detection
      api_base: http://localhost:8090
```
2. Run the proxy

```bash
litellm --config config.yaml

# RUNNING on http://localhost:4000
```
3. Use the proxy

```bash
curl -X POST http://localhost:4000/vllm/classify \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <your-api-key>" \
  -d '{"model": "roberta-base-language-detection", "input": "Hello, world!"}'
```
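The same request from Python, via the `requests` library (a sketch; fill in your own proxy key):

```python
import requests

# Mirrors the curl call above, sent through the LiteLLM proxy.
response = requests.post(
    "http://localhost:4000/vllm/classify",
    headers={"Authorization": "Bearer <your-api-key>"},
    json={
        "model": "roberta-base-language-detection",
        "input": "Hello, world!",
    },
)
print(response.json())
```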
# How to add a provider for passthrough

See [VLLMModelInfo](https://github.com/BerriAI/litellm/blob/main/litellm/llms/vllm/common_utils.py) for an example.

1. Inherit from `BaseLLMModelInfo`

```python
from litellm.llms.base_llm.base_utils import BaseLLMModelInfo

class VLLMModelInfo(BaseLLMModelInfo):
    pass
```
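A bare `pass` is only the starting point: a real provider fills in the methods `BaseLLMModelInfo` expects. The linked `VLLMModelInfo` source shows a complete implementation.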
2. Register the provider in `ProviderConfigManager.get_provider_model_info`. Once registered, the lookup should resolve your provider's model info:

```python
from litellm.types.utils import LlmProviders
from litellm.utils import ProviderConfigManager

provider_config = ProviderConfigManager.get_provider_model_info(
    model="my-test-model", provider=LlmProviders.VLLM
)
print(provider_config)
```
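A quick sanity check (our own addition, assuming the manager returns `None` for unregistered providers):

```python
# A non-None result confirms the provider was registered.
assert provider_config is not None, "VLLM provider model info is not registered"
```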