EmbeddedLLM
/

Phi-3-mini-128k-instruct-onnx-directml

Text Generation

Model card Files Files and versions Community

Phi-3-mini-128k-instruct-onnx-directml / README.md

ssyok's picture

Update README.md

f595e99 verified 3 months ago

|

history blame contribute delete

853 Bytes

	---
	pipeline_tag: text-generation
	tags:
	- ONNX
	- DML
	- ONNXRuntime
	- phi3
	- nlp
	- conversational
	- custom_code
	inference: false
	language:
	- en
	---
	# EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml

	## Performance Metrics

	<!-- These are the evaluation metrics being used, ideally with a description of why. -->
	### DirectML
	We measured the performance of DirectML on AMD Ryzen 9 7940HS /w Radeon 78

	\| Prompt Length \| Generation Length \| Average Throughput (tps) \|
	\|---------------------------\|-------------------\|-----------------------------\|
	\| 128 \| 128 \| - \|
	\| 128 \| 256 \| - \|
	\| 128 \| 512 \| - \|
	\| 128 \| 1024 \| - \|
	\| 256 \| 128 \| - \|
	\| 256 \| 256 \| - \|
	\| 256 \| 512 \| - \|
	\| 256 \| 1024 \| - \|
	\| 512 \| 128 \| - \|
	\| 512 \| 256 \| - \|
	\| 512 \| 512 \| - \|
	\| 512 \| 1024 \| - \|
	\| 1024 \| 128 \| - \|
	\| 1024 \| 256 \| - \|
	\| 1024 \| 512 \| - \|
	\| 1024 \| 1024 \| - \|