@@ -144,7 +144,7 @@ python -m QEfficient.cloud.infer \
1441442 . Compiles to QPC
1451453 . Executes inference with your prompt
146146
147- ** CLI API Reference:** [ ` QEfficient.cloud.infer ` ] ( https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-infer )
147+ ** CLI API Reference:** [ ` QEfficient.cloud.infer ` ] ( https://quic.github.io/efficient-transformers/source/ cli_api.html#qefficient-cloud-infer )
148148
149149### Step-by-Step Workflow
150150
@@ -162,7 +162,7 @@ python -m QEfficient.cloud.export \
162162
163163This downloads the model and converts it to ONNX format. The ONNX model is saved in the QEfficient cache directory.
164164
165- ** CLI API Reference:** [ ` QEfficient.cloud.export ` ] ( https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-export )
165+ ** CLI API Reference:** [ ` QEfficient.cloud.export ` ] ( https://quic.github.io/efficient-transformers/source/ cli_api.html#qefficient-cloud-export )
166166
167167#### Step 2: Compile Model to QPC
168168
@@ -184,7 +184,7 @@ python -m QEfficient.cloud.compile \
184184
185185** Note:** The ` compile ` API is deprecated for direct use. Use the unified ` infer ` API instead for most use cases.
186186
187- ** CLI API Reference:** [ ` QEfficient.cloud.compile ` ] ( https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-compile )
187+ ** CLI API Reference:** [ ` QEfficient.cloud.compile ` ] ( https://quic.github.io/efficient-transformers/source/ cli_api.html#qefficient-cloud-compile )
188188
189189#### Step 3: Execute Inference
190190
@@ -200,7 +200,7 @@ python -m QEfficient.cloud.execute \
200200
201201This uses the pre-compiled QPC for fast inference. You can run this multiple times with different prompts without recompiling.
202202
203- ** CLI API Reference:** [ ` QEfficient.cloud.execute ` ] ( https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-execute )
203+ ** CLI API Reference:** [ ` QEfficient.cloud.execute ` ] ( https://quic.github.io/efficient-transformers/source/ cli_api.html#qefficient-cloud-execute )
204204
205205### Common CLI Parameters
206206
@@ -239,7 +239,7 @@ python -m QEfficient.cloud.infer \
239239 --aic_enable_depth_first
240240```
241241
242- ** Documentation:** [ Multi-Qranium Inference] ( https://quic.github.io/efficient-transformers/features_enablement.html#multi-qranium-inference )
242+ ** Documentation:** [ Multi-Qranium Inference] ( https://quic.github.io/efficient-transformers/source/ features_enablement.html#multi-qranium-inference )
243243
244244#### Continuous Batching
245245
@@ -260,7 +260,7 @@ python -m QEfficient.cloud.infer \
260260
261261** Note:** Use pipe (` | ` ) to separate multiple prompts. When using continuous batching, do not specify ` --batch_size ` .
262262
263- ** Documentation:** [ Continuous Batching] ( https://quic.github.io/efficient-transformers/features_enablement.html#continuous-batching )
263+ ** Documentation:** [ Continuous Batching] ( https://quic.github.io/efficient-transformers/source/ features_enablement.html#continuous-batching )
264264
265265#### Batch Processing from File
266266
@@ -284,7 +284,7 @@ python -m QEfficient.cloud.infer \
284284For a comprehensive collection of copy-paste ready CLI commands, run:
285285
286286``` bash
287- bash examples/text_generation/ cli_examples.sh
287+ bash cli_examples.sh
288288```
289289
290290This script demonstrates:
@@ -300,11 +300,11 @@ This script demonstrates:
300300## Additional Resources
301301
302302### Documentation
303- - [ CLI API Reference] ( https://quic.github.io/efficient-transformers/cli_api.html ) - Complete CLI command documentation
304- - [ Quick Start Guide] ( https://quic.github.io/efficient-transformers/quick_start.html ) - Getting started with QEfficient
305- - [ Features Enablement] ( https://quic.github.io/efficient-transformers/features_enablement.html ) - Advanced features guide
306- - [ QEff Auto Classes] ( https://quic.github.io/efficient-transformers/qeff_autoclasses.html ) - Python API reference
307- - [ Validated Models] ( https://quic.github.io/efficient-transformers/validate.html#text-only-language-models ) - Supported models list
303+ - [ CLI API Reference] ( https://quic.github.io/efficient-transformers/source/ cli_api.html ) - Complete CLI command documentation
304+ - [ Quick Start Guide] ( https://quic.github.io/efficient-transformers/source/ quick_start.html ) - Getting started with QEfficient
305+ - [ Features Enablement] ( https://quic.github.io/efficient-transformers/source/ features_enablement.html ) - Advanced features guide
306+ - [ QEff Auto Classes] ( https://quic.github.io/efficient-transformers/source/ qeff_autoclasses.html ) - Python API reference
307+ - [ Validated Models] ( https://quic.github.io/efficient-transformers/source/ validate.html ) - Supported models list
308308
309309
310310### Model Storage
0 commit comments