
Commit 8a98394

Fixing some Docs link issue (#627)
Signed-off-by: Abukhoyer Shaik <[email protected]>
1 parent dd377ed commit 8a98394
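
Most of the link changes in this commit follow one pattern: a `source/` path segment is inserted after `https://quic.github.io/efficient-transformers/` in hosted-docs URLs. The Python sketch below illustrates that rewrite. The function name `add_source_prefix` and the regular expression are assumptions made for illustration and are not part of the commit. The sketch does not cover the other changes, such as the relative-path fixes and the `#text-only-language-models` anchor dropped in `examples/text_generation/README.md`.

```python
import re

# Base URL of the hosted docs, as it appears in the README links.
BASE = "https://quic.github.io/efficient-transformers/"

# Match a docs page directly under BASE that is not already under "source/".
# Any "#anchor" after ".html" lies outside the match and is left untouched.
OLD_LINK = re.compile(re.escape(BASE) + r"(?!source/)([\w\-]+\.html)")


def add_source_prefix(text: str) -> str:
    """Insert "source/" after BASE in every matching docs URL (hypothetical helper)."""
    return OLD_LINK.sub(lambda m: f"{BASE}source/{m.group(1)}", text)


if __name__ == "__main__":
    line = "- [Validated Audio Models](https://quic.github.io/efficient-transformers/validate.html#audio-models)"
    print(add_source_prefix(line))
```

Because of the negative lookahead, running the helper on a URL that already contains `source/` leaves it unchanged, so the rewrite can be applied repeatedly.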

10 files changed: +35 -32 lines changed

examples/audio/README.md

Lines changed: 3 additions & 3 deletions
@@ -82,6 +82,6 @@ This example:

 ## Documentation

-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Validated Audio Models](https://quic.github.io/efficient-transformers/validate.html#audio-models)
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Validated Audio Models](https://quic.github.io/efficient-transformers/source/validate.html#audio-models)
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html)

examples/embeddings/README.md

Lines changed: 3 additions & 3 deletions
@@ -66,6 +66,6 @@ The example supports different pooling strategies:

 ## Documentation

-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Validated Embedding Models](https://quic.github.io/efficient-transformers/validate.html#embedding-models)
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Validated Embedding Models](https://quic.github.io/efficient-transformers/source/validate.html#embedding-models)
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html)

examples/image_text_to_text/README.md

Lines changed: 1 addition & 1 deletion
@@ -109,4 +109,4 @@ Some models have specialized examples demonstrating advanced features:

 ## Documentation
 - **Full Guide**: [VLM Documentation](../../docs/source/quick_start.md#vision-language-models)
-- **API Reference**: [QEFFAutoModelForImageTextToText](../../docs/source/qeff_autoclasses.md)
+- **API Reference**: [QEFFAutoModelForImageTextToText](../../docs/source/qeff_autoclasses.md#QEFFAutoModelForImageTextToText)

examples/peft/README.md

Lines changed: 3 additions & 3 deletions
@@ -77,7 +77,7 @@ qeff_model.unload_adapter("adapter_name")

 ## Documentation

-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Validated Base Models](https://quic.github.io/efficient-transformers/validate.html#text-only-language-models)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Validated Base Models](https://quic.github.io/efficient-transformers/source/validate.html#text-only-language-models)
 - [PEFT Documentation](https://huggingface.co/docs/peft)
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html)
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html)

examples/performance/README.md

Lines changed: 6 additions & 3 deletions
@@ -37,6 +37,8 @@ python speculative_decoding/draft_based.py \
 --target-device-group 0,1 \
 --draft-device-group 2
 ```
+errors in this example
+

 #### prompt_lookup.py
 Prompt Lookup Decoding (PLD) - N-gram based speculation without a draft model.
@@ -57,6 +59,7 @@ Multi-projection speculative decoding (Turbo models).
 python speculative_decoding/multi_projection.py \
 --pretrained-model-name-or-path TinyLlama/TinyLlama-1.1B-Chat-v1.0
 ```
+error

 ### On-Device Sampling

@@ -102,6 +105,6 @@ python on_device_sampling.py \

 ## Documentation

-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Performance Features](https://quic.github.io/efficient-transformers/features_enablement.html)
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Performance Features](https://quic.github.io/efficient-transformers/source/features_enablement.html)
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html)

examples/performance/compute_context_length/README.md

Lines changed: 3 additions & 3 deletions
@@ -318,6 +318,6 @@ model = QEFFAutoModelForCausalLM.from_pretrained(

 ## Documentation

-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Performance Features](https://quic.github.io/efficient-transformers/features_enablement.html)
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Performance Features](https://quic.github.io/efficient-transformers/source/features_enablement.html)
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html)

examples/performance/cpp_execution/README.md

Lines changed: 1 addition & 1 deletion
@@ -24,7 +24,7 @@ make -j 8
 cd ../../../ # Need to be in base folder - efficient-transformers to run below cmd

 # Run the python script to get the generated text
-python examples/cpp_execution/text_inference_using_cpp.py --model_name gpt2 --batch_size 1 --prompt_len 32 --ctx_len 128 --mxfp6 --num_cores 14 --device_group [0] --prompt "My name is" --mos 1 --aic_enable_depth_first
+python examples/performance/cpp_execution/text_inference_cpp.py --model_name gpt2 --batch_size 1 --prompt_len 32 --ctx_len 128 --mxfp6 --num_cores 14 --device_group [0] --prompt "My name is" --mos 1 --aic_enable_depth_first

 ```

examples/performance/speculative_decoding/README.md

Lines changed: 3 additions & 3 deletions
@@ -176,6 +176,6 @@ Avg number of accepted tokens = 2.8 # Speculation effectiveness

 ## Documentation

-- [Speculative Decoding Guide](https://quic.github.io/efficient-transformers/features_enablement.html#speculative-decoding)
-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html)
-- [Performance Optimization](https://quic.github.io/efficient-transformers/features_enablement.html)
+- [Speculative Decoding Guide](https://quic.github.io/efficient-transformers/source/features_enablement.html#speculative-decoding)
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html)
+- [Performance Optimization](https://quic.github.io/efficient-transformers/source/features_enablement.html)

examples/text_generation/README.md

Lines changed: 12 additions & 12 deletions
@@ -144,7 +144,7 @@ python -m QEfficient.cloud.infer \
 2. Compiles to QPC
 3. Executes inference with your prompt

-**CLI API Reference:** [`QEfficient.cloud.infer`](https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-infer)
+**CLI API Reference:** [`QEfficient.cloud.infer`](https://quic.github.io/efficient-transformers/source/cli_api.html#qefficient-cloud-infer)

 ### Step-by-Step Workflow

@@ -162,7 +162,7 @@ python -m QEfficient.cloud.export \

 This downloads the model and converts it to ONNX format. The ONNX model is saved in the QEfficient cache directory.

-**CLI API Reference:** [`QEfficient.cloud.export`](https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-export)
+**CLI API Reference:** [`QEfficient.cloud.export`](https://quic.github.io/efficient-transformers/source/cli_api.html#qefficient-cloud-export)

 #### Step 2: Compile Model to QPC

@@ -184,7 +184,7 @@ python -m QEfficient.cloud.compile \

 **Note:** The `compile` API is deprecated for direct use. Use the unified `infer` API instead for most use cases.

-**CLI API Reference:** [`QEfficient.cloud.compile`](https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-compile)
+**CLI API Reference:** [`QEfficient.cloud.compile`](https://quic.github.io/efficient-transformers/source/cli_api.html#qefficient-cloud-compile)

 #### Step 3: Execute Inference

@@ -200,7 +200,7 @@ python -m QEfficient.cloud.execute \

 This uses the pre-compiled QPC for fast inference. You can run this multiple times with different prompts without recompiling.

-**CLI API Reference:** [`QEfficient.cloud.execute`](https://quic.github.io/efficient-transformers/cli_api.html#qefficient-cloud-execute)
+**CLI API Reference:** [`QEfficient.cloud.execute`](https://quic.github.io/efficient-transformers/source/cli_api.html#qefficient-cloud-execute)

 ### Common CLI Parameters

@@ -239,7 +239,7 @@ python -m QEfficient.cloud.infer \
 --aic_enable_depth_first
 ```

-**Documentation:** [Multi-Qranium Inference](https://quic.github.io/efficient-transformers/features_enablement.html#multi-qranium-inference)
+**Documentation:** [Multi-Qranium Inference](https://quic.github.io/efficient-transformers/source/features_enablement.html#multi-qranium-inference)

 #### Continuous Batching

@@ -260,7 +260,7 @@ python -m QEfficient.cloud.infer \

 **Note:** Use pipe (`|`) to separate multiple prompts. When using continuous batching, do not specify `--batch_size`.

-**Documentation:** [Continuous Batching](https://quic.github.io/efficient-transformers/features_enablement.html#continuous-batching)
+**Documentation:** [Continuous Batching](https://quic.github.io/efficient-transformers/source/features_enablement.html#continuous-batching)

 #### Batch Processing from File

@@ -284,7 +284,7 @@ python -m QEfficient.cloud.infer \
 For a comprehensive collection of copy-paste ready CLI commands, run:

 ```bash
-bash examples/text_generation/cli_examples.sh
+bash cli_examples.sh
 ```

 This script demonstrates:
@@ -300,11 +300,11 @@ This script demonstrates:
 ## Additional Resources

 ### Documentation
-- [CLI API Reference](https://quic.github.io/efficient-transformers/cli_api.html) - Complete CLI command documentation
-- [Quick Start Guide](https://quic.github.io/efficient-transformers/quick_start.html) - Getting started with QEfficient
-- [Features Enablement](https://quic.github.io/efficient-transformers/features_enablement.html) - Advanced features guide
-- [QEff Auto Classes](https://quic.github.io/efficient-transformers/qeff_autoclasses.html) - Python API reference
-- [Validated Models](https://quic.github.io/efficient-transformers/validate.html#text-only-language-models) - Supported models list
+- [CLI API Reference](https://quic.github.io/efficient-transformers/source/cli_api.html) - Complete CLI command documentation
+- [Quick Start Guide](https://quic.github.io/efficient-transformers/source/quick_start.html) - Getting started with QEfficient
+- [Features Enablement](https://quic.github.io/efficient-transformers/source/features_enablement.html) - Advanced features guide
+- [QEff Auto Classes](https://quic.github.io/efficient-transformers/source/qeff_autoclasses.html) - Python API reference
+- [Validated Models](https://quic.github.io/efficient-transformers/source/validate.html) - Supported models list


 ### Model Storage
