Skip to content

Commit fd8f623

Browse files
Merge pull request #325 from microsoft/psl-strengthen-deployment
docs: Updated the deployment guide & updated stale images
2 parents a69ffdb + 5465e80 commit fd8f623

File tree

7 files changed

+489
-59
lines changed

7 files changed

+489
-59
lines changed

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -87,6 +87,8 @@ Follow the quick deploy steps on the deployment guide to deploy this solution
8787
> ⚠️ **Important: Check Azure OpenAI Quota Availability**
8888
<br/>To ensure sufficient quota is available in your subscription, please follow [quota check instructions guide](./docs/quota_check.md) before you deploy the solution.
8989
90+
> 🛠️ **Need Help?** Check our [Troubleshooting Guide](./docs/TroubleShootingSteps.md) for solutions to common deployment issues.
91+
9092
<br/>
9193

9294
### Prerequisites and costs
@@ -121,6 +123,8 @@ Use the [Azure pricing calculator](https://azure.microsoft.com/en-us/pricing/cal
121123
>⚠️ **Important:** To avoid unnecessary costs, remember to take down your app if it's no longer in use,
122124
either by deleting the resource group in the Portal or running `azd down`.
123125

126+
For detailed cost estimation and pricing information, see the [Deployment Guide](./docs/DeploymentGuide.md).
127+
124128
<br /><br />
125129
<h2><img src="./docs/images/readme/business-scenario.png" width="48" />
126130
Business scenario

docs/DeploymentGuide.md

Lines changed: 280 additions & 59 deletions
Large diffs are not rendered by default.

docs/GoldenPathWorkflows.md

Lines changed: 205 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,205 @@
1+
# Golden Path Workflows Guide
2+
3+
This guide provides detailed step-by-step workflows for getting the most out of the Content Processing Solution Accelerator. These "golden path" workflows represent the most common and effective use cases for the solution.
4+
5+
## Overview
6+
7+
The golden path workflows are designed to:
8+
- Demonstrate the full capabilities of the solution
9+
- Provide a structured learning experience
10+
- Showcase best practices for document processing
11+
- Help users understand the confidence scoring and validation features
12+
13+
## Workflow 1: Invoice Processing Golden Path
14+
15+
### 📋 Prerequisites
16+
- Solution deployed and validated successfully
17+
- Sample schemas registered (Invoice schema)
18+
- Authentication configured
19+
20+
### 🚀 Step-by-Step Process
21+
22+
1. **Access the Web Interface**
23+
- Navigate to your deployed web app URL
24+
- Log in using your configured authentication
25+
26+
2. **Select Invoice Schema**
27+
- In the Processing Queue pane, select "Invoice" from the schema dropdown
28+
- Verify the schema shows as available
29+
30+
3. **Upload Sample Invoice**
31+
- Click "Import Content" button
32+
- Select an invoice file from the sample data (PDF, PNG, or JPEG)
33+
- Click "Upload" to submit
34+
35+
4. **Monitor Processing**
36+
- Watch the file status change from "Uploaded" → "Processing" → "Completed"
37+
- This typically takes 1-2 minutes
38+
39+
5. **Review Extracted Data**
40+
- Click on the completed file to open the review interface
41+
- Examine the extracted data in the "Extracted Results" tab
42+
- Compare with the source document in the "Source Document" pane
43+
44+
6. **Validate and Modify Results**
45+
- Edit any incorrect data in the JSON output
46+
- Add notes in the "Comments" section
47+
- Pay attention to confidence scores for each field
48+
49+
7. **Save and Approve**
50+
- Click "Save" to store your modifications
51+
- Review the process steps in the "Process Steps" tab
52+
53+
### 🎯 Expected Outcomes
54+
- ✅ Invoice data accurately extracted (vendor, amounts, dates, line items)
55+
- ✅ Confidence scores above 80% for most fields
56+
- ✅ Any low-confidence fields flagged for manual review
57+
- ✅ Process steps show successful extraction, mapping, and evaluation
58+
59+
## Workflow 2: Property Claims Golden Path
60+
61+
### 📋 Prerequisites
62+
- Invoice workflow completed successfully
63+
- Property Loss Damage Claim Form schema registered
64+
65+
### 🚀 Step-by-Step Process
66+
67+
1. **Switch to Property Claims Schema**
68+
- Select "Property Loss Damage Claim Form" from the schema dropdown
69+
70+
2. **Upload Property Damage Document**
71+
- Import a property claim form from the sample data
72+
- Monitor the processing workflow
73+
74+
3. **Validate Complex Extraction**
75+
- Review extracted claim details, damages, and policy information
76+
- Note how the system handles form fields vs. free text
77+
78+
4. **Test Validation Features**
79+
- Modify extracted data to test validation rules
80+
- Add detailed comments about damage assessments
81+
82+
5. **Process Multiple Documents**
83+
- Upload additional property claim documents
84+
- Compare extraction accuracy across different document formats
85+
86+
### 🎯 Expected Outcomes
87+
- ✅ Complex form data accurately extracted
88+
- ✅ Multi-modal content (text, images, tables) processed correctly
89+
- ✅ Validation rules applied appropriately
90+
91+
## Workflow 3: Custom Document Processing Golden Path
92+
93+
### 📋 Prerequisites
94+
- Basic workflows completed
95+
- Understanding of your specific document types
96+
97+
### 🚀 Step-by-Step Process
98+
99+
1. **Create Custom Schema**
100+
- Follow the [Custom Schema Guide](./CustomizeSchemaData.md)
101+
- Define your document structure and required fields
102+
103+
2. **Register Your Schema**
104+
- Use the schema registration scripts
105+
- Validate schema is available in the web interface
106+
107+
3. **Test with Sample Documents**
108+
- Start with 2-3 representative documents
109+
- Process and review initial results
110+
111+
4. **Refine Extraction Quality**
112+
- Analyze confidence scores and accuracy
113+
- Modify schema definitions if needed
114+
- Re-test with updated schema
115+
116+
5. **Scale to Production**
117+
- Process larger document batches
118+
- Establish quality thresholds
119+
- Set up automated workflows using the API
120+
121+
### 🎯 Expected Outcomes
122+
- ✅ Custom schema accurately processes your document types
123+
- ✅ Confidence scoring helps identify manual review needs
124+
- ✅ Workflow scales to handle production volumes
125+
126+
## Advanced Workflows
127+
128+
### Multi-Schema Processing
129+
- Process different document types in the same session
130+
- Compare extraction approaches across schemas
131+
- Understand when to use different processing strategies
132+
133+
### API Integration Golden Path
134+
- Use programmatic APIs for document submission
135+
- Implement webhook callbacks for processing notifications
136+
- Build custom validation and approval workflows
137+
138+
### Batch Processing Workflow
139+
- Upload multiple documents simultaneously
140+
- Monitor batch processing status
141+
- Export results for downstream systems
142+
143+
## Best Practices
144+
145+
### Quality Assurance
146+
- Always review low-confidence extractions manually
147+
- Use comments to document validation decisions
148+
- Track accuracy improvements over time
149+
150+
### Confidence Score Interpretation
151+
- **90-100%**: High confidence, likely accurate
152+
- **70-89%**: Medium confidence, review recommended
153+
- **Below 70%**: Low confidence, manual review required
154+
155+
### Performance Optimization
156+
- Use consistent document formats when possible
157+
- Ensure good image quality for scanned documents
158+
- Batch similar document types for better consistency
159+
160+
## Troubleshooting Common Issues
161+
162+
### Low Extraction Accuracy
163+
- Check document quality and formatting
164+
- Verify schema matches document structure
165+
- Review and update system prompts if needed
166+
167+
### Processing Timeouts
168+
- Reduce document file sizes
169+
- Check Azure quota availability
170+
- Monitor system logs for errors
171+
172+
### Authentication Issues
173+
- Verify app registration configuration
174+
- Check user permissions and role assignments
175+
- Review authentication provider settings
176+
177+
## Next Steps
178+
179+
After completing these golden path workflows:
180+
181+
1. **Explore Advanced Features**
182+
- Custom validation rules
183+
- Webhook integrations
184+
- Batch processing APIs
185+
186+
2. **Integrate with Your Systems**
187+
- Connect to downstream databases
188+
- Set up automated workflows
189+
- Implement custom business logic
190+
191+
3. **Scale Your Solution**
192+
- Monitor performance metrics
193+
- Optimize for your specific use cases
194+
- Plan for production deployment
195+
196+
## Support and Resources
197+
198+
- **Technical Documentation**: [API Guide](./API.md)
199+
- **Troubleshooting**: [Common Issues](./TroubleShootingSteps.md)
200+
- **Sample Data**: [Download samples](../src/ContentProcessorAPI/samples)
201+
- **Community**: [Submit issues](https://github.com/microsoft/content-processing-solution-accelerator/issues)
202+
203+
---
204+
205+
*This guide is based on the automated test suite golden path workflows that validate the core functionality of the solution.*
-45.5 KB
Loading
-48.8 KB
Loading
-140 KB
Loading
-84.2 KB
Loading

0 commit comments

Comments
 (0)