| 1 | +# Golden Path Workflows Guide |
| 2 | + |
| 3 | +This guide provides detailed step-by-step workflows for getting the most out of the Content Processing Solution Accelerator. These "golden path" workflows represent the most common and effective use cases for the solution. |
| 4 | + |
| 5 | +## Overview |
| 6 | + |
| 7 | +The golden path workflows are designed to: |
| 8 | +- Demonstrate the full capabilities of the solution |
| 9 | +- Provide a structured learning experience |
| 10 | +- Showcase best practices for document processing |
| 11 | +- Help users understand the confidence scoring and validation features |
| 12 | + |
| 13 | +## Workflow 1: Invoice Processing Golden Path |
| 14 | + |
| 15 | +### 📋 Prerequisites |
| 16 | +- Solution deployed and validated successfully |
| 17 | +- Sample schemas registered (Invoice schema) |
| 18 | +- Authentication configured |
| 19 | + |
| 20 | +### 🚀 Step-by-Step Process |
| 21 | + |
| 22 | +1. **Access the Web Interface** |
| 23 | + - Navigate to your deployed web app URL |
| 24 | + - Log in using your configured authentication |
| 25 | + |
| 26 | +2. **Select Invoice Schema** |
| 27 | + - In the Processing Queue pane, select "Invoice" from the schema dropdown |
| 28 | + - Verify the schema shows as available |
| 29 | + |
| 30 | +3. **Upload Sample Invoice** |
| 31 | + - Click the "Import Content" button |
| 32 | + - Select an invoice file from the sample data (PDF, PNG, or JPEG) |
| 33 | + - Click "Upload" to submit |
| 34 | + |
| 35 | +4. **Monitor Processing** |
| 36 | + - Watch the file status change from "Uploaded" → "Processing" → "Completed" |
| 37 | + - This typically takes 1-2 minutes |
| 38 | + |
| 39 | +5. **Review Extracted Data** |
| 40 | + - Click on the completed file to open the review interface |
| 41 | + - Examine the extracted data in the "Extracted Results" tab |
| 42 | + - Compare with the source document in the "Source Document" pane |
| 43 | + |
| 44 | +6. **Validate and Modify Results** |
| 45 | + - Edit any incorrect data in the JSON output |
| 46 | + - Add notes in the "Comments" section |
| 47 | + - Pay attention to confidence scores for each field |
| 48 | + |
| 49 | +7. **Save and Approve** |
| 50 | + - Click "Save" to store your modifications |
| 51 | + - Review the process steps in the "Process Steps" tab |
| 52 | + |
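For readers who want to automate step 4 rather than watch the queue, the status progression above ("Uploaded" → "Processing" → "Completed") can be sketched as a polling loop. This is a minimal sketch, not the solution's API: `wait_for_completion` and its parameters are hypothetical names, and the `get_status` callable stands in for whatever call your integration uses to read a file's status. Only the three status strings come from this guide.

```python
import time
from typing import Callable

# Status names are taken from this guide; how a status is fetched is left to the caller.
KNOWN_STATUSES = ("Uploaded", "Processing", "Completed")
TERMINAL_STATUS = "Completed"


def wait_for_completion(
    get_status: Callable[[], str],          # hypothetical: returns the current status string
    timeout_s: float = 300.0,               # assumed upper bound, not a documented limit
    interval_s: float = 5.0,                # assumed polling interval
    sleep: Callable[[float], None] = time.sleep,
) -> list[str]:
    """Poll get_status until it reports "Completed".

    Returns the distinct statuses observed, in order. Raises TimeoutError if the
    deadline passes first and ValueError on a status this guide does not mention.
    """
    seen: list[str] = []
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status()
        if status not in KNOWN_STATUSES:
            raise ValueError(f"unexpected status: {status!r}")
        if not seen or seen[-1] != status:
            seen.append(status)  # record each transition once
        if status == TERMINAL_STATUS:
            return seen
        if time.monotonic() >= deadline:
            raise TimeoutError(f"still {status!r} after {timeout_s} seconds")
        sleep(interval_s)


# Demo with a canned sequence in place of a real status lookup.
canned = iter(["Uploaded", "Processing", "Processing", "Completed"])
history = wait_for_completion(lambda: next(canned), sleep=lambda _seconds: None)
print(" -> ".join(history))
```

Passing the status lookup and the sleep function as arguments keeps the loop independent of any particular client library and makes it easy to test without waiting.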
| 53 | +### 🎯 Expected Outcomes |
| 54 | +- ✅ Invoice data accurately extracted (vendor, amounts, dates, line items) |
| 55 | +- ✅ Confidence scores above 80% for most fields |
| 56 | +- ✅ Any low-confidence fields flagged for manual review |
| 57 | +- ✅ Process steps show successful extraction, mapping, and evaluation |
| 58 | + |
| 59 | +## Workflow 2: Property Claims Golden Path |
| 60 | + |
| 61 | +### 📋 Prerequisites |
| 62 | +- Invoice workflow completed successfully |
| 63 | +- Property Loss Damage Claim Form schema registered |
| 64 | + |
| 65 | +### 🚀 Step-by-Step Process |
| 66 | + |
| 67 | +1. **Switch to Property Claims Schema** |
| 68 | + - Select "Property Loss Damage Claim Form" from the schema dropdown |
| 69 | + |
| 70 | +2. **Upload Property Damage Document** |
| 71 | + - Import a property claim form from the sample data |
| 72 | + - Monitor the processing workflow |
| 73 | + |
| 74 | +3. **Validate Complex Extraction** |
| 75 | + - Review extracted claim details, damages, and policy information |
| 76 | + - Note how the system handles form fields vs. free text |
| 77 | + |
| 78 | +4. **Test Validation Features** |
| 79 | + - Modify extracted data to test validation rules |
| 80 | + - Add detailed comments about damage assessments |
| 81 | + |
| 82 | +5. **Process Multiple Documents** |
| 83 | + - Upload additional property claim documents |
| 84 | + - Compare extraction accuracy across different document formats |
| 85 | + |
| 86 | +### 🎯 Expected Outcomes |
| 87 | +- ✅ Complex form data accurately extracted |
| 88 | +- ✅ Multi-modal content (text, images, tables) processed correctly |
| 89 | +- ✅ Validation rules applied appropriately |
| 90 | + |
| 91 | +## Workflow 3: Custom Document Processing Golden Path |
| 92 | + |
| 93 | +### 📋 Prerequisites |
| 94 | +- Basic workflows completed |
| 95 | +- Understanding of your specific document types |
| 96 | + |
| 97 | +### 🚀 Step-by-Step Process |
| 98 | + |
| 99 | +1. **Create Custom Schema** |
| 100 | + - Follow the [Custom Schema Guide](./CustomizeSchemaData.md) |
| 101 | + - Define your document structure and required fields |
| 102 | + |
| 103 | +2. **Register Your Schema** |
| 104 | + - Use the schema registration scripts |
| 105 | + - Verify that the schema is available in the web interface |
| 106 | + |
| 107 | +3. **Test with Sample Documents** |
| 108 | + - Start with 2-3 representative documents |
| 109 | + - Process and review initial results |
| 110 | + |
| 111 | +4. **Refine Extraction Quality** |
| 112 | + - Analyze confidence scores and accuracy |
| 113 | + - Modify schema definitions if needed |
| 114 | + - Re-test with updated schema |
| 115 | + |
| 116 | +5. **Scale to Production** |
| 117 | + - Process larger document batches |
| 118 | + - Establish quality thresholds |
| 119 | + - Set up automated workflows using the API |
| 120 | + |
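One way to make "establish quality thresholds" in step 5 concrete is a simple batch gate: a document passes only if every extracted field meets a minimum confidence. The sketch below is illustrative only. `evaluate_batch`, `BatchReport`, the document names, and the field names are hypothetical, scores are assumed to be percentages, and the 80% default is a placeholder to tune against your own documents.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BatchReport:
    total: int               # documents in the batch
    passed: int              # documents whose every field met the threshold
    needs_review: list[str]  # document ids routed to manual review

    @property
    def pass_rate(self) -> float:
        # An empty batch is reported as 0.0 rather than dividing by zero.
        return self.passed / self.total if self.total else 0.0


def evaluate_batch(
    batch: dict[str, dict[str, float]],  # document id -> {field name -> confidence in percent}
    min_score: float = 80.0,             # assumed threshold, not a product default
) -> BatchReport:
    """Flag every document that has no scores or has any field below min_score."""
    needs_review = [
        doc_id
        for doc_id, scores in batch.items()
        if not scores or min(scores.values()) < min_score
    ]
    return BatchReport(
        total=len(batch),
        passed=len(batch) - len(needs_review),
        needs_review=needs_review,
    )


# Hypothetical batch of three processed documents.
report = evaluate_batch(
    {
        "claim-001.pdf": {"policy_number": 96.0, "loss_date": 91.5},
        "claim-002.pdf": {"policy_number": 88.0, "loss_date": 62.0},
        "claim-003.pdf": {"policy_number": 93.0, "loss_date": 85.0},
    }
)
print(report.needs_review, f"{report.pass_rate:.0%}")
```

Gating on the weakest field is a deliberately strict choice. A per-field threshold may suit schemas where some fields matter more than others.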
| 121 | +### 🎯 Expected Outcomes |
| 122 | +- ✅ Custom schema accurately processes your document types |
| 123 | + - ✅ Confidence scoring identifies fields that need manual review |
| 124 | +- ✅ Workflow scales to handle production volumes |
| 125 | + |
| 126 | +## Advanced Workflows |
| 127 | + |
| 128 | +### Multi-Schema Processing |
| 129 | +- Process different document types in the same session |
| 130 | +- Compare extraction approaches across schemas |
| 131 | +- Understand when to use different processing strategies |
| 132 | + |
| 133 | +### API Integration Golden Path |
| 134 | +- Use programmatic APIs for document submission |
| 135 | +- Implement webhook callbacks for processing notifications |
| 136 | +- Build custom validation and approval workflows |
| 137 | + |
| 138 | +### Batch Processing Workflow |
| 139 | +- Upload multiple documents simultaneously |
| 140 | +- Monitor batch processing status |
| 141 | +- Export results for downstream systems |
| 142 | + |
| 143 | +## Best Practices |
| 144 | + |
| 145 | +### Quality Assurance |
| 146 | +- Always review low-confidence extractions manually |
| 147 | +- Use comments to document validation decisions |
| 148 | +- Track accuracy improvements over time |
| 149 | + |
| 150 | +### Confidence Score Interpretation |
| 151 | +- **90-100%**: High confidence, likely accurate |
| 152 | +- **70-89%**: Medium confidence, review recommended |
| 153 | +- **Below 70%**: Low confidence, manual review required |
| 154 | + |
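The three bands above can be expressed as a small helper for triaging extracted fields. This is a sketch under stated assumptions: the function names and sample field names are hypothetical, scores are assumed to be percentages from 0 to 100, and fractional values between 89 and 90 are treated as medium because the list does not say where they fall.

```python
def confidence_band(score: float) -> str:
    """Map a confidence score in percent (0-100) to a review band."""
    if not 0 <= score <= 100:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score >= 90:
        return "high"    # 90-100%: likely accurate
    if score >= 70:
        return "medium"  # 70-89%: review recommended
    return "low"         # below 70%: manual review required


def fields_needing_review(scores: dict[str, float]) -> dict[str, str]:
    """Return every field outside the high band, with the band it fell into."""
    bands = {name: confidence_band(score) for name, score in scores.items()}
    return {name: band for name, band in bands.items() if band != "high"}


# Hypothetical per-field scores for one processed invoice.
sample = {"vendor_name": 97.0, "invoice_total": 82.5, "due_date": 55.0}
print(fields_needing_review(sample))
```

If your deployment reports confidence as a fraction between 0 and 1, multiply by 100 before calling `confidence_band`.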
| 155 | +### Performance Optimization |
| 156 | +- Use consistent document formats when possible |
| 157 | +- Ensure good image quality for scanned documents |
| 158 | +- Batch similar document types for better consistency |
| 159 | + |
| 160 | +## Troubleshooting Common Issues |
| 161 | + |
| 162 | +### Low Extraction Accuracy |
| 163 | +- Check document quality and formatting |
| 164 | +- Verify schema matches document structure |
| 165 | +- Review and update system prompts if needed |
| 166 | + |
| 167 | +### Processing Timeouts |
| 168 | +- Reduce document file sizes |
| 169 | +- Check Azure quota availability |
| 170 | +- Monitor system logs for errors |
| 171 | + |
| 172 | +### Authentication Issues |
| 173 | +- Verify app registration configuration |
| 174 | +- Check user permissions and role assignments |
| 175 | +- Review authentication provider settings |
| 176 | + |
| 177 | +## Next Steps |
| 178 | + |
| 179 | +After completing these golden path workflows: |
| 180 | + |
| 181 | +1. **Explore Advanced Features** |
| 182 | + - Custom validation rules |
| 183 | + - Webhook integrations |
| 184 | + - Batch processing APIs |
| 185 | + |
| 186 | +2. **Integrate with Your Systems** |
| 187 | + - Connect to downstream databases |
| 188 | + - Set up automated workflows |
| 189 | + - Implement custom business logic |
| 190 | + |
| 191 | +3. **Scale Your Solution** |
| 192 | + - Monitor performance metrics |
| 193 | + - Optimize for your specific use cases |
| 194 | + - Plan for production deployment |
| 195 | + |
| 196 | +## Support and Resources |
| 197 | + |
| 198 | +- **Technical Documentation**: [API Guide](./API.md) |
| 199 | +- **Troubleshooting**: [Common Issues](./TroubleShootingSteps.md) |
| 200 | +- **Sample Data**: [Download samples](../src/ContentProcessorAPI/samples) |
| 201 | +- **Community**: [Submit issues](https://github.com/microsoft/content-processing-solution-accelerator/issues) |
| 202 | + |
| 203 | +--- |
| 204 | + |
| 205 | +*This guide is based on the golden path workflows in the automated test suite, which validate the core functionality of the solution.* |