AI-Accelerated Data Lineage & Certification #148

@akkhil2012

Objective

AI-Accelerated Data Lineage & Certification

Visualize data trust in real time.
Generate interactive, GPU-accelerated lineage graphs that dynamically reveal data flow, dependencies, and certification status across systems.

Embed intelligent ML agents.
Use Isolation Forests for anomaly detection and MPNet-based embeddings for semantic similarity to automatically uncover duplicates, drift, and lineage gaps.
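To make the semantic-similarity half of this concrete, here is a minimal sketch of duplicate detection by cosine similarity over embedding vectors. It uses only the standard library. The three-dimensional vectors are made-up stand-ins for real MPNet embeddings, and the dataset names, the `find_duplicate_candidates` helper, and the 0.95 threshold are illustrative assumptions, not part of the proposal. The Isolation Forest step is not shown.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def find_duplicate_candidates(embeddings, threshold=0.95):
    """Return (name_a, name_b, score) for pairs at or above the threshold.

    `threshold` is a hypothetical cut-off; a real value would need tuning.
    """
    pairs = []
    for (name_a, vec_a), (name_b, vec_b) in combinations(sorted(embeddings.items()), 2):
        score = cosine_similarity(vec_a, vec_b)
        if score >= threshold:
            pairs.append((name_a, name_b, score))
    return pairs


# Toy vectors standing in for MPNet embeddings of dataset descriptions.
toy_embeddings = {
    "customer_master": [0.90, 0.10, 0.00],
    "customer_master_copy": [0.88, 0.12, 0.01],
    "trade_ledger": [0.00, 0.20, 0.95],
}

duplicates = find_duplicate_candidates(toy_embeddings)
for name_a, name_b, score in duplicates:
    print(f"{name_a} ~ {name_b}: {score:.3f}")
```

With real embeddings the comparison would be vectorized rather than looped pairwise, but the flagging logic stays the same.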

Automate certification with explainable AI.
Each dataset is assigned a machine-evaluated trust score that combines data quality, lineage consistency, and drift metrics, supporting transparency and regulatory readiness.
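One simple way such a score could be assembled is a weighted average, sketched below. The proposal does not specify a formula, so the weights, the 0-to-1 metric ranges, and the names `DatasetMetrics`, `trust_score`, and `explain` are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetMetrics:
    quality: float              # 0..1, higher is better
    lineage_consistency: float  # 0..1, higher is better
    drift: float                # 0..1, higher means more drift


def _components(m, weights):
    """Per-metric weighted contributions; drift is inverted so less drift scores higher."""
    for value in (m.quality, m.lineage_consistency, m.drift):
        if not 0.0 <= value <= 1.0:
            raise ValueError("metrics must lie in [0, 1]")
    w_quality, w_lineage, w_drift = weights
    total = w_quality + w_lineage + w_drift
    return {
        "quality": w_quality * m.quality / total,
        "lineage_consistency": w_lineage * m.lineage_consistency / total,
        "drift": w_drift * (1.0 - m.drift) / total,
    }


def trust_score(m, weights=(0.4, 0.4, 0.2)):
    """Weighted average in [0, 1]; the default weights are hypothetical."""
    return sum(_components(m, weights).values())


def explain(m, weights=(0.4, 0.4, 0.2)):
    """Expose each metric's contribution so the score can be audited."""
    return _components(m, weights)


metrics = DatasetMetrics(quality=0.9, lineage_consistency=0.8, drift=0.1)
print(round(trust_score(metrics), 2))
print(explain(metrics))
```

Returning the per-metric contributions alongside the total is what makes the score explainable: an auditor can see which metric pulled a dataset below a certification threshold.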

Empower consumers and auditors.
Provide an intuitive visual interface where users can trace data origin, verify accuracy, and audit certification evidence with full explainability.
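Tracing data origin comes down to walking the lineage graph upstream. The sketch below does this with a breadth-first search over a plain dictionary. The graph shape (each dataset mapped to its direct upstream sources), the dataset names, and the `trace_origins` helper are hypothetical; a real lineage store would replace the dictionary.

```python
from collections import deque


def trace_origins(upstream, dataset):
    """Return (ancestors, origins) for `dataset`.

    `upstream` maps each dataset to the datasets it is derived from.
    Origins are reachable datasets that have no upstream sources.
    """
    seen = {dataset}
    queue = deque([dataset])
    origins = set()
    while queue:
        current = queue.popleft()
        parents = upstream.get(current, [])
        if not parents:
            origins.add(current)
        for parent in parents:
            if parent not in seen:  # guards against cycles and repeat visits
                seen.add(parent)
                queue.append(parent)
    ancestors = seen - {dataset}
    return ancestors, origins


# Hypothetical lineage: a report built from cleaned positions and FX rates.
lineage = {
    "risk_report": ["positions_clean", "fx_rates"],
    "positions_clean": ["positions_raw"],
    "positions_raw": [],
    "fx_rates": [],
}

ancestors, origins = trace_origins(lineage, "risk_report")
print(sorted(ancestors))
print(sorted(origins))
```

The same traversal can feed a visual interface: the ancestor set gives the nodes to highlight, and the origin set gives the sources whose certification evidence needs checking.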

One can submit a talk on anything related to Python or open source. Below is the list of categories:

Governance & Maintainability

Title of the talk/workshop
Design and Implement AI-Driven Data Lineage for Data Certification

Abstract of the talk/workshop
Designing and implementing AI-driven data lineage to support data certification.

Category of the talk/workshop
Data Governance

Duration (including Q&A)
20 mins

Level of Audience
Intermediate

Speaker Bio
Please include the following:

Speaker Bio (Brief): Working as VP (Technical) at JPMC
Company/College: NIT Srinagar
Email: [email protected]
Years of Exp: 16 years

Vice President (Technology) at JPMorgan Chase & Co.
https://www.linkedin.com/in/akhil-kumar-gupta-314481102/
https://deducethelogic.com/
