Python code to Generate Report on Validation and Credibility of Datasets #108

@Gladwin001

Description about Issue

When users download a dataset for their project, we want to give them a clear overview of it in the form of a report, so that they have a better idea of how to use the dataset effectively in their own project.

Expected Behavior

We expect the report to cover:

  1. More statistical analysis of the dataset
  2. How its values are distributed, shown as plots
  3. Checks for corrupted and mismatched data
  4. Suggestions for the kinds of project the dataset suits
  5. Suggestions for preprocessing the dataset so it can be used effectively in a project
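
As a starting point for items 1 to 3, here is a minimal sketch that uses only the Python standard library. It is not the code in Main.py. The names `summarize_csv` and `histogram`, the rule that a column counts as numeric when more than half of its non-empty cells parse as numbers, and the equal-width bins are all assumptions made for illustration.

```python
import csv
import io
import statistics


def _to_float(value):
    """Return value as a float, or None if it is not numeric."""
    try:
        return float(value)
    except ValueError:
        return None


def summarize_csv(text):
    """Build a per-column summary for CSV text that has a header row."""
    rows = list(csv.DictReader(io.StringIO(text)))
    columns = list(rows[0].keys()) if rows else []
    summary = {}
    for name in columns:
        values = [row[name] for row in rows]
        # Empty cells (and cells missing from short rows) count as missing.
        present = [v for v in values if v not in (None, "")]
        numbers = [n for n in map(_to_float, present) if n is not None]
        info = {
            "kind": "text",
            "missing": len(values) - len(present),
            "mismatched": 0,
        }
        # Assumption: a column is numeric if most non-empty cells parse.
        if present and len(numbers) * 2 > len(present):
            info["kind"] = "numeric"
            # Non-numeric cells in a numeric column hint at corruption.
            info["mismatched"] = len(present) - len(numbers)
            info["mean"] = statistics.mean(numbers)
            info["min"] = min(numbers)
            info["max"] = max(numbers)
        summary[name] = info
    return summary


def histogram(numbers, bins=5):
    """Count values per equal-width bin; the input for a distribution plot."""
    if not numbers:
        return [0] * bins
    low, high = min(numbers), max(numbers)
    width = (high - low) / bins or 1.0  # avoid dividing by zero
    counts = [0] * bins
    for x in numbers:
        # The maximum value falls on the upper edge, so clamp it.
        index = min(int((x - low) / width), bins - 1)
        counts[index] += 1
    return counts
```

Calling `summarize_csv` on the text of a CSV file returns one dictionary per column, which could then be written out as a section of the report. The counts from `histogram` could be passed to any plotting library to draw the distribution.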

The report should be generated according to the dataset's format, such as CSV, JSON, or txt.
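
One possible way to handle several formats is to pick a format-specific function from the file extension. The sketch below is only an illustration of that idea, using the standard library. The function names, the `DESCRIBERS` table, and the fields in the returned dictionaries are hypothetical and do not come from the existing code.

```python
import csv
import io
import json
from pathlib import PurePath


def describe_csv(text):
    """Report the header and the number of data rows."""
    rows = list(csv.reader(io.StringIO(text)))
    header = rows[0] if rows else []
    return {"format": "csv", "columns": header, "records": max(len(rows) - 1, 0)}


def describe_json(text):
    """Report the top-level type and how many items it holds."""
    data = json.loads(text)
    items = len(data) if isinstance(data, (list, dict)) else 1
    return {"format": "json", "top_level": type(data).__name__, "items": items}


def describe_txt(text):
    """Report line and word counts for plain text."""
    lines = text.splitlines()
    words = sum(len(line.split()) for line in lines)
    return {"format": "txt", "lines": len(lines), "words": words}


# Hypothetical mapping from file extension to the function that handles it.
DESCRIBERS = {".csv": describe_csv, ".json": describe_json, ".txt": describe_txt}


def describe(filename, text):
    """Choose a describer from the extension and report parse failures."""
    suffix = PurePath(filename).suffix.lower()
    describer = DESCRIBERS.get(suffix)
    if describer is None:
        return {"format": "unknown"}
    try:
        return describer(text)
    except (json.JSONDecodeError, csv.Error) as exc:
        # A file that cannot be parsed is itself a finding for the report.
        return {"format": suffix.lstrip("."), "error": str(exc)}
```

Supporting another format would then mean writing one more function and adding it to the table, which fits the suggestion of contributing features one at a time.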

Current Behavior

Main.py in the Validation folder already implements some of the items listed above. See Report.txt for a sample of the report we generated.

Contributions

You can implement the features one at a time and open a pull request for each.
We look forward to your contributions and collaboration.
