Commit 19ab6a2

committed: changes to the report
1 parent 379ff28 commit 19ab6a2

File tree

2 files changed: +5 −4 lines


content/authors/alghali/_index.md

Lines changed: 2 additions & 2 deletions
@@ -10,7 +10,7 @@ authors:
 superuser: false
 
 # Role/position
-role: "underaduate Computer Science student at The University of Khartoum"
+role: "undergraduate Computer Science student at The University of Khartoum"
 
 # Organizations/Affiliations
 organizations:
@@ -20,7 +20,7 @@ organizations:
 
 
 # Short bio (displayed in user profile at end of posts)
-bio: Ahmed Alghali is an undergraduate in Computer Science at the University of Khartoum with interest in applied machine learning and data platforms.
+bio: Ahmed Alghali is an undergraduate Computer Science student at the University of Khartoum with interest in applied machine learning and data platforms.
 
 
 # Social/Academic Networking

content/report/osre25/ucsc/06212025-alghali/index.md

Lines changed: 3 additions & 2 deletions
@@ -39,13 +39,14 @@ The same way the famous paper about the [repoducibility crisis in science](https
 
 The lack of software dependency management, proper version control, log tracking, and effective artifacts sharing made it very difficult to reproduce research in machine learning.
 
-Reproducibility in ML is largely driven by well-established MLOps practices.However, in academic settings reproducibility remains a great challenge, the adaptation and standardization of these practices progress slowly, the best way to ensure is to seamleas experience with MLOps, is to make these capabilities are easily accessible to the researchers' workflow. by developing a tool that steamlines the process of provisioning resources, enviornment setup, model training and artifacts tracking, that ensures reproducible results.
+Reproducibility in machine learning is largely supported by MLOps practices. This is the case in industry, where most researchers are backed by software engineers who set up experimental environments or build tools that streamline the workflow. In academic settings, however, reproducibility remains a great challenge: researchers prefer to focus on coding and worry little about the complexities involved in configuring their experimental environment. As a result, the adoption and standardization of MLOps practices in academia progress slowly. The best way to ensure a seamless experience with MLOps is to make these capabilities easily accessible from the researchers' workflow, by developing a tool that streamlines resource provisioning, environment setup, model training, and artifact tracking, and thereby ensures reproducible results.
+
 
 ### Proposed Solution
 
 ![Solution Architecture](Design.png)
 
-We want researcher to spin up ML research instances/bare metal on Chameleon testbed while keeping the technical complexity involved in configuring and stitching everything together abstracted, the user answers basic questions about the project info, frameworks, tools, features and integrations if there are any and have a full generated project that is reproducible. it contains a provisioning/infrastracture config layer for provisioning resources on the cloud, a dockerfile to spin up services and presistent storage for data,the ML code at its core is backed by ML tracking server system that logs the artifacts, metadata, environment configuration, system specification (GPUs type) and Git status using Mlflow, powered by a postgresSQL for storing metadata and a S3 Minio bucket to store artifacts.
+We want researchers to spin up ML research instances/bare metal on the Chameleon testbed while abstracting away the technical complexity involved in configuring and stitching everything together. Users simply answer a few questions about their project info, frameworks, tools, features, and integrations (if any), and receive a fully generated, reproducible project. It contains a provisioning/infrastructure configuration layer for provisioning resources on the cloud, a Dockerfile to spin up services, and persistent storage for data. The ML code at its core is a containerized training environment backed by an ML tracking server that logs artifacts, metadata, environment configuration, system specification (GPU type), and Git status using MLflow, powered by PostgreSQL for storing metadata and an S3 MinIO bucket for storing artifacts.
 persistent storage for the artifacts generated from the experiment and the datasets, and containerization of all of these to ensure reproducibility. We aim to make the cloud experience easier by handling the configuration needed to set up the environment with a third-party framework, so that seamless access to benchmark datasets or any other necessary components from services like Hugging Face and GitHub is available directly from the container. For more technical details about the solution you can read my proposal [here](https://docs.google.com/document/d/1ilm-yMEq-UTiJPGMl8tQc3Anl5cKM5RD2sUGInLjLbU).
 
 