Skip to content

Commit f5e3f8b

Browse files
authored
Merge branch 'main' into sophietao127-patch-1
2 parents 91637c3 + 157daf8 commit f5e3f8b

File tree

16 files changed

+310
-1
lines changed

16 files changed

+310
-1
lines changed

content/authors/alghali/_index.md

Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,50 @@
1+
---
2+
# Display name
3+
title: Ahmed Alghali
4+
5+
# Username (this should match the folder name)
6+
authors:
7+
- alghali
8+
9+
# Is this the primary user of the site?
10+
superuser: false
11+
12+
# Role/position
13+
role: "undergraduate Computer Science student at The University of Khartoum"
14+
15+
# Organizations/Affiliations
16+
organizations:
17+
- name: University of Khartoum
18+
url: "https://www.uofk.edu/"
19+
20+
21+
22+
# Short bio (displayed in user profile at end of posts)
23+
bio: Ahmed Alghali is an undergraduate Computer Science student at the University of Khartoum with interest in applied machine learning and data platforms.
24+
25+
26+
# Social/Academic Networking
27+
# For available icons, see: https://sourcethemes.com/academic/docs/widgets/#icons
28+
# For an email link, use "fas" icon pack, "envelope" icon, and a link in the
29+
# form "mailto:[email protected]" or "#contact" for contact widget.
30+
social:
31+
- icon: envelope
32+
icon_pack: fas
33+
34+
- icon: github
35+
icon_pack: fab
36+
link: https://github.com/a7med7x7
37+
- icon: linkedin
38+
icon_pack: fab
39+
link: https://www.linkedin.com/in/ahmed-alghali-4997a5229/
40+
41+
42+
# Enter email to display Gravatar (if Gravatar enabled in Config)
43+
44+
45+
# Organizational groups that you belong to (for People widget)
46+
# Set this to `[]` or comment out if you are not using People widget.
47+
user_groups:
48+
- 2025 Contributors
49+
---
50+
Ahmed Alghali is doing his undergrad degree as a Computer Science student at the University of Khartoum during his undergrad he was involved in machine learning projects . He gained hands-on experience working as an ML Engineer at [AirQo](https://www.airqo.net/home), where he contributed to the research Modeling of fused ground measurements and satellite remote-sensing air quality data and the deployment of the machine learning models developed into the AirQo Platform His work blends strong interests in applied machine intelligence. He is currently part of the project [Applying MLOps to overcome reproducibility barriers](https://ucsc-ospo.github.io/project/osre25/nyu/mlops/) at OSPO, with a growing focus on data platforms, reproducible systems and competiting in data science competitions.

content/authors/alghali/avatar.jpg

746 KB
Loading
Lines changed: 44 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,44 @@
1+
---
2+
# Display name
3+
title: Haocheng Xia
4+
5+
# Username (this should match the folder name)
6+
authors:
7+
- haochengxia
8+
9+
# Is this the primary user of the site?
10+
superuser: false
11+
12+
# Role/position
13+
role: Visiting Student
14+
15+
# Organizations/Affiliations
16+
organizations:
17+
- name: Harvard University
18+
url: "https://www.harvard.edu/"
19+
20+
# Short bio (displayed in user profile at end of posts)
21+
bio: >
22+
Haocheng Xia is a Computer Science student at the University of Illinois Urbana-Champaign and currently visiting Harvard University, focusing on cache benchmarking.
23+
24+
# Social/Academic Networking
25+
social:
26+
- icon: envelope
27+
icon_pack: fas
28+
link: mailto:[email protected]
29+
- icon: github
30+
icon_pack: fab
31+
link: https://github.com/haochengxia
32+
- icon: linkedin
33+
icon_pack: fab
34+
link: https://www.linkedin.com/in/haocheng-xia-04b5a0210/
35+
36+
# Enter email to display Gravatar (if Gravatar enabled in Config)
37+
38+
39+
# Organizational groups that you belong to (for People widget)
40+
user_groups:
41+
- 2025 Contributors
42+
---
43+
44+
Haocheng Xia is a first-year Ph.D. student in the Data and Information Systems Laboratory (DAIS) at the University of Illinois Urbana-Champaign (UIUC), with a keen interest in machine learning and storage systems. As a Summer of Reproducibility (SoR) 2025 contributor, Haocheng is enhancing the CacheBench project. His work focuses on building a comprehensive benchmark suite for cache eviction algorithms, providing researchers with an effortless way to compare their designs against existing solutions, analyze performance across diverse microbenchmarks and workloads, and participate in an open cache competition leaderboard. He has previously contributed to libCacheSim, a high-performance simulation tool critical to CacheBench, and is committed to maintaining the benchmark beyond the SoR program.
55.9 KB
Loading

content/authors/nbrewer/_index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,6 +38,6 @@ social:
3838
# Set this to `[]` or comment out if you are not using People widget.
3939
user_groups:
4040
- 2024 Contributors
41-
41+
- 2025 Contributors
4242
---
4343
Nicole Brewer is a graduate research assistant and PhD student at Arizona State University, where she is studying computational reproducibility from both empirical and philosophical perspectives. Specifically, she is studying the reproducibility of Jupyter Notebooks used in scientific contexts. Before grad school, she was a research software engineer at Purdue University, where she developed Jupyter-based web applications to help domain scientists make their work accessible and reproducible. As a former [Better Scientific Software Fellow](https://bssw.io/fellows/nicole-brewer), Nicole had the opportunity to create templates and tutorials promote those uses of Juptyer Notebooks. Her fellowship resulted in the [Jupyter4Science](https://jupyter4.science/) blog and knowledge base for all things scientific Jupyter Notebooks. She formerly serveed on the steering committee of the [United States Research Software Engineering Association](https://us-rse.org/), and has been a member of the [notebook sub-committee](https://us-rse.org/usrse24/participate/#notebooks) for the US-RSE'23 and US-RSE'24 conferences. She holds a BS in Mathematics with Computer Science from Purdue University.

content/authors/wbq321/_index.md

Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
# Display name
3+
title: Baiqiang Wang
4+
5+
# Username (this should match the folder name)
6+
authors:
7+
- wbq321
8+
9+
# Is this the primary user of the site?
10+
superuser: false
11+
12+
# Role/position
13+
role: "Ph.D. Student in CSS, University of Washington"
14+
15+
# Organizations/Affiliations
16+
organizations:
17+
- name: University of Washington
18+
url: "https://www.washington.edu/"
19+
20+
# Short bio (displayed in user profile at end of posts)
21+
bio: Baiqiang Wang is a first-year Ph.D. student at University of Washington. He works on vector database and cryptography.
22+
23+
# Social/Academic Networking
24+
social:
25+
- icon: envelope
26+
icon_pack: fas
27+
link: mailto:[email protected]
28+
- icon: github
29+
icon_pack: fab
30+
link: https://github.com/wbq321
31+
- icon: linkedin
32+
icon_pack: fab
33+
link: https://www.linkedin.com/in/baiqiang-wang-205b35224/
34+
35+
# Enter email to display Gravatar (if Gravatar enabled in Config)
36+
37+
38+
# Organizational groups that you belong to (for People widget)
39+
user_groups:
40+
- 2025 Contributors
41+
---
42+
43+
Baiqiang Wang is a first-year Ph.D. student at University of Washington. His research interests are vector database and cryptography. During OSRE 2025, he is working on enhancing reproducibility in RAG frameworks for scientific workflows.
630 KB
Loading
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
---
2+
title: "Building a Benchmarking Suite for Cache Performance Evaluation"
3+
subtitle: "Introducing my SoR 2025 project with UC OSPO"
4+
summary: "Develop a comprehensive benchmarking suite, CacheBench, for evaluating the performance of cache systems in modern computing environments."
5+
authors:
6+
- haochengxia
7+
tags: ["osre25","reproducibility","storage systems"]
8+
categories: ["SummerofReproducibility24"]
9+
date: 2025-06-21
10+
lastmod: 2025-06-21
11+
featured: true
12+
draft: false
13+
14+
image:
15+
caption: "Haocheng Xia, SoR 2025"
16+
focal_point: "Center"
17+
preview_only: false
18+
---
19+
20+
Hi! I'm Haocheng Xia, a Computer Science student at the **University of Illinois Urbana-Champaign**, passionate about the intersection of **machine learning and storage systems**. Specifically, I'm keen on **workload analysis** and **KV cache management for large language models**.
21+
22+
This summer, I'm happy to be a part of **SoR 2025** and **OSRE 2025**. I'm contributing to the **CacheBench** project. My initiative, **'Building a Benchmarking Suite for Cache Performance Evaluation,'** will create a robust platform. This involves extensive simulation of existing eviction algorithms using [libCacheSim](https://github.com/cacheMon/libCacheSim), developing microbenchmarks, and building a user-friendly platform for researchers to effortlessly evaluate novel cache designs. The ultimate goal is to establish a competitive leaderboard.
23+
24+
My contributions will include a comprehensive dataset detailing simulated **miss ratios** and **throughput** of current cache eviction algorithms, an extension to [libCacheSim](https://github.com/cacheMon/libCacheSim) for executing microbenchmarks both locally and on our online platform, and the creation and ongoing maintenance of a public web leaderboard. I'm grateful to be mentored by **Juncheng Yang** and **Yazhuo Zhang**.
25+
26+
I'm thrilled to be part of building tools that empower users and advance the vision of a more decentralized web. Looking forward to a productive summer!
306 KB
Loading
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
title: "Enhancing Reproducibility in RAG Frameworks for Scientific Workflows"
3+
subtitle:
4+
summary: "This project addresses the critical issue of non-determinism in Retrieval-Augmented Generation (RAG) systems. We aim to develop a suite of tools, benchmarks, and best practices to ensure scientific workflows using Large Language Models are reliable, transparent, and reproducible."
5+
authors:
6+
- wbq-321
7+
tags: ["osre25", "reproducibility", "rag", "llm", "ai-for-science"]
8+
categories: ["Project"]
9+
date: 2025-06-25
10+
lastmod: 2025-06-25
11+
featured: false
12+
draft: false
13+
14+
# Featured image
15+
# To use, add an image named `featured.jpg/png` to your page's folder.
16+
# Focal points: Smart, Center, TopLeft, Top, TopRight, Left, Right, BottomLeft, Bottom, BottomRight.
17+
image:
18+
caption: ""
19+
focal_point: "Smart"
20+
preview_only: false
21+
---
22+
23+
Hello, I'm Baiqiang. As part of the [Enhancing Reproducibility in RAG Frameworks for Scientific Workflows](https://ucsc-ospo.github.io/project/osre25/pnnl/llm_rag_reproducibility/) project, I am excited to introduce my work on a crucial challenge in modern computational science. My [proposal](https://www.overleaf.com/read/fcbxtpngdnhw#8cc2c8) under the mentorship of Luanzheng "Lenny" Guo at Pacific Northwest National Laboratory and Dongfang Zhao at the University of Washington aims to enhance the reproducibility of AI-driven scientific workflows.
24+
25+
### The Problem: A Crisis of Confidence in AI for Science
26+
27+
Large Language Models (LLMs) are transforming scientific research, from accelerating literature reviews to generating novel hypotheses. However, their power is matched by their pitfalls: a tendency to "hallucinate" facts and a lack of transparency. Retrieval-Augmented Generation (RAG) was developed as a powerful solution, grounding LLM outputs in factual evidence retrieved from a specific knowledge base (like a database of scientific papers).
28+
29+
But a hidden problem lurks within RAG: **non-determinism**. The very first step of a RAG system—the similarity search that finds relevant documents—can produce different results even when asked the same question. Variations in indexing algorithms, data updates, or even the underlying software can change which documents are retrieved. For science, this is a critical flaw. If an experiment cannot be repeated with the same results, its conclusions cannot be trusted. This project tackles that challenge head-on.
30+
31+
### Our Mission: Forging a Path to Reproducible RAG
32+
33+
This project proposes a comprehensive solution to systematically identify, measure, and mitigate non-determinism in RAG frameworks. Our goal is to empower researchers to build and use AI tools with confidence.
34+
35+
Our approach is built on four key pillars:
36+
37+
1. **Systematic Analysis:** We will conduct a deep dive into popular RAG components (like FAISS, ScaNN, and HNSW) to pinpoint the exact sources of randomness and variability.
38+
2. **Rigorous Benchmarking:** We will develop a public, open-source benchmarking suite using standardized scientific datasets (from PubMed, arXiv, etc.). This will allow anyone to quantitatively measure the reproducibility of their own RAG pipeline using clear metrics like retrieval overlap and rank correlation.
39+
3. **Targeted Enhancements:** Based on our findings, we will implement practical solutions, including:
40+
* Promoting deterministic algorithms and configurations.
41+
* Building robust data versioning and provenance tracking tools (inspired by DVC and Git LFS).
42+
* Creating tools for precise configuration management to capture the entire experimental setup.
43+
4. **Practical Guidance and Open Source Tools:** We will distill our insights into comprehensive documentation, reusable code examples, and best practices. All tools and findings will be contributed back to the open-source community.

0 commit comments

Comments
 (0)