@gerashegalov
Contributor

What changes were proposed in this pull request?

The Spark Connect Overview doc should reference the non-JVM `pyspark-client` pip package instead of `pyspark[connect]`, which includes a complete SPARK_HOME with JVM jars.

Why are the changes needed?

The Spark Connect Overview page is a top search result, so it should encourage using the package with the minimum footprint.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Previewed the doc on GitHub.

Was this patch authored or co-authored using generative AI tooling?

No

<div data-lang="python" markdown="1">

- First, install PySpark with `pip install pyspark[connect]=={{site.SPARK_VERSION_SHORT}}` or if building a packaged PySpark application/library,
+ First, install PySpark with `pip install pyspark-client=={{site.SPARK_VERSION_SHORT}}` or if building a packaged PySpark application/library,
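
As a rough way to confirm which of the two packages an environment actually ended up with, the standard-library `importlib.metadata` module can be queried by distribution name. This is only a sketch: the helper name `installed_distributions` is hypothetical, and it assumes the distributions are registered as `pyspark` and `pyspark-client` (the `[connect]` extra is installed under the `pyspark` distribution, so it is not listed separately).

```python
from importlib import metadata


def installed_distributions(candidates=("pyspark", "pyspark-client")):
    """Return {distribution name: version} for each candidate that is installed.

    Hypothetical helper for illustration; it only reads package metadata
    and does not import PySpark or start a JVM.
    """
    found = {}
    for name in candidates:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Distribution is not installed in this environment; skip it.
            continue
    return found


if __name__ == "__main__":
    # Prints a dict that is empty when neither distribution is installed.
    print(installed_distributions())
```

If only `pyspark-client` shows up in the result, the environment has the client-only package rather than the full distribution with JVM jars.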
Member


Just to double check, do all examples here work properly? I am asking because I haven't tested it yet :-).

Contributor Author


@HyukjinKwon
Member

Merged to master and branch-4.1.

HyukjinKwon pushed a commit that referenced this pull request Nov 6, 2025
…nect overview

### What changes were proposed in this pull request?
Spark Connect Overview doc should reference the non-JVM pyspark-client pip package instead of `pyspark[connect]` which includes a complete SPARK_HOME with JVM jars

### Why are the changes needed?
Spark Connect Overview is a top search result  and should encourage using a package with the minimum footprint.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Previewed the doc on Github

### Was this patch authored or co-authored using generative AI tooling?
No

Closes #52901 from gerashegalov/patch-1.

Authored-by: Gera Shegalov <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit 2f04e78)
Signed-off-by: Hyukjin Kwon <[email protected]>