Remove dependency of soda-core-spark-df on pyspark #2218
Thanks for the contrib @ghjklw. The pinned version ensures that the expected programmatic API (i.e. classes in this case) is present. The version we are using is 2 years old; do you think that is still too much to ask?
Hi @m1n0, I think that in this case the version requirement is perfectly reasonable, no issue with that 🙂 The specific challenge I'm facing is that the packages To be fair, this is much more of a Databricks issue than a Soda issue, but it's also much easier to fix on this side 😉 In practice, when I override By "ensures that the expected programmatic api (i.e. classes in this cases) is present", do you mean:
The I would rather move forward and not back on this. How about an extra that removes the pyspark dependency? Doing that and adding it to the docs would not break the existing user experience, and it would allow users in scenarios like yours to move forward without friction.
That would be a good option, but I'm not aware of any way to define an extra with "negative" dependencies. There have been several discussions related to similar problems:
It looks like this will be coming via will take some time. There are only 2 options left I can think of:
Hi, just in case it adds something to the discussion. I had some issues with local tests upgrading from soda-core-spark-df 3.3.5 to 3.3.6, because of the
Soda-core-spark-df declares a dependency on pyspark>=3.4.0, but pyspark is used only for typing and should therefore be declared as an extra dependency to avoid unnecessary dependency conflicts.
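For illustration, a minimal sketch of how a module can refer to a pyspark type in annotations without needing pyspark installed at runtime. This is not the actual Soda code: `describe_dataframe` is a hypothetical helper, and the sketch assumes the package only needs `DataFrame` for type hints.

```python
from __future__ import annotations  # annotations are stored as strings, not evaluated

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Seen only by static type checkers (mypy, pyright); skipped at runtime,
    # so importing this module does not require pyspark to be installed.
    from pyspark.sql import DataFrame


def describe_dataframe(df: DataFrame) -> str:
    """Hypothetical helper: relies on duck typing (a `columns` attribute) at runtime."""
    return f"{type(df).__name__} with columns {list(df.columns)}"
```

With this pattern, any object exposing a `columns` attribute works at runtime, for example a DataFrame provided by a pre-installed Spark runtime, while type checkers still validate calls against `pyspark.sql.DataFrame` when pyspark is available in the development environment.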
Closes #2217