Fix/spark oss implementation#152

Merged
anantadwi13 merged 2 commits into main from fix/spark-oss-implementation on Jun 26, 2025
andreas-aji (Contributor) commented on Jun 25, 2025

Summary:
We got this error when running Spark jobs in a Kubernetes cluster: Exception in thread "main" java.lang.RuntimeException: java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.aliyun.oss.OSS not found

It turns out that this class does not exist in the version of the Hadoop Aliyun library that we use. We first tried to resolve the dependencies, but that did not work in the end. The fix we found is simply to remove the line that tries to access the class.
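One way to confirm that a class is missing from a given jar is to list the jar's entries, since a jar is a zip archive. The following is a minimal sketch and not part of this PR: the helper name `jar_has_class` and the jar path in the usage comment are hypothetical.

```python
import zipfile


def jar_has_class(jar_path: str, class_name: str) -> bool:
    """Return True if the jar contains a .class entry for class_name."""
    # A fully qualified class name maps to a path inside the jar,
    # e.g. a.b.C -> a/b/C.class
    entry = class_name.replace(".", "/") + ".class"
    with zipfile.ZipFile(jar_path) as jar:
        return entry in jar.namelist()


# Usage (hypothetical path; point it at the jar that is actually on the classpath):
# jar_has_class("/path/to/hadoop-aliyun.jar", "org.apache.hadoop.fs.aliyun.oss.OSS")
```

If this returns False for the jar shipped in the Spark image, the ClassNotFoundException above is expected whenever the configuration refers to that class.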

Remove this line: conf.set("spark.hadoop.fs.AbstractFileSystem.oss.impl", "org.apache.hadoop.fs.aliyun.oss.OSS")

It's working now.
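The change can be sketched as dropping one key from the Spark settings before they are applied. This is an illustration under assumptions, not the code in this PR: the helper `drop_abstract_fs_binding` is hypothetical, and the `spark.hadoop.fs.oss.impl` entry is an assumed example of a setting that stays in place (the PR does not show the rest of the configuration).

```python
# The key removed in this PR.
ABSTRACT_FS_OSS_KEY = "spark.hadoop.fs.AbstractFileSystem.oss.impl"


def drop_abstract_fs_binding(settings: dict) -> dict:
    """Return a copy of settings without the AbstractFileSystem binding for oss://."""
    return {k: v for k, v in settings.items() if k != ABSTRACT_FS_OSS_KEY}


# Hypothetical settings for illustration only.
settings = {
    # Assumption: the FileSystem binding for oss:// is configured separately and kept.
    "spark.hadoop.fs.oss.impl": "org.apache.hadoop.fs.aliyun.oss.AliyunOSSFileSystem",
    # The entry that references the class missing from the jar.
    ABSTRACT_FS_OSS_KEY: "org.apache.hadoop.fs.aliyun.oss.OSS",
}

cleaned = drop_abstract_fs_binding(settings)

# Each remaining pair would then be applied to the Spark configuration,
# for example with conf.set(key, value) for every item in cleaned.
```

Filtering by key keeps the other OSS settings untouched, which matches the intent described above of removing only the line that references the missing class.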

andreas-aji self-assigned this on Jun 25, 2025

shydefoo (Collaborator) left a comment

Thanks for fixing this!

anantadwi13 merged commit 81b87c2 into main on Jun 26, 2025
1 check passed
3 participants