Commit 99ecf53

[AMORO-3972][Core] Upgrade default Spark version from 3.3 to 3.5 (#3975)
* [AMORO-3972][Core] Upgrade default Spark version from 3.3 to 3.5

  This PR upgrades the default Spark version from 3.3 to 3.5 in the Amoro project.

  Changes:
  - pom.xml: Update spark.version from 3.3.4 to 3.5.7 and spark.major.version from 3.3 to 3.5
  - pom.xml (hadoop2 profile): Update spark.version from 3.3.4 to 3.5.7 and spark.major.version from 3.3 to 3.5
  - docker/build.sh: Update SPARK_VERSION from 3.3.3 to 3.5.7
  - docker/optimizer-spark/Dockerfile: Update ARG SPARK_VERSION from 3.3.3 to 3.5.7
  - .github/workflows/docker-images.yml: Update Spark optimizer matrix from 3.3.3 to 3.5.7
  - README.md: Update Spark optimizer default version documentation

  Closes #3972

* Keep Spark 3.3 as default for hadoop2 profile

  Address review feedback from @turboFei: For hadoop2 profile, keep using the legacy Spark 3.3 by default for better compatibility with Hadoop 2.x.

* Fix Spark version parameter name in README.md

  Address review feedback from @turboFei: Change -Dspark-optimizer.spark-version to -Dspark.version. This was missed in PR #3874.
1 parent 91962e7 commit 99ecf53

5 files changed, +12 -12 lines changed

.github/workflows/docker-images.yml

Lines changed: 6 additions & 6 deletions
@@ -186,7 +186,7 @@ jobs:
     if: ${{ startsWith(github.repository, 'apache/') }}
     strategy:
       matrix:
-        spark: [ "3.3.3" ]
+        spark: [ "3.5.7" ]
         scala: [ "2.12.15" ]
     steps:
       - uses: actions/checkout@v3
@@ -212,11 +212,11 @@ jobs:
           images: |
             name=apache/amoro-spark-optimizer
           tags: |
-            type=ref,event=branch,enable=${{ matrix.spark == '3.3.3' }},suffix=-snapshot
-            type=ref,event=branch,enable=${{ matrix.spark == '3.3.3' }},suffix=-snapshot-spark3.3
-            type=raw,enable=${{ matrix.hadoop == '3.3.3' && startsWith(github.ref, 'refs/tags/v') }},value=latest
-            type=semver,enable=${{ matrix.spark == '3.3.3' }},pattern={{version}}
-            type=semver,enable=${{ matrix.spark == '3.3.3' }},pattern={{version}}, suffix=-spark3.3
+            type=ref,event=branch,enable=${{ matrix.spark == '3.5.7' }},suffix=-snapshot
+            type=ref,event=branch,enable=${{ matrix.spark == '3.5.7' }},suffix=-snapshot-spark3.5
+            type=raw,enable=${{ matrix.hadoop == '3.5.7' && startsWith(github.ref, 'refs/tags/v') }},value=latest
+            type=semver,enable=${{ matrix.spark == '3.5.7' }},pattern={{version}}
+            type=semver,enable=${{ matrix.spark == '3.5.7' }},pattern={{version}}, suffix=-spark3.5
 
       - name: Print tags
         run: echo '${{ steps.meta.outputs.tags }}'

README.md

Lines changed: 1 addition & 1 deletion
@@ -126,7 +126,7 @@ Amoro is built using Maven with JDK 8, 11 and 17(required for `amoro-format-mixe
 * Build with hadoop 2.x(the default is 3.x) dependencies: `./mvnw clean package -DskipTests -Phadoop2`
 * Specify Flink version for Flink optimizer(the default is 1.20.0): `./mvnw clean package -DskipTests -Dflink-optimizer.flink-version=1.20.0`
 * If the version of Flink is below 1.15.0, you also need to add the `-Pflink-optimizer-pre-1.15` parameter: `./mvnw clean package -DskipTests -Pflink-optimizer-pre-1.15 -Dflink-optimizer.flink-version=1.14.6`
-* Specify Spark version for Spark optimizer(the default is 3.3.3): `./mvnw clean package -DskipTests -Dspark-optimizer.spark-version=3.3.3`
+* Specify Spark version for Spark optimizer(the default is 3.5.7): `./mvnw clean package -DskipTests -Dspark.version=3.5.7`
 * Build `amoro-mixed-trino` module under JDK 17: `./mvnw clean package -DskipTests -Pformat-mixed-format-trino,build-mixed-format-trino -pl 'amoro-format-mixed/amoro-mixed-trino' -am`.
 * Build all modules: `./mvnw clean package -DskipTests -Ptoolchain,build-mixed-format-trino`, besides you need config `toolchains.xml` in `${user.home}/.m2/` dir with content below.
 * Build a distribution package with all formats integrated: `./mvnw clean package -Psupport-all-formats`
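
With this change a plain build picks up Spark 3.5.7, and `-Dspark.version` overrides it. The sketch below assembles the README command line for a chosen version and prints it instead of running it. The function name `maven_build_cmd` is hypothetical, and the diff does not say whether `spark.major.version` also has to be overridden when switching to another minor line, so the sketch leaves that property out.

```shell
#!/bin/sh
# Hypothetical helper, not part of the Amoro repository.
# Prints the Maven command from the README for the given Spark version
# (defaults to 3.5.7, the new default in this commit).
maven_build_cmd() {
  version="${1:-3.5.7}"
  printf '%s\n' "./mvnw clean package -DskipTests -Dspark.version=${version}"
}

# Default build, then a build pinned to the previous 3.3.x line.
maven_build_cmd
maven_build_cmd 3.3.4
```

Printing the command keeps the sketch side-effect free; piping the output to `sh` from the project root would actually run the build.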

docker/build.sh

Lines changed: 2 additions & 2 deletions
@@ -27,7 +27,7 @@ cd $CURRENT_DIR
 
 AMORO_VERSION=`cat $PROJECT_HOME/pom.xml | grep 'amoro-parent' -C 3 | grep -Eo '<version>.*</version>' | awk -F'[><]' '{print $3}'`
 FLINK_VERSION=1.20.0
-SPARK_VERSION=3.3.3
+SPARK_VERSION=3.5.7
 DEBIAN_MIRROR=http://deb.debian.org
 APACHE_ARCHIVE=https://archive.apache.org/dist
 FLINK_OPTIMIZER_JOB_PATH=amoro-optimizer/amoro-optimizer-flink/target/amoro-optimizer-flink-${AMORO_VERSION}-jar-with-dependencies.jar
@@ -50,7 +50,7 @@ Images:
 
 Options:
 --flink-version Flink binary release version, default is 1.20.0, format must be x.y.z
---spark-version Spark binary release version, default is 3.3.3, format must be x.y.z
+--spark-version Spark binary release version, default is 3.5.7, format must be x.y.z
 --apache-archive Apache Archive url, default is https://archive.apache.org/dist
 --debian-mirror Mirror url of debian, default is http://deb.debian.org
 --maven-mirror Mirror url of maven, default is https://repo.maven.apache.org/maven2
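
The `--spark-version` option expects an x.y.z value, and the workflow tags in this commit use the major.minor part as a suffix (`-spark3.5`). A minimal sketch of checking the format and deriving that suffix in POSIX shell follows. The function names `validate_version` and `spark_tag_suffix` are hypothetical and do not come from `docker/build.sh`.

```shell
#!/bin/sh
# Hypothetical helpers, not taken from docker/build.sh.

# Succeeds only when the argument has the form x.y.z (digits only).
validate_version() {
  printf '%s\n' "$1" | grep -Eq '^[0-9]+\.[0-9]+\.[0-9]+$'
}

# Prints "spark" followed by the major.minor part of an x.y.z version.
spark_tag_suffix() {
  if ! validate_version "$1"; then
    echo "invalid version: $1 (format must be x.y.z)" >&2
    return 1
  fi
  # ${1%.*} strips the shortest trailing ".<patch>" component.
  printf 'spark%s\n' "${1%.*}"
}

SPARK_VERSION="${SPARK_VERSION:-3.5.7}"
spark_tag_suffix "$SPARK_VERSION"
```

Deriving the suffix from the full version avoids keeping the two values in sync by hand when the default changes again.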

docker/optimizer-spark/Dockerfile

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 
-ARG SPARK_VERSION=3.3.3
+ARG SPARK_VERSION=3.5.7
 
 FROM apache/spark:${SPARK_VERSION}
 
pom.xml

Lines changed: 2 additions & 2 deletions
@@ -126,8 +126,8 @@
 <orc-core.version>1.9.7</orc-core.version>
 <awssdk.version>2.24.12</awssdk.version>
 <aliyun-sdk-oss.version>3.10.2</aliyun-sdk-oss.version>
-<spark.version>3.3.4</spark.version>
-<spark.major.version>3.3</spark.major.version>
+<spark.version>3.5.7</spark.version>
+<spark.major.version>3.5</spark.major.version>
 <dropwizard.metrics.version>4.2.19</dropwizard.metrics.version>
 <caffeine.version>2.9.3</caffeine.version>
 <snakeyaml.version>2.2</snakeyaml.version>
