I use Spark to read JSON files that appear in a folder every day under a yyyy/mm/dd path pattern, and convert them to Iceberg format. Both the JSON source folder and the Iceberg table live in the same S3 bucket, under different paths.
I'm using a stream reader, as in:
    jsondf = (
        spark.readStream.format("json")
        .schema(myschema)
        .option("cleanSource", "archive")
        .option("sourceArchiveDir", "s3a://mybucket/myarchivepath")
        .load("s3a://mybucket/sourcefolder/yyyy/mm/dd")
        .select("*")
    )
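For context, myschema is just an explicit StructType so the JSON reader doesn't have to infer anything. A minimal sketch (the field names here are made up, not the real schema):

    from pyspark.sql.types import StructType, StructField, StringType, TimestampType

    # Hypothetical fields for illustration; the real schema is project-specific.
    myschema = StructType([
        StructField("id", StringType(), True),
        StructField("event_time", TimestampType(), True),
        StructField("payload", StringType(), True),
    ])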
I have been trying several stream writers. A continuously running stream writer (no explicit trigger) seems to work well and archives files as they show up. But we don't receive that many files, so I want to use a trigger instead. trigger(once=True) seems to be the wrong choice for archiving, but I don't know why (is there any reason for once=True to fail when archiving? It looks to me like the natural choice for it). Because of that I'm trying availableNow=True, as in:
    query = (
        jsondf.writeStream
        .trigger(availableNow=True)
        .format("iceberg")
        .option("checkpointLocation", "s3a://mybucket/chkpointfolder")
        .outputMode("append")
        .start(jsontable)
    )
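For comparison, the once variant I tried first differs only in the trigger (a sketch, everything else identical to the writer above):

    # Same writer, but with the older one-shot trigger.
    (jsondf.writeStream
        .trigger(once=True)  # deprecated in Spark 3.4 in favor of availableNow
        .format("iceberg")
        .option("checkpointLocation", "s3a://mybucket/chkpointfolder")
        .outputMode("append")
        .start(jsontable))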
Excuse any typos. I'm writing from a mobile.
Given that the version without a trigger works and archives correctly, why does using a trigger make archiving fail? In fact, I don't even see this stream writer make the reader pick up any files at all.
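For reference, this is roughly how I check whether the reader picks anything up, using the StreamingQuery handle returned by start (a sketch):

    # With availableNow the query is supposed to stop on its own once the
    # backlog is processed, so the driver just waits for it.
    query.awaitTermination()

    # Each entry is the progress report of one micro-batch; I'd expect
    # numInputRows > 0 if any files had been read.
    for progress in query.recentProgress:
        print(progress["numInputRows"])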
PS: I'm using Spark 3.4.1. It seems the once trigger is deprecated there, and availableNow is the recommended replacement.