Have you tried to download the log through Yarn API? This might help
The logs found from storage account is not human-readable. The file parser doesn’t work with java.io.IOException: Not a valid BCFile.
This is because Yarn log is aggregated into IndexFile format, which is not supported by the file parser we have.
yarn.log-aggregation.file-formats = IndexedFormat,TFile
You should see IndexedFormat,TFile as default. Remove IndexedFormat, leave TFile.
yarn.log-aggregation.file-formats = TFile
Restart all the affected services and now you should be able to get TFile logs from storage and read with the parser.
yarn.nodemanager.log-aggregation.compression-type = gz
You should see compression type set to gz by default. Please change the value to “none” to make it work.
yarn.nodemanager.log-aggregation.compression-type = none
By using IndexedFormant + none compression, you should be able to read the log file directly from storage account.