azure - 数据 block : I met with an issue when I was trying to use autoloader to read json files from Azure ADLS Gen2-6ren

azure - 数据 block : I met with an issue when I was trying to use autoloader to read json files from Azure ADLS Gen2

转载作者：行者123 更新时间：2023-12-03 03:31:29

28

4

当我尝试使用自动加载器从 Azure ADLS Gen2 读取 json 文件时遇到问题。我仅针对特定文件遇到此问题。我检查过文件完好并且没有损坏。

问题如下:

Caused by: java.lang.IllegalArgumentException: ***requirement failed: Literal must have a corresponding value to string, but class Integer found.***
    at scala.Predef$.require(Predef.scala:281)
    at at ***com.databricks.sql.io.FileReadException: Error while reading file /mnt/Source/kafka/customer_raw/filtered_data/year=2022/month=11/day=9/hour=15/part-00000-31413bcf-0a8f-480f-8d45-6970f4c4c9f7.c000.json.***
at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1$$anon$2.logFileNameAndThrow(FileScanRDD.scala:598)
at org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.hasNext(FileScanRDD.scala:422)
at scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)
at org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage1.processNext(null:-1)
at org.apache.spark.sql.execution.BufferedRowIterator.hasNext(BufferedRowIterator.java:43)
at org.apache.spark.sql.execution.WholeStageCodegenExec$$anon$1.hasNext(WholeStageCodegenExec.scala:759)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
java.lang.IllegalArgumentException: requirement failed: Literal must have a corresponding value to string, but class Integer found.
at scala.Predef$.require(Predef.scala:281)
at org.apache.spark.sql.catalyst.expressions.Literal$.validateLiteralValue(literals.scala:274)
org.apache.spark.sql.execution.WholeStageCodegenExec$$anon$1.hasNext(WholeStageCodegenExec.sat java.lang.Thread.run(Thread.java:750)

我正在使用 Delta Live Pipeline。这是代码:

@dlt.table(name = tablename,
    comment = "Create Bronze Table",
    table_properties={
        "quality": "bronze"
    }
)
def Bronze_Table_Create():
    return
            spark
            .readStream
            .schema(schemapath)
            .format("cloudFiles")
            .option("cloudFiles.format","json)
            .option("cloudFile.schemaLocation, schemalocation)
            .option("cloudFiles.inferColumnTypes", "false")
            .option("cloudFiles.schemaEvolutionMode", "rescue")
            .load(sourcelocation

最佳答案

我已经解决了这个问题。问题是我们错误地在架构文件中存在重复的列。因此它显示了该错误。然而，这个错误完全是误导性的，这就是为什么无法纠正它。

关于azure - 数据 block : I met with an issue when I was trying to use autoloader to read json files from Azure ADLS Gen2，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/74650227/

28

4

0

文章推荐： c# - 如何使用 Moq 在 C# 中模拟 Azure 存储帐户？

scala - 为什么可以将 Try[Try[Unit]] 值分配给 Try[Unit]？
我刚刚遇到了一个非常奇怪的行为。这是代码: // So far everything's fine val x: Try[Try[Unit]] = Try(Try{}) x: scala.util.T
ruby-on-rails - 更好的替代方法 try( :output). try( :data). try( :name)?
“输出”是一个序列化的 OpenStruct。定义标题 try(:output).try(:data).try(:title) 结束什么会更好？ :) 最佳答案或者只是这样: def title
scala 模式匹配 (Try,Try)
我有以下元组 - (t1,t2) :(Try,Try) 我想检查两者是否成功或其中之一是否失败，但避免代码重复。像这样的东西: (t1,t2) match { case (Success(v1),Su
java - 是否必须将内部 try-with-resources 放入内部 try-with-resources 或其中一个 try-with-resources 中的所有内容都将自动关闭？
是否必须放置内部 try-with-resources 或其中一个 try-with-resources 中的所有内容都会自动关闭？ try (BasicDataSource ds = Bas
java - grails: try 抛出意外的标记: try:
有一点特殊，尝试创建一段 try catch 代码来处理 GoogleTokenResponse，但编译器在 try 时抛出异常错误。有什么想法吗？错误信息: | Loading Grails 2.
try-catch - try ... catch ... [finally] 是如何工作的？
它几乎可以在所有语言中找到，而且我大部分时间都在使用它。我不知道它是内部的，不知道它是如何真正起作用的。它如何在任何语言的运行时在 native 级别工作？例如:如果在 try 内部发生 sta
java - try catch 与 try-with-resources
为什么在 readFile2() 中我需要捕获 FileNotFoundException 以及稍后由 close( ) 方法，并且在 try-with-resources(inside readfi
java - 有没有办法可以制作一个 try-try-catch block ？
我正在使用 Apache POI 尝试读取 Word 文件，但即使您使用过 Apache POI，这仍然应该是可以回答的。在 HWPF.extractor 包中有两个对象:WordExtractor
try-catch - try catch finally 执行流程
如果try-catch的catch block 中抛出异常，那么finally block 会被调用吗？ try { //some thing which throws error } cat
java - Try With Resources 与 Try-Catch
这个问题已经有答案了: What's the purpose of try-with-resources statements? (7 个回答) 已关闭 3 年前。我一直在查看代码，并且已经看到了对
java - Try With Resources 与 Try-Catch
这个问题已经有答案了: What's the purpose of try-with-resources statements? (7 个回答) 已关闭 3 年前。我一直在查看代码，并且已经看到了对
perl - Try::Tiny:try-catch 的奇怪行为与否？
我正在使用 Try::Tiny尝试捕捉。代码如下: use Try::Tiny; try { print "In try"; wrongsubroutine(); # undefi
c++ - try-catch 在 try 中检查多个对象
我想知道这样的代码是否会在抛出异常后总是中断而不继续运行，因此代码不会继续执行第二个 temp.dodaj(b)。 Avto *a = new Avto("lambo",4); Avt
Java 7 try-with-resources - try 子句中可以包含什么
我知道在try子句中必须有一个与资源关联的变量声明。但是除了被分配一个通常的资源实例化之外，它是否可以被分配一个已经存在的资源，例如: public String getAsString(HttpS
java - try-catch 语句在捕获异常时不返回 try block
我有一个写的方法。此方法仅扫描用户输入的整数输入。如果用户输入一个字符值，它将抛出一个输入不匹配异常，这是在我的 Try-Catch 语句中处理的。问题是，如果用户输入任何不是数字的东西，然后抛出异常
java - 为什么不能在 try-with-resources try 子句中重用引用变量？
我注意到这不会编译: PrintWriter printWriter = new PrintWriter("test.txt"); printWriter.append('a'); printWrit
Python:将 try 代码与 try 语句放在同一行有什么好处吗？
我经常看到人们写这样的代码: try: some_function() except: print 'something' 当我认为这样做更干净时: try: some_functio
objective-c - iOS 方向 : Tried this and tried that
该应用程序将在第二个显示器上正常显示内容。问题是当我旋转 iPad 时内容不会在 iPad 上旋转。看过: http://developer.apple.com/library/ios/#qa/qa
java - 我的 try 语句之后的所有内容都必须包含在该 try 语句中才能访问其中的变量吗？
我正在学习 java，我发现我不喜欢的一件事通常是当我有这样的代码时: import java.util.*; import java.io.*; public class GraphProblem
c++ - TRY/CATCH_ALL 与 try/catch
我使用 C++ 有一段时间了，对普通的 try/catch 很熟悉。但是，我现在发现自己在 Windows 上，在 VisualStudio 中编码以进行 COM 开发。代码的几个部分使用了如下内容:

首页

博学

6Ren·AI

商城

azure - 数据 block : I met with an issue when I was trying to use autoloader to read json files from Azure ADLS Gen2