-
Notifications
You must be signed in to change notification settings - Fork 3.6k
[enhance](hive) support reading hive table with OpenCSVSerde #42257
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[enhance](hive) support reading hive table with OpenCSVSerde #42257
Conversation
Thank you for your contribution to Apache Doris. Since 2024-03-18, the Document has been moved to doris-website. |
run buildall |
TPC-H: Total hot run time: 41344 ms
|
TPC-DS: Total hot run time: 191462 ms
|
ClickBench: Total hot run time: 32.48 s
|
run buildall |
TPC-H: Total hot run time: 41015 ms
|
TPC-DS: Total hot run time: 191816 ms
|
ClickBench: Total hot run time: 33.09 s
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR approved by at least one committer and no changes requested. |
PR approved by anyone and no changes requested. |
run p0 |
…42257) ## Proposed changes OpenCSVSerde Properties: | **Property** | **Description** | **Default Value** | **Supported in Doris** | |---------------------------------------|---------------------------------------------------------------------------------------------------|-------------------|--------------------------| | `separatorChar` | Defines the character used to separate fields (columns) in a CSV file. | `,` | Yes | | `quoteChar` | Defines the character used to quote fields that contain special characters, like the separator. | `"` | Yes | | `escapeChar` | Specifies the escape character used for escaping special characters, including quotes and delimiters. | `"` | Yes | ### Explanation: - **`separatorChar`**: This property defines the character that separates columns in the CSV file. Typically, a comma (`,`) is used as the default separator. - **`quoteChar`**: This character is used to enclose fields that contain special characters (like the separator). For example, if a field contains a comma, it is enclosed in quotes (`"`). - **`escapeChar`**: Specifies the character used to escape special characters, such as quotes or the separator. In many cases, a backslash (`\\`) is used as the escape character.
…42257) ## Proposed changes OpenCSVSerde Properties: | **Property** | **Description** | **Default Value** | **Supported in Doris** | |---------------------------------------|---------------------------------------------------------------------------------------------------|-------------------|--------------------------| | `separatorChar` | Defines the character used to separate fields (columns) in a CSV file. | `,` | Yes | | `quoteChar` | Defines the character used to quote fields that contain special characters, like the separator. | `"` | Yes | | `escapeChar` | Specifies the escape character used for escaping special characters, including quotes and delimiters. | `"` | Yes | ### Explanation: - **`separatorChar`**: This property defines the character that separates columns in the CSV file. Typically, a comma (`,`) is used as the default separator. - **`quoteChar`**: This character is used to enclose fields that contain special characters (like the separator). For example, if a field contains a comma, it is enclosed in quotes (`"`). - **`escapeChar`**: Specifies the character used to escape special characters, such as quotes or the separator. In many cases, a backslash (`\\`) is used as the escape character.
…42942) cherry pick from #42257 Co-authored-by: Socrates <[email protected]>
…42940) cherry pick from #42257 Co-authored-by: Socrates <[email protected]>
Proposed changes
OpenCSVSerde Properties:
separatorChar
,
quoteChar
"
escapeChar
"
Explanation:
separatorChar
: This property defines the character that separates columns in the CSV file. Typically, a comma (,
) is used as the default separator.quoteChar
: This character is used to enclose fields that contain special characters (like the separator). For example, if a field contains a comma, it is enclosed in quotes ("
).escapeChar
: Specifies the character used to escape special characters, such as quotes or the separator. In many cases, a backslash (\\
) is used as the escape character.