Popular repositories
-
-
-
-
puppet-hadoop Public
Puppet module for deploying Hadoop MapReduce Next Generation (MRv2)
-
1,069 contributions in the last year
Less
More
Contribution activity
September 2021
Created 13 commits in 3 repositories
Created 2 repositories
- viirya/snappy-java Java
- viirya/arrow-rs Rust
Created a pull request in apache/spark that received 98 comments
[SPARK-36670][SQL][TEST] Add FileSourceCodecSuite
What changes were proposed in this pull request? This patch mainly proposes to add some e2e test cases in Spark for codec used by main datasources. …
+90
−1
•
98
comments
Opened 16 other pull requests in 3 repositories
apache/spark
2
open
9
closed
- [WIP][SPARK-36809][SQL] Remove broadcast for InSubqueryExec used in DPP
- [SPARK-36797][SQL] Union should resolve nested columns as top-level columns
- [SPARK-36673][SQL][FOLLOWUP] Remove duplicate test in DataFrameSetOperationsSuite
- [SPARK-36673][SQL] Fix incorrect schema of nested types of union
- [SPARK-36735][SQL][FOLLOWUP] Fix indentation of DynamicPartitionPruningSuite
- [SPARK-36735][SQL] Adjust overhead of cached relation for DPP
- [SPARK-34479][SQL][DOC][FOLLOWUP] Add zstandard to avro supported codecs
- [SPARK-36669][SQL] Add Lz4 wrappers for Hadoop Lz4 codec
- [SPARK-36670][SQL][TEST][FOLLOWUP] Add AvroCodecSuite
- [SPARK-36682][CORE][TEST] Add Hadoop sequence file test for different Hadoop codecs
- [SPARK-36669][BUILD] Revert to non-shaded Hadoop client library
apache/hadoop
3
merged
xerial/snappy-java
2
open
Reviewed 42 pull requests in 3 repositories
apache/spark
38 pull requests
- [SPARK-36797][SQL] Union should resolve nested columns as top-level columns
- [WIP][SPARK-36809][SQL] Remove broadcast for InSubqueryExec used in DPP
- [SPARK-36794][SQL] Ignore duplicated join keys when building relation for SEMI/ANTI hash join
- [SPARK-34112][BUILD] Upgrade ORC to 1.7.0
- [SPARK-36673][SQL] Fix incorrect schema of nested types of union
- [SPARK-35985][SQL][3.1] push partitionFilters for empty readDataSchema
- [SPARK-36764][SS][TEST] Fix race-condition on "ensure continuous stream is being used" in KafkaContinuousTest
- [SPARK-36760][SQL] Add interface SupportsPushDownV2Filters
- [SPARK-36767][SQL] ArrayMin/ArrayMax/SortArray/ArraySort add comment and Unit test
- [SPARK-36773][SQL][TEST] Fixed unit test to check the compression for parquet
- [SPARK-36783][SQL] ScanOperation should not push Filter through nondeterministic Project
- [SPARK-36718][SQL] Only collapse projects if we don't duplicate expensive expressions
- [SPARK-36759][BUILD] Upgrade Scala to 2.12.15
- [SPARK-36735][SQL] Adjust overhead of cached relation for DPP
- [SPARK-36706][SQL][3.1] OverwriteByExpression conversion in DataSourceV2Strategy use wrong param in translateFilter
- [SPARK-36712][BUILD][FOLLOWUP] Improve the regex to avoid breaking pom.xml
- [SPARK-36702][SQL] ArrayUnion handle duplicated Double.NaN and Float.Nan
- [SPARK-36665][SQL] Add more Not operator simplifications
- [SPARK-36724][SQL] Support timestamp_ntz as a type of time column for SessionWindow
- [SPARK-36726] Upgrade Parquet to 1.12.1
- [SPARK-36719][CORE] Supporting Netty Logging at the network layer
- [SPARK-36556][SQL] Add DSV2 filters
- [SPARK-36334][K8S][FOLLOWUP] Allow equal resource version to update snapshot
- [SPARK-36670][SQL][TEST][FOLLOWUP] Add AvroCodecSuite
- [SPARK-36669][SQL] Add Lz4 wrappers for Hadoop Lz4 codec
- Some pull request reviews not shown.