JDBCRDD is an RDD of internal binary rows that represents a structured query over a table in a database accessed via JDBC.
Note: JDBCRDD represents a "SELECT requiredColumns FROM table" query.
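As a rough illustration of the query shape named in the note, the sketch below assembles such a statement from a column array and a table name. `SelectSketch` and `buildSelect` are hypothetical names introduced here, and the fallback to the constant `1` for an empty column list is an assumption of this sketch, not something the text above states.

```scala
// Hypothetical helper, not Spark's actual code: builds the text of a
// "SELECT requiredColumns FROM table" query from its two inputs.
object SelectSketch {
  def buildSelect(requiredColumns: Array[String], table: String): String = {
    // Assumption: with no columns requested, select the constant 1 so the
    // statement is still valid SQL and still yields one row per table row.
    val columnList =
      if (requiredColumns.isEmpty) "1" else requiredColumns.mkString(",")
    s"SELECT $columnList FROM $table"
  }
}
```

For example, `SelectSketch.buildSelect(Array("id", "name"), "people")` gives `SELECT id,name FROM people`.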
JDBCRDD is created exclusively when JDBCRDD is requested to scanTable (when JDBCRelation is requested to build a scan).
Name | Description |
---|---|
| | Used when…FIXME |
| | Filters as a SQL Used when…FIXME |
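To make "filters as SQL" concrete, here is a minimal sketch of turning a few filter predicates into SQL text. The `SketchFilter` types and the `WhereClauseSketch` object are stand-ins invented for this example and are not Spark's `Filter` classes. Wrapping each predicate in parentheses and joining with `AND` is an assumption about one reasonable encoding, not a description of what JDBCRDD does.

```scala
// Stand-in filter types for illustration only; Spark has its own Filter
// classes, which this sketch does not use.
sealed trait SketchFilter
final case class EqualTo(column: String, value: Any) extends SketchFilter
final case class GreaterThan(column: String, value: Any) extends SketchFilter
final case class IsNotNull(column: String) extends SketchFilter

object WhereClauseSketch {
  // Render a literal: quote strings (doubling embedded quotes), pass others through.
  private def literal(value: Any): String = value match {
    case s: String => "'" + s.replace("'", "''") + "'"
    case other     => other.toString
  }

  // Translate one filter into a single SQL predicate.
  def compile(filter: SketchFilter): String = filter match {
    case EqualTo(c, v)     => s"$c = ${literal(v)}"
    case GreaterThan(c, v) => s"$c > ${literal(v)}"
    case IsNotNull(c)      => s"$c IS NOT NULL"
  }

  // Parenthesize each predicate and join with AND; no filters gives "".
  def whereClause(filters: Array[SketchFilter]): String =
    filters.map(f => s"(${compile(f)})").mkString(" AND ")
}
```

With `Array[SketchFilter](EqualTo("name", "Ann"), GreaterThan("age", 30))` as input, `whereClause` returns `(name = 'Ann') AND (age > 30)`. Column names are passed through unquoted here; a real implementation would need dialect-specific identifier quoting.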
compute(thePart: Partition, context: TaskContext): Iterator[InternalRow]
Note: compute is part of Spark Core's RDD Contract to compute a partition (in a TaskContext).
compute…FIXME
resolveTable(options: JDBCOptions): StructType
resolveTable…FIXME
Note: resolveTable is used exclusively when JDBCRelation is requested for the schema.
scanTable(
  sc: SparkContext,
  schema: StructType,
  requiredColumns: Array[String],
  filters: Array[Filter],
  parts: Array[Partition],
  options: JDBCOptions): RDD[InternalRow]
scanTable…FIXME
Note: scanTable is used when…FIXME
JDBCRDD takes the following when created:
- Schema (StructType)
- Array of Filter predicates
JDBCRDD initializes the internal registries and counters.
getPartitions: Array[Partition]
Note: getPartitions is part of Spark Core's RDD Contract to…FIXME
getPartitions simply returns the partitions that this JDBCRDD was created with.
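The pass-through behaviour described above can be sketched as follows. `SketchPartition` and `TableScanSketch` are hypothetical stand-ins and not Spark's classes. Giving each partition its own `whereClause` is an assumption made here to show why the caller, rather than the RDD, would decide how the table is split.

```scala
// Stand-in types for illustration only; these are not Spark's classes.
// Assumption: each partition carries the predicate selecting its slice.
final case class SketchPartition(index: Int, whereClause: String)

// The partitions are decided by the caller before construction, so
// getPartitions has nothing to compute and hands them back as-is.
class TableScanSketch(partitions: Array[SketchPartition]) {
  def getPartitions: Array[SketchPartition] = partitions
}
```

Constructing `new TableScanSketch(Array(SketchPartition(0, "id < 100"), SketchPartition(1, "id >= 100")))` and calling `getPartitions` returns the same two partitions in the same order.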