SparkSession
Defined in: spark-session.ts:127
The client handle for a Spark Connect session.
Holds the transport, session identifier, and runtime-adapter hooks (for
example the Arrow decoder). All DataFrame operations are scheduled against
a SparkSession; most applications create one session at startup and
reuse it.
Construct a session with the runtime-specific builder, for example
SparkSessionBuilder from @spark-connect-js/node.
Example
Section titled “Example”import { SparkSessionBuilder } from "@spark-connect-js/node";
const spark = await SparkSessionBuilder .remote("sc://localhost:15002") .build();
const df = await spark.sql("SELECT 1 AS n");console.log(await df.collect());Spark source: SparkSession.scala
Properties
Section titled “Properties”catalog
Section titled “catalog”readonly catalog: Catalog;Defined in: spark-session.ts:156
Access the session catalog for inspecting databases, tables, and columns.
readonly conf: RuntimeConfig;Defined in: spark-session.ts:162
Read and write Spark configuration entries on the connected server.
sessionId
Section titled “sessionId”readonly sessionId: string;Defined in: spark-session.ts:128
readonly udf: UDFRegistration;Defined in: spark-session.ts:159
Register Java UDFs and UDAFs as SQL functions.
Accessors
Section titled “Accessors”Get Signature
Section titled “Get Signature”get read(): DataFrameReader;Defined in: spark-session.ts:165
Returns a DataFrameReader for building Read plans.
Returns
Section titled “Returns”Methods
Section titled “Methods”addTag()
Section titled “addTag()”addTag(tag): void;Defined in: spark-session.ts:272
Tag every subsequent operation on this session with tag. Tags are
carried on ExecutePlanRequest.tags and let you cancel a group of
operations with interruptTag.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
tag | string |
Returns
Section titled “Returns”void
Throws
Section titled “Throws”InvalidInputError if the tag contains , or is empty.
clearTags()
Section titled “clearTags()”clearTags(): void;Defined in: spark-session.ts:288
Drop all active tags.
Returns
Section titled “Returns”void
createDataFrame()
Section titled “createDataFrame()”createDataFrame(data, schema?): DataFrame;Defined in: spark-session.ts:210
Create a DataFrame from Arrow IPC data.
Parameters
Section titled “Parameters”| Parameter | Type | Description |
|---|---|---|
data | Uint8Array | Arrow IPC streaming format bytes |
schema? | string | Optional DDL-formatted schema string (e.g. “id INT, name STRING”) |
Returns
Section titled “Returns”Example
Section titled “Example”const arrowData = ArrowEncoder.encode(rows, schema); const df = spark.createDataFrame(arrowData);getTags()
Section titled “getTags()”getTags(): string[];Defined in: spark-session.ts:283
Return a snapshot of the currently active tags.
Returns
Section titled “Returns”string[]
interruptAll()
Section titled “interruptAll()”interruptAll(): Promise<string[]>;Defined in: spark-session.ts:296
Interrupt every running operation in this session.
Returns
Section titled “Returns”Promise<string[]>
interruptOperation()
Section titled “interruptOperation()”interruptOperation(operationId): Promise<string[]>;Defined in: spark-session.ts:307
Interrupt a single running operation by its operation ID.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
operationId | string |
Returns
Section titled “Returns”Promise<string[]>
interruptTag()
Section titled “interruptTag()”interruptTag(tag): Promise<string[]>;Defined in: spark-session.ts:301
Interrupt every running operation tagged with tag.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
tag | string |
Returns
Section titled “Returns”Promise<string[]>
range()
Section titled “range()”range( startOrEnd, end?, step?, numPartitions?): DataFrame;Defined in: spark-session.ts:188
Create a DataFrame with a single id column containing a sequence of
integers from start (inclusive) to end (exclusive), incrementing by step.
Mirrors PySpark’s spark.range(start, end, step, numPartitions).
Parameters
Section titled “Parameters”| Parameter | Type | Default value |
|---|---|---|
startOrEnd | number | undefined |
end? | number | undefined |
step? | number | 1 |
numPartitions? | number | undefined |
Returns
Section titled “Returns”Example
Section titled “Example”spark.range(10) // 0, 1, 2, ..., 9 spark.range(1, 10) // 1, 2, 3, ..., 9 spark.range(0, 10, 2) // 0, 2, 4, 6, 8removeTag()
Section titled “removeTag()”removeTag(tag): void;Defined in: spark-session.ts:278
Remove a previously added tag. No-op if the tag wasn’t set.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
tag | string |
Returns
Section titled “Returns”void
sql(query): DataFrame;Defined in: spark-session.ts:170
Execute a SQL query.
Parameters
Section titled “Parameters”| Parameter | Type |
|---|---|
query | string |
Returns
Section titled “Returns”stop()
Section titled “stop()”stop(): Promise<void>;Defined in: spark-session.ts:340
Stop the session: releases server-side state and closes the transport.
Returns
Section titled “Returns”Promise<void>
version()
Section titled “version()”version(): Promise<string>;Defined in: spark-session.ts:332
Return the Apache Spark version reported by the connected server.
One AnalyzePlan RPC. Result is not cached; call once and store if you need it repeatedly.
Mirrors pyspark.sql.SparkSession.version.
Returns
Section titled “Returns”Promise<string>
builder()
Section titled “builder()”static builder(): SparkSessionBuilder;Defined in: spark-session.ts:149