
Add session and statement state for all query types #2413

Closed

Conversation

penghuo (Collaborator) commented Nov 2, 2023

Description

  1. Create a session and statement for all query types.

Issues Resolved

#2401

Check List

  • New functionality includes testing.
    • All tests pass, including unit tests, integration tests, and doctests
  • New functionality has been documented.
    • New functionality has javadoc added
    • New functionality has user manual doc added
  • Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

}
JSONObject result = new JSONObject();
result.put(STATUS_FIELD, statementState.getState());
result.put(ERROR_FIELD, Optional.of(statement.getStatementModel().getError()).orElse(""));
Member:

Is the error written to the result index, or to the statement model in the request index? Are these different cases?

Collaborator (Author):

Both. The Flint Spark job:

  • writes result, state, and error to the result index;
  • writes state and error to the request index.

@kaituo please help confirm also.

Contributor:

yes
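The dual write confirmed above (result, state, and error to the result index; state and error to the request index) can be sketched roughly as below. The StatementModel record, field names, and toResultDoc/toRequestDoc helpers are illustrative stand-ins, not the plugin's actual API. Note that Optional.ofNullable (rather than Optional.of, as in the excerpt earlier in the thread) is what keeps a statement without an error from throwing a NullPointerException.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

public class DualWriteSketch {
  // Hypothetical stand-in for the plugin's statement model.
  record StatementModel(String state, String error, String result) {}

  // Document written to the result index: result + state + error.
  static Map<String, Object> toResultDoc(StatementModel m) {
    Map<String, Object> doc = new LinkedHashMap<>();
    doc.put("result", m.result());
    doc.put("status", m.state());
    // ofNullable, not of: a statement without an error yields "" instead of an NPE.
    doc.put("error", Optional.ofNullable(m.error()).orElse(""));
    return doc;
  }

  // Document written to the request index: state + error only, no result payload.
  static Map<String, Object> toRequestDoc(StatementModel m) {
    Map<String, Object> doc = new LinkedHashMap<>();
    doc.put("status", m.state());
    doc.put("error", Optional.ofNullable(m.error()).orElse(""));
    return doc;
  }

  public static void main(String[] args) {
    StatementModel ok = new StatementModel("success", null, "{\"rows\":[]}");
    System.out.println(toResultDoc(ok));
    System.out.println(toRequestDoc(ok));
  }
}
```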


codecov bot commented Nov 2, 2023

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.56%. Comparing base (2f2ecd2) to head (d45376c).
Report is 207 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff              @@
##               main    #2413      +/-   ##
============================================
+ Coverage     95.54%   95.56%   +0.01%     
- Complexity     4985     4987       +2     
============================================
  Files           478      478              
  Lines         13883    13919      +36     
  Branches        931      931              
============================================
+ Hits          13265    13301      +36     
  Misses          598      598              
  Partials         20       20              
Flag Coverage Δ
sql-engine 95.56% <100.00%> (+0.01%) ⬆️


penghuo (Collaborator, Author) left a comment:

@kaituo two questions.

  1. Should we add a new StatementState cancelling, with the Spark job updating it to cancelled?
  2. What is the required configuration of the REPL job? We have the following configurations now; do we need more?
    • config.put(FLINT_JOB_REQUEST_INDEX, DATASOURCE_TO_REQUEST_INDEX.apply(datasourceName));
    • config.put(FLINT_JOB_SESSION_ID, sessionId);
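Question 1 describes a two-step cancellation: the plugin requests cancellation by marking a statement cancelling, and the Spark job later finalizes it as cancelled. A minimal sketch of that state machine, with state names assumed for illustration (the plugin's actual StatementState enum may differ):

```java
public class StatementStateSketch {
  // Assumed state names; only CANCELLING is the proposed addition.
  enum StatementState { WAITING, RUNNING, SUCCESS, FAILED, CANCELLING, CANCELLED }

  // Plugin side: request cancellation only from non-terminal states.
  static StatementState requestCancel(StatementState current) {
    switch (current) {
      case WAITING:
      case RUNNING:
        return StatementState.CANCELLING;
      default:
        return current; // terminal states are left unchanged
    }
  }

  // Spark job side: acknowledge the request by moving CANCELLING to CANCELLED.
  static StatementState acknowledgeCancel(StatementState current) {
    return current == StatementState.CANCELLING ? StatementState.CANCELLED : current;
  }

  public static void main(String[] args) {
    StatementState s = requestCancel(StatementState.RUNNING);
    System.out.println(s);                    // CANCELLING
    System.out.println(acknowledgeCancel(s)); // CANCELLED
  }
}
```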

kaituo (Contributor) commented Nov 2, 2023

  1. Can you just update it as cancelled? If you cancel before the REPL picks the statement up, or after the REPL finishes the statement, the REPL doesn't need to do anything else. If you cancel after the REPL picks it up and before it finishes, the REPL may change your state, which I think is fine.

  2. I also need these two:

    val dataSource = conf.get("spark.flint.datasource.name", "unknown")
    val wait = conf.get("spark.flint.job.type", "continue")
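Combining the two config.put entries from the question with the two Spark settings just listed, a REPL job configuration might be assembled as follows. The string values behind FLINT_JOB_REQUEST_INDEX and FLINT_JOB_SESSION_ID, and the DATASOURCE_TO_REQUEST_INDEX mapping, are placeholders here, not the plugin's real settings; only spark.flint.datasource.name and spark.flint.job.type (with their defaults) come from the snippet above.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

public class ReplConfigSketch {
  // Placeholder keys; the plugin's actual constant values may differ.
  static final String FLINT_JOB_REQUEST_INDEX = "spark.flint.job.requestIndex";
  static final String FLINT_JOB_SESSION_ID = "spark.flint.job.sessionId";
  // These two keys are quoted from the Scala snippet above.
  static final String FLINT_DATASOURCE_NAME = "spark.flint.datasource.name";
  static final String FLINT_JOB_TYPE = "spark.flint.job.type";

  // Illustrative stand-in for DATASOURCE_TO_REQUEST_INDEX: one request index per datasource.
  static final Function<String, String> DATASOURCE_TO_REQUEST_INDEX =
      ds -> ".query_execution_request_" + ds;

  static Map<String, String> replJobConfig(String datasourceName, String sessionId) {
    Map<String, String> config = new HashMap<>();
    config.put(FLINT_JOB_REQUEST_INDEX, DATASOURCE_TO_REQUEST_INDEX.apply(datasourceName));
    config.put(FLINT_JOB_SESSION_ID, sessionId);
    config.put(FLINT_DATASOURCE_NAME, datasourceName);
    config.put(FLINT_JOB_TYPE, "continue"); // default the Spark job falls back to
    return config;
  }

  public static void main(String[] args) {
    System.out.println(replJobConfig("mys3", "session-123"));
  }
}
```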

createSessionAndStatement(
dispatchQueryRequest,
dispatchQueryRequest.getApplicationId(),
DROP_INDEX_JOB_ID,
Contributor:

Does that mean there is only one job ID for all DML queries? Would it cause issues if there are concurrent DML queries running?

dispatchQueryRequest,
dispatchQueryRequest.getApplicationId(),
jobId,
SessionType.BATCH,
Contributor:

Should the type be streaming?

Swiddis (Collaborator) commented Jan 7, 2025

Closing as stale -- reopen if needed

@Swiddis closed this Jan 7, 2025