[Help] Are simple SQL queries such as “select * from tableA” not supported? #853

Open
casionone opened this issue Jan 3, 2025 · 3 comments
Comments

@casionone

casionone commented Jan 3, 2025

Environment

  • Spark 2.4.3
  • Spline agent 1.1.0

We tried to execute a simple SQL query such as "select * from tableA", but could not get any Spline log information.

[Screenshot: 企业微信截图_17350913834819]

We found that the select query follows this code path (queryExecution.analyzed), so the withAction method is never executed.

@cerveada cerveada added this to Spline Jan 3, 2025
@github-project-automation github-project-automation bot moved this to New in Spline Jan 3, 2025
@wajda
Contributor

wajda commented Jan 4, 2025

Please share your code sample. Also note that by default the lineage is captured lazily, so you need to save the result somewhere in order to get the lineage. If you want to view lineage of temporary data you can enable it in the configuration:

spline.plugins.za.co.absa.spline.harvester.plugin.embedded.NonPersistentActionsCapturePlugin.enabled=true

See: https://github.com/AbsaOSS/spline-spark-agent?tab=readme-ov-file#spark-features-coverage
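
As a rough illustration of where this setting could go when it is passed through the Spark configuration, the sketch below assembles `--conf` arguments for `spark-submit`. It assumes the agent reads its properties from the Spark conf under a `spark.` prefix and that the listener is registered through `spark.sql.queryExecutionListeners`, as the agent README describes. The helper name `spline_submit_args` is hypothetical, and whether the plugin exists in agent 1.1.0 should be checked against that version's README.

```python
# Hypothetical helper: builds spark-submit "--conf" arguments for the Spline agent.
# Assumption: Spline properties set through the Spark conf carry a "spark." prefix.

PLUGIN_KEY = (
    "spline.plugins.za.co.absa.spline.harvester.plugin.embedded."
    "NonPersistentActionsCapturePlugin.enabled"
)
LISTENER = "za.co.absa.spline.harvester.listener.SplineQueryExecutionListener"


def spline_submit_args(capture_non_persistent=True):
    """Return a flat list of spark-submit arguments that enable the agent."""
    conf = {
        "spark.sql.queryExecutionListeners": LISTENER,
        "spark." + PLUGIN_KEY: str(capture_non_persistent).lower(),
    }
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    # Print the arguments so they can be pasted into a spark-submit command line.
    print(" ".join(spline_submit_args()))
```

The same key/value pairs could instead be placed in a `spline.properties` file on the classpath (without the `spark.` prefix), which is the form shown in the comment above.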

@casionone
Author

The SQL is simply 'select * from tablea;'
I also found the list of supported commands:

Implemented
CreateDataSourceTableAsSelectCommand (org.apache.spark.sql.execution.command)
CreateHiveTableAsSelectCommand (org.apache.spark.sql.hive.execution)
CreateTableCommand (org.apache.spark.sql.execution.command)
DropTableCommand (org.apache.spark.sql.execution.command)
InsertIntoDataSourceDirCommand (org.apache.spark.sql.execution.command)
InsertIntoHadoopFsRelationCommand (org.apache.spark.sql.execution.datasources)
InsertIntoHiveDirCommand (org.apache.spark.sql.hive.execution)
InsertIntoHiveTable (org.apache.spark.sql.hive.execution)
SaveIntoDataSourceCommand (org.apache.spark.sql.execution.datasources)

Is a simple select statement not supported?

@wajda
Contributor

wajda commented Jan 7, 2025

Yes, it is supported, but the lineage is only captured when the result is stored somewhere. Alternatively, enable NonPersistentActionsCapturePlugin as I mentioned above.
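
To make the "stored somewhere" option concrete, here is a minimal PySpark-style sketch that writes the query result out instead of only reading it. The function name `run_and_persist`, the table name `tablea`, and the output path are illustrative assumptions. `spark` is expected to be a SparkSession that already has the Spline agent attached.

```python
def run_and_persist(spark, query, output_path):
    """Run `query` and save its result, so that a write action is executed.

    Assumption: `spark` is a SparkSession with the Spline listener enabled.
    Parquet is only one possible target; other supported writes should also work.
    """
    df = spark.sql(query)  # builds the query plan; no write has happened yet
    df.write.mode("overwrite").parquet(output_path)  # the write action that persists data
    return output_path


# Example call (hypothetical path):
# run_and_persist(spark, "select * from tablea", "/tmp/tablea_copy")
```

A file-based write like this is normally planned by Spark as InsertIntoHadoopFsRelationCommand, which is one of the commands in the supported list quoted earlier in the thread.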
