-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Look into how rejected rows are handled in v2 connector #275
Comments
Note that rows are only rejected during the copying/committing of data from HDFS to Vertica. This means that the data must be valid in HDFS, but for whatever reason it does not satisfy the constraints in Vertica. A good way to test this is to create the schema ahead of time and use the |
Also note that the error when the schema in the DF and Vertica differ is the same as ticket #284, where the log indicates that 0 rows were copied but also says that 0 rows were rejected (no data was copied to Vertica, in the target table or the rejects table). |
When testing a DF containing nullable values, against a Vertica table that is not nullable, the error is slightly different. The logs do report the number of copied rows and rejected rows:
With a sufficiently high If we want to save the rejected data we will need to change the current behaviour of the connector. For example, add a For more information on how Vertica handles rejected rows, see here. |
This ticket only deals with improving the logging around the rejected data table. See #293 to persist the actual rejected data table. |
Modify an example to contain rejected rows and see if those rows are contained in rejects table. We expect rows with issues to be in this table.
The text was updated successfully, but these errors were encountered: