Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🐛 [firestore-bigquery-export] Error initializing raw change log table #2134

Open
boywijnmaalen opened this issue Jul 23, 2024 · 8 comments
Labels
extension: firestore-bigquery-export Related to firestore-bigquery-export extension type: bug Something isn't working

Comments

@boywijnmaalen
Copy link

Describe your configuration

  • Extension name: firestore-bigquery-export
  • Extension version: 0.1.51
  • Configuration values:
    BigQuery Dataset location: europe-****
    BigQuery Project ID: ***
    Collection path: ***
    Enable Wildcard Column field with Parent Firestore Document IDs: true
    Dataset ID: ***
    Table ID: ***
    BigQuery SQL table Time Partitioning option type: DAY
    BigQuery Time Partitioning column name: timestamp
    Firestore Document field name for BigQuery SQL Time Partitioning field option: timestamp
    BigQuery SQL Time Partitioning table schema field(column) type: TIMESTAMP
    BigQuery SQL table clustering: ***
    Maximum number of synced documents per second: 100
    Backup Collection Name: Parameter not set
    Transform function URL: Parameter not set
    Use new query syntax for snapshots: no
    Exclude old data payloads: no
    Use Collection Group query: no
    Cloud KMS key name: Parameter not set

Describe the problem

in the last 30 days I can find 14 occurrence of this error:

Unhandled error Error: Error initializing raw change log table: Field data has type JSON, which is not supported for clustering.

image

I could find at least 1 (but won't rule out more) missing updates for Firestore documents in BQ.

fix: a manual edit of the FS doc in question resulted in a sync to BQ (as expected).

stack trace;

Unhandled error Error: Error initializing raw change log table: Field data has type JSON, which is not supported for clustering. at FirestoreBigQueryEventHistoryTracker.initialize (/workspace/node_modules/@firebaseextensions/firestore-bigquery-change-tracker/lib/bigquery/index.js:192:23) at process.processTicksAndRejections (node:internal/process/task_queues:95:5) at async /workspace/lib/index.js:134:5 at async /workspace/node_modules/firebase-functions/lib/common/providers/tasks.js:74:17

Steps to reproduce:

unsure how to reproduce as I do not fully understand the problem

Expected result

to have every update for relevant Firestore docs synced to BQ

Actual result

missing update(s) for Firestore docs in BQ

@boywijnmaalen boywijnmaalen added the type: bug Something isn't working label Jul 23, 2024
@cabljac cabljac added the extension: firestore-bigquery-export Related to firestore-bigquery-export extension label Jul 23, 2024
@cabljac
Copy link
Contributor

cabljac commented Jul 23, 2024

This is strange to me, as we don't use the datatype JSON as far as I know, data is saved in a raw string.

Do you have any more info on your Clustering you're able to provide?

@boywijnmaalen
Copy link
Author

boywijnmaalen commented Jul 23, 2024

Hi @cabljac,

thanks for your quick reply;

schema:
image

(part of) table info;
image

@cabljac
Copy link
Contributor

cabljac commented Jul 24, 2024

Was this schema solely generated by the extension?

@boywijnmaalen
Copy link
Author

boywijnmaalen commented Jul 25, 2024

I don't really remember (the use of this extension has been in place for over 3 years)

tbh, does it matter? clustering by JSON type fields is not at all possible in BQ. our two JSON fields (data, old_data) are therefore not part of the defined clustered fields.

the error does not make sense to me. it seems somehow internally the data/old_data fields are used in such a way that is incompatible with JSON type.

what also strikes me as odd, is the fact the errors seems to be randomly appearing, if it really was related to JSON types fields then I'd expect this error to occur for every firestore>bigquery event. but this is not the case.

@boywijnmaalen
Copy link
Author

boywijnmaalen commented Jul 25, 2024

could this error be related to the fact we updated the extension to version 0.1.51?

I'm unsure when we updated this extension however it must have been recently as this version was released on June 19th

@cabljac
Copy link
Contributor

cabljac commented Jul 25, 2024

I'll have a look back at the release commits and see if anything could have caused this to start happening. Thanks for providing all this info by the way!

I'll see if we can get this issue prioritised

@boywijnmaalen
Copy link
Author

@cabljac

wondering if you have an update on this?

@boywijnmaalen
Copy link
Author

boywijnmaalen commented Sep 9, 2024

@cabljac

a kind reminder 🙏🏾

do you have an update on a potential fix?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
extension: firestore-bigquery-export Related to firestore-bigquery-export extension type: bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants