295 questions
Best practices
0
votes
0
replies
33
views
Why prefer webhooks to CDC?
QuickBooks Online's API documentation includes a recommendation that users use webhooks instead of change-data-capture to synchronize their entity records with QuickBooks's. Why are webhooks ...
0
votes
1
answer
85
views
Clickhouse race condition on materialized views
I’m using PostgreSQL as the source database with Change Data Capture (CDC) enabled via publications. I have two related tables:
-- Table 1: orders
CREATE TABLE orders (
id UUID PRIMARY KEY,
...
1
vote
1
answer
63
views
How to capture the name of the user which made the change using PostgreSQL CDC
We need to capture changes done in the PostgreSQL DB tables, and are planning on using CDC for the same.
We have a requirement to ignore/filter events which were written inn the DB by a specific user/...
0
votes
0
answers
24
views
Stream / changefeed that only emits columns that have changed in a record
I am looking for a way to design a stream that ONLY emits columns that have changed in a record. If I use the stream, it emits the entire record even if only one of the column values has changed in ...
0
votes
1
answer
95
views
Debezium connector failing with `Unexpected Kafka request of type METADATA during SASL handshake` error
We have a Kafka cluster that we're trying to connect Debezium to. We are able to successfully deploy a Producer or Consumer using the following producer.config/consumer.config (these are temporary ...
0
votes
0
answers
84
views
pyFlink job - Could not find any factory for identifier 'mysql-cdc' when creating a Flink-cdc table
I'm running a Flink cluster in Docker in my local env, and I've copied these jar files to the /opt/flink/lib/ of the image:
flink-cdc-dist-3.3.0.jar
flink-cdc-pipeline-connector-mysql-3.3.0.jar
flink-...
1
vote
1
answer
60
views
Azure Data Factory trigger off a SQL table re-load
I want to have a ADF job that triggers on a SQL (Azure SQL DB) table re-load. I know that there is a Change Data Capture (CDC) trigger that can be used in ADF, but that seems to be something I would ...
0
votes
1
answer
326
views
How to Configure CDC in Azure Data Flow with Parameterized Datasets for Daily Partition Processing?
I am fairly new to Azure Data Factory (ADF) but have been learning and experimenting with some of its advanced features. I'm currently working on a use case involving Change Data Capture (CDC) and ...
3
votes
2
answers
132
views
How to Handle Rows with and without Status Changes in History Table
I have a table in SQL Server that tracks the status of different IDs over time. The table includes both historical data and current data, and I need to write a query that will return the correct ...
1
vote
1
answer
359
views
Salesforce API - Query of Change Data Capture's Selected Entities jsforce
Issue
I am trying to make a query to the Salesforce API to retrieve the "Selected Entities" that are listed under "Change Data Capture" using jsforce v1.11.1. The goal would be to ...
0
votes
1
answer
31
views
CDC in PieCloudDB
I’m looking to implement Change Data Capture(CDC) so I can synchronize other systems with the changes. I learned that PostgreSQL can use Debezium for CDC. Can PieCloudDB also support Debezium? Or does ...
0
votes
1
answer
895
views
DebeziumException "The connector previously stopped while taking a snapshot, but now the connector is configured to never allow snapshots"
I am using debezium in distributed mode to push the cdc events of mssql db to kafka topics.
Command: %KAFKA_HOME%\bin\windows\connect-distributed.bat
Debezium starts fine, I deployed the connector ...
0
votes
2
answers
745
views
Handling DELETE operations with ClickHouse Kafka Connect Sink
I'm working on a project related to building streaming change data capture using Kafka Connect. The source of changes is MySQL, then they are sent to the corresponding Kafka topic, and then applied to ...
3
votes
1
answer
261
views
Which column to use for ordering change data read from change tables in SQL Server?
Should I rely on seqval for ordering change data captured from change tables in SQL Server, documentation provides contradictory statements in two places. If not, is there any option?
If we check cdc....
0
votes
0
answers
241
views
Add column to _CT table
As a test, I enabled CDC on a DB and table for specific columns.
CREATE TABLE [dbo].[testCDC]
(
[ID] [bigint] IDENTITY(1,1) NOT NULL,
[Col1] [nchar](10) NULL,
[Col2] [nchar](10) NULL,
...
1
vote
0
answers
452
views
What is the function of `microBatchDF._jdf.sparkSession().sql` in this code?
What is the function of microBatchDF._jdf.sparkSession().sql in this code?
def upsert_to_delta(microBatchDF, batchId):
microBatchDF.createOrReplaceTempView("updates")
microBatchDF....
0
votes
1
answer
436
views
Is it possible to retrieve the CDC data that the cleanup job deleted?
We are using CDC of a SQL Server database for the audit trail of an application.
Data capture was enabled and works correctly, however the retention of the cleanup job was too short and deleted ...
1
vote
0
answers
190
views
wal streaming and logical replication slot: offset of stream
Good day!
I was using pgsync for change data capture (CDC) from postgres (wal=logical) to elasticsearch.
What I noticed while using pgsync is that it captures all data from postgres on initial run.
...
3
votes
0
answers
914
views
How to hash multiple columns of different types in Trino/Athena
I am trying to generate a hash based on multiple columns in a table. The columns could be of different types.
I currently have some complex code that one needs to remember to update anytime another ...
0
votes
1
answer
397
views
aws dms for postgres updates not showing before image
I am working on POC to migrate from Debezium to AWS DMS for PostgreSQL. PostgreSQL as Source and Kafka as Destination (AWS MSK). In Debezium i used to see before and after image for updates, but in ...
0
votes
1
answer
502
views
Is CDF feature possible using delta-spark on Cloudera distribution?
We have our application using the on-premise CDP (Cloudera) cluster for submitting pyspark jobs.
Version of spark is 2.x
We are now exploring the option to have CDC datasets processed and merge with ...
0
votes
1
answer
292
views
MongoDB connection error : Oracle goldengate mongodb change data capture setup
While connecting to mongodb from goldengate big data(19c classic) GGSCI interface. I'm facing below error
GGSCI (meteor-VirtualBox) 1> alter credentialstore add user mongodb://@meteor.rjzzi46....
0
votes
1
answer
420
views
Change Data Capture is not Deleting Rows Older than the Configured Retention Period
I enabled change data capture and set the retention to 2 days. I have CU update 31 and above installed. Old CDC records are not deleting.
The SQL Agent cleanup jobs are running successfully.
What ...
1
vote
0
answers
287
views
Kafka Connect's Debezium Connector for DB2 not capturing column with XML data type
I have a local instance of a kafka connect cluster. I am trying to replicate data from a DB2(v11.5.8.0) database. I am using debezium connector(v2.2.1) to capture a table in DB2 that contains a column ...
0
votes
0
answers
826
views
Change data capture in a read-only replica
I have a SQL Server instance, and a read-only replica of that instance that is used for ETL and analytics pipelines.
The source instance has change data capture (CDC) enabled.
What are best practices ...