TapData Cloud Release Notes
TapData Cloud offers you cloud services that are suitable for scenarios requiring rapid deployment and low initial investment, helping you focus more on business development rather than infrastructure management. Free trial with TapData Cloud.To enhance the user experience, TapData Cloud continuously enriches and optimizes product features and rectifies known defects by releasing new versions. This article provides an update log for TapData Cloud, helping you grasp the new feature specifications more effectively.
2024-11-29β
Enhancementsβ
- Enabled copy all selected table names during task configuration, improving operational efficiency.
- Expanded the range of built-in error codes for better issue identification and diagnosis.
- Enhanced milestone tracking and display logic during task execution.
- Improved log viewing experience for script processing nodes by supporting split log display.
Bug Fixesβ
- Fixed an issue where syncing PostgreSQL to SQL Server failed to sync newly added partitioned child tables if the parent tableβs partitions were not created before task execution.
- Resolved an issue where MongoDB indexes were not correctly loaded, causing schema loading failures.
- Fixed an issue where data extraction tasks could get stuck at the table structure replication stage.
2024-11-15β
New Featuresβ
- Added support for real-time synchronization of PostgreSQL partitioned tables to SQL Server.
Enhancementsβ
- Expanded the range of built-in error codes for faster issue identification and diagnosis.
Bug Fixesβ
- Fixed an issue where resetting tasks on the edit page failed, causing a βcurrent status not allowedβ error when saving the task.
- Resolved an issue where removing and re-adding a table being synchronized in a replication task failed to resume synchronization correctly.
2024-10-30β
New Featuresβ
- Added HTTPS connection support for Elasticsearch data sources, enhancing data transmission security to meet more stringent data security and compliance requirements.
- Enabled support for synchronizing tables without primary keys by adding a hash field (default name:
_no_pk_hash
), ensuring data consistency and stable synchronization in non-primary key scenarios.
Enhancementsβ
- Enhanced data filtering logic in Row Filter nodes, ensuring that target data is updated to maintain consistency when data status changes from meeting to not meeting filter conditions.
- Improved support for handling changes in composite primary keys in Oracle.
Bug Fixesβ
- Fixed an issue preventing the display of all tables (completed, in progress, and not started) in full sync details.
- Corrected inaccuracies in time and milestone statistics.
- Resolved an issue with MongoDB Atlas functionality when DNS resolution fails.
2024-10-17β
New Featuresβ
- Kafka-Enhanced and TiDB have passed the TapData certification testing process and have been upgraded to Certified Data Sources, providing more advanced features and enhanced production stability.
Enhancementsβ
- Added a Multi-threaded CT Table Polling option to improve incremental data collection performance for SQL Server environments with a large number of tables (over 500), significantly increasing synchronization efficiency.
- Optimized the cache management logic for processing nodes, enhancing resource usage efficiency and improving task execution speed.
- Introduced an automatic retry mechanism for Oracle LogMiner errors caused by exceeding PGA limits, improving fault tolerance.
Bug Fixesβ
- Fixed an issue where, after enabling the heartbeat table, tasks displayed no delay but data was not synchronized.
- Fixed an issue where not all tags could be viewed when setting tags.
- Fixed an issue where the task retry start time was incorrectly displayed as 1970.
- Fixed an issue where index creation failed when Elasticsearch was used as the target database.
2024-10-10β
New Featuresβ
- Doris, ClickHouse, KingBaseES-R6, PostgreSQL, SQL Server, and MongoDB have passed the TapData certification testing process and have been upgraded to Certified Data Sources, providing more advanced features and enhanced production stability.
- When using PostgreSQL as a source, it is now possible to specify the time point for incremental data in task settings.
Enhancementsβ
- When configuring an Elasticsearch data source, the task setup now allows you to select an update strategy for data writing.
- For data replication tasks, the source node's table selection defaults to primary key tables, with an added prompt message.
Bug Fixesβ
- Fixed an issue where tasks would encounter errors during the incremental phase after enabling the heartbeat table in new tasks.
- Fixed the issue where tasks got stuck in the full phase and could not move to the incremental phase after a reset.
2024-09-20β
New Featuresβ
- MySQL has passed the TapData certification testing process, upgrading it to a certified data source, providing more comprehensive features and enhanced production stability.
- Added a form-based mode for building replication tasks, simplifying the task creation process and improving operational convenience.
Enhancementsβ
- Added a new sorting feature for mining tasks based on today's mined volume, making task management and filtering more convenient.
Bug Fixesβ
- Fixed an issue where regular indexes were not properly synchronized after enabling the Sync Indexes on Table Creation option, ensuring data synchronization integrity.
2024-08-21β
New Featuresβ
- Oracle, Dameng, and Db2 have passed the TapData certification testing process and have been upgraded to Certified Data Sources, offering richer features and higher production stability.
- Added traffic billing feature for fully managed instances, supporting traffic bill viewing and payment, enabling users to easily monitor traffic usage and manage bills conveniently.
- For PostgreSQL data sources, incremental data synchronization is now supported using the walminer plugin, catering to more use cases.
- Data replication tasks now support reading from multiple tables simultaneously, improving parallel processing capabilities and task execution efficiency.
Enhancementsβ
- Significantly enhanced data synchronization performance.
- Improved error messages and high-risk operation warnings.
- For data sources that do not support hash validation, hash validation is now disabled by default.
- After full sync tasks are completed, restarting the task will trigger a full resynchronization to ensure data consistency.
- The Agent deployment page now includes a network whitelist configuration guide, making it easier to configure communication between the Agent and the management console.
Bug Fixesβ
- Fixed an issue where some task monitoring metrics were lost after task completion.
- Fixed a query efficiency issue caused by missing necessary indexes in the intermediate database, reducing data scan volume.
- Fixed an issue where selecting "Show only different fields" when downloading data validation discrepancies resulted in downloading all fields.
- Fixed an issue where the old engine name still appeared in task settings after changing the engine name in cluster management.
- Fixed a problem where task editing could get stuck during model generation, improving the task editing experience.
- Fixed an issue where large-scale Agents could not start due to insufficient memory on low-configuration servers.
- Fixed possible OOM error problems with Agents, enhancing memory management and stability.
- Fixed an issue where full sync tasks in the cloud version sometimes got stuck in a running state, improving task execution smoothness.
- Fixed an issue where editing an API showed a duplicate name warning.
- Fixed an issue where, after stopping a data replication task in the incremental phase and restarting it, the full completion time displayed incorrectly.
- Fixed an issue with TDengine where SQL statement length exceeded limits when writing to super tables with many fields.
- Fixed an error occurring in data transformation tasks using TDengine as a source when the table name contained Chinese characters.
- Fixed potential exceptions when running mining tasks on PostgreSQL data sources.
- Fixed an issue in Oracle to Doris shared mining tasks where source table DDL events could not be parsed.
- Fixed issues with inserting and deleting operations when syncing Oracle to PostgreSQL for tables without primary keys, enhancing synchronization reliability.
- Fixed specific exception issues during the incremental phase of MongoDB to Kafka data transformation tasks.
- Fixed an issue where an unexpected
_id
field appeared in the model when synchronizing MongoDB oplog to Kafka. - Fixed an issue where MongoDB oplog data replication tasks could not replicate properly during synchronization.
New Featuresβ
- Oracle, Kafka, and Db2 have completed the TapData certification testing process, upgraded to GA-level data sources, offering enhanced capabilities and production stability.
- Added traffic billing view and payment features in the cloud version.
Enhancementsβ
- Optimized the layout structure of the menu entries.
- Improved error messages and risk warnings for high-risk operations.
- Significantly improved data synchronization performance.
- Optimized memory allocation logic during Agent startup.
Bug Fixesβ
- Fixed the issue where some monitoring metrics were lost after task completion.
- Fixed potential runtime issues in PostgreSQL data source mining tasks.
- Fixed the issue where large-scale Agents on low-configuration servers might fail to start due to insufficient memory.
- Fixed the issue where full data synchronization tasks remained in a running state for an extended period.
2024-08-06β
New Featuresβ
- Enhanced Data Transformation Task Configuration to support reloading of single table models in the source node model preview area, improving loading efficiency.
- Introduced time detection functionality that automatically detects the time difference between the engine deployment server and the database server and displays it on the task monitoring page.
Optimizationsβ
- User-defined field business descriptions can now be directly displayed in the column name position of the table sample data.
Bug Fixesβ
- Fixed an issue where some table data counts in the real-time data platform were empty.
- Fixed an issue where the host was not displayed in the path when publishing APIs in the real-time data platform.
- Fixed an issue where MongoDB database cursor timeout prevented normal full synchronization.
- Fixed an issue where the custom SQL filter switch could not be turned on in the source node data filtering settings.
- Fixed an formatting error in email alerts for the fully managed Agent.
2024-07-20β
New Featuresβ
- Added a Union Node to data replication tasks, enabling the merging (UNION) of multiple tables within the same database. This is useful for data integration and analysis scenarios.
- Doris data source now supports certificate-free HTTPS connections.
- MySQL, Oracle, OpenGauss, SQL Server, and PostgreSQL data sources now support enabling the Hash Sharding feature in the advanced settings of nodes during task configuration, significantly improving the full data sync speed for large tables.
- Added support for VastBase data source, with a maturity level of Beta, further enriching the variety of data sources.
Enhancementsβ
- Improved synchronization logic for time zone fields.
- Optimized the display logic of continuous log mining settings, automatically hiding related buttons if the database version does not support this feature.
Bug Fixesβ
- Fixed an issue with incremental sync delay display in MongoDB sync tasks that used shared mining.
- Addressed the unclear error messages and lack of detailed information in the error codes when the source MySQL does not support incremental.
- Corrected the format of task warning alerts.
- Resolved an issue where imported tasks showed running records and the current running record status appeared as "deleting."
- Fixed an issue with incorrect display when renaming tables in FDM replication tasks.
- Addressed an issue where editing tasks incorrectly modified the association key when the target table association key was set.
- Fixed a potential failure when removing fields in Python nodes.
- Resolved an issue where deleting the primary node in master-slave merge operations caused configuration errors in the master-slave merge node, leading to task errors.
- Corrected an issue where creating an application in application management incorrectly prompted that the tag name already existed.
- Fixed garbled text issue with Chinese node names in tasks when a source-side DDL occurs and the engine server is not set to UTF character encoding.
2024-07-05β
Enhancementsβ
- Optimized features in the Real-Time Data Hub:
- The data processing layer now displays all models in the database.
- The platform cache layer and platform processing layer can be configured with different connections, which cannot be adjusted after setting.
- Added an API publishing entry.
- Improved the display of model details.
- Added field restriction configuration parameters for the ElasticSearch data source.
- Optimized exception handling logic when enabling the preimage capability for the MongoDB data source.
Bug Fixesβ
- Fixed an issue where some task event statistics might occasionally be missing when reported.
- Fixed an issue where shared cache tasks without shared mining might encounter errors due to exceeding the log time window if data does not change upon restarting or upgrading the engine.
- Fixed an issue where disabling the slave node under the MDM_model led to task startup failures.
- Fixed an issue where the lineage graph in the real-time data hub occasionally failed to display.
- Fixed an issue where the unset operation on the source table could cause task errors in scenarios where the write mode is updating sub-documents.
- Fixed an issue where joining collections with time types in MongoDB and MySQL caused errors.
- Fixed an issue where tasks created in the real-time data hub could not add master-slave merge nodes.
- Fixed an issue where incremental update events unexpectedly performed lookups in master-slave merge scenarios.
- Fixed conflict errors when modifying columns in master-slave merge nodes.
2024-06-21β
New Featuresβ
- Enhanced TiDB data source capabilities with support for real-time incremental synchronization.
Enhancementsβ
- Improved the display of primary keys and indexes in the task's table model.
- Enhanced the model deduction logic, supporting model deduction directly in the engine.
Bug Fixesβ
- Fixed an issue where some exceptions were ignored during data source error handling.
- Fixed an issue where aggregation tasks using time fields as join keys could not backtrack data.
- Fixed an issue with delayed times in mining tasks.
- Fixed an issue where MySQL as a source would consume a large amount of database memory during initial synchronization of large tables.
2024-06-07β
New Featuresβ
- Introduced Mock Source and Mock Target data sources for data migration testing scenarios.
Enhancementsβ
- Improved the interaction logic for skipping errors when starting tasks.
- Improved the loading speed of the connection list.
Bug Fixesβ
- Fixed inconsistencies between the task runtime model and configuration model.
- Fixed inaccurate task event statistics after filtering source data.
- Fixed timezone handling issues in Oracle and PostgreSQL synchronization scenarios.
- Fixed an issue where heartbeat task reset failures could prevent related tasks from starting.
2024-05-21β
New Featuresβ
- Added support for dynamically generating date suffixes for target table names when configuring data transformation tasks, suitable for daily batch processing scenarios.
- Added support for setting partitions when configuring Doris data sources.
- Added support for the Oracle mode of OceanBase data sources, with the data source name OceanBase(Oracle).
Enhancementsβ
- Optimized data handling logic when syncing MongoDB to relational databases (e.g., MySQL).
- Enhanced the Dummy data source to support quickly adding large fields for performance testing scenarios.
Bug Fixesβ
- Fixed an issue where MariaDB could not write data in the
0000-00-00 00:00:00
format to the target. - Fixed an issue where heartbeat tasks could not automatically recover after the heartbeat table was mistakenly deleted.
- Fixed an issue where shared extraction tasks could not be serialized after an error occurred.
2024-05-06β
New Featuresβ
- Support for bidirectional data synchronization between MySQL instances and between PostgreSQL instances, better meeting the needs of active-active and disaster recovery scenarios.
- Support for importing files from MongoDB Relmig version 1.3.0 and above, further enhancing ecosystem integration capabilities.
- Support for synchronizing MongoDB Oplog (operation log) data.
- Support for filtering the time field of tables in the source nodeβs Advanced Settings when configuring data transformation tasks (e.g., relative dates).
- Display milestone information for tasks on the Task List page, helping users quickly understand key progress statuses.
Enhancementsβ
- Improved Unwind Node functionality, allowing users to set expansion modes, such as Embedded Objects or Flatten Fields.
- Enhanced full synchronization detail page display, supporting quick table name filtering.
Bug Fixesβ
- Fixed an issue where adjusting alarm settings could affect normal task operations in certain scenarios.
- Fixed an issue where adding a new digging table caused the task to display digging task errors.
2024-04-26β
New Featuresβ
- Data replication tasks now support table-level checkpoint resumption, allowing tasks to continue syncing from the last incomplete table upon restart.
- Added the ability to quickly set task labels by dragging and dropping.
- Added support for MySQL replica architecture, ensuring tasks continue to sync data normally after a failover event.
Enhancementsβ
- The Windows version of the Cloud Agent now includes digital certificate signing to avoid installation delays caused by system security prompts.
- Improved the User Center page layout.
Bug Fixesβ
- Fixed an issue where tasks were failing with Aliyun PolarDB MySQL data sources due to unsupported event types.
- Corrected a statistical progress display error in the completion metrics of full data synchronization tasks.
2024-04-12β
New Featuresβ
- Added support for real-time log parsing of TiDB data sources, fulfilling incremental data synchronization needs.
- During the full sync phase from Oracle to MySQL, support has been added for syncing unique and normal indexes that do not utilize functions.
- Enhanced the task start process to include an option to skip errors encountered during the last run.
Enhancementsβ
- Improved DDL synchronization settings in data sync tasks by allowing users to configure DDL statements to ignore (based on regular expressions) when DDL errors occur.
- Enhanced data verification capabilities to support tasks that include processing nodes.
- Optimized the data verification results page to quickly filter between consistent and inconsistent tables.
Bug Fixesβ
- Fixed an issue where MongoDB used as external storage failed when storing values in a Map format with keys containing the
.
character. - Addressed a looping error that occurred during connection tests for Kafka data sources containing non-JSON topics.
- Resolved a bug where JS nodes reported errors during trial runs under specific conditions.
- Fixed an issue with incorrect data results when changing join keys in master-slave merge nodes.
- Fixed a problem where using RocksDB as cache storage could cause task errors.
2024-03-29β
Enhancementsβ
- To further enhance user experience, Beta and Alpha data sources now require an application for use, allowing TapData to provide better technical support based on your business scenarios.
Bug Fixesβ
- Resolved an issue where Agents crashed under specific circumstances.
- Fixed a bug related to importing RM files in MongoDB.
2024-03-08β
New Featuresβ
- Support for setting default alarm recipients, allowing customization of alarm receipt email addresses (supports multiple addresses).
- New options in DDL synchronization settings: Stop Task on DDL Error and Automatically Ignore All DDLs, catering to different business scenario needs.
- Added a time field injection node, allowing the addition of a custom timestamp field to data during synchronization. This provides a more flexible way to capture incremental changes from the source database.
Enhancementsβ
- Optimized task retry logic and interface prompt information.
- Enhanced the setting for incremental collection timing, supporting quick selection of the incremental time point from the last incremental run.
- Improved the interaction logic for using external storage with the master-slave merge node.
2024-01-26β
New Featuresβ
- Added support for Shared Mining, allowing multiple tasks to share incremental logs from the source database, thus avoiding redundant reads and significantly reducing the load on the source database during incremental synchronization.
- The Shared Mining feature now supports using RocksDB as local external storage to extend storage for incremental logs.
Enhancementsβ
- Improved the onboarding process for users from the Google Cloud Marketplace.
- Added a time filter option for the incremental phase in the Task Monitoring Page to quickly observe RPS (Records Per Second) during the incremental phase.
- Added warning messages for critical operations that might impact the database (e.g., filtering source table data).
- Refined the logic for unsubscribing after instance subscription expiration.
Bug Fixesβ
- Fixed an issue with the Primary-Secondary Merge Node where changes in the key conditions between the primary and secondary tables resulted in data not matching expectations.
2024-01-12β
New Featuresβ
- Added support for Capped Collections in data synchronization between MongoDB database.
- Data replication/transformation tasks now have import capabilities. Design your data flow process on MongoDB Relational Migrator, export it, and then directly import it into TapData data pipelines from the top right corner, enhancing the convenience of data pipeline design.
Enhancementsβ
- Enhanced the new user onboarding process, including the ability to collapse prompts and return to previous steps.
Bug Fixesβ
- Fixed an issue where JS node model declaration settings showed incorrect prompts on the task editing page.
- Fixed an issue where the DROP COLUMN operation in Oracle to MySQL synchronization was not syncing correctly.
- Addressed an issue causing DDL errors when syncing from MySQL to ClickHouse.
- Fixed instability in tasks due to frequent WebSocket reconnections.
- Corrected several UI interaction experience issues.
2023-12-26β
New Featuresβ
- Added support for Time Series collections in MongoDB 5.x and above versions.
- Added support for preImage in MongoDB 6.x and above versions.
Enhancementsβ
- Improved system prompts when enabling scheduled tasks while reaching the task limit.
Bug Fixesβ
- Fixed inaccuracies in checkpoints in multi-table data replication scenarios.
- Resolved issues with unsubscribed and deleted Agent instances continuing to report heartbeat information.
- Addressed known UI interaction experience issues.
2023-12-08β
New Featuresβ
- Added Azure Cosmos DB as a new data source, enabling full data synchronization to facilitate quick cloud data transfers.
Enhancementsβ
- Upgraded data source connections, with SQL Server now supporting SSL connections, enhancing data security.
- Optimized field type adjustments in data replication tasks, allowing for direct selection of common types from the target database.
- Improved task source node settings, enabling customization of the number of rows read per batch in the incremental phase, catering to performance needs of incremental synchronization.
Bug Fixesβ
- Addressed issues with enhanced JS nodes failing or causing exceptions under certain scenarios.
- Corrected several UI interaction experience issues for better usability.
2023-11-24β
New Featuresβ
- Support for loading table comments on Oracle data sources, which can be enabled in the Advanced Settings when configuring the data source. This makes it easier to quickly identify the business meaning of tables through their comments.
- In the task monitoring page, support viewing RPS (Records Per Second) information based on the size of events.
Enhancementsβ
- Enhanced the display effects of resource management and the subscription center pages.
- When performing data source connection tests, support for displaying connector download progress is now available, helping to quickly grasp connection progress and pinpoint timeout issues.
Bug Fixesβ
- Fixed an issue where incremental information was not successfully cleared after resetting and rerunning a task.
- Fixed an issue where some SaaS data sources displayed incremental timestamps during full data synchronization.
Enhancementsβ
Bug Fixesβ
2023-11-03β
Enhancementsβ
- Enhanced Data Source Connection methods, supporting SSL connections for data sources like MySQL, PostgreSQL, Kafka, TiDB, MariaDB, etc., to further enhance data security.
- Improved user interface interaction logic.
- To better manage data duplication for updates on non-primary keys, TapData Cloud now supports creating unique indexes.
Bug Fixesβ
- Fixed an issue where data synchronization could fail when table names contain
.
. - Fixed an issue where task exception messages did not include table names.
- Fixed an issue with incorrect judgment of task quotas and task count limits when specifying an Agent for a task.
2023-10-20β
New Featuresβ
- Added support for automatically creating sharded collections when MongoDB Cluster is set as the target.
- Add support for Unwind Processing Node, which can help you efficiently "unwind" each element in an array, converting each element into independent data rows.
- Added support for disabling node capabilities when configuring tasks. You can access this feature by hovering over a node, which can help reduce the cost of data flow during processing.
Enhancementsβ
- When configuring data replication tasks, you can now quickly filter tables with or without primary keys through the "Selectable table range" dropdown. Tables with primary keys include those without primary keys but with unique indexes.
- Added a Demo data source to the onboarding guide flow for new users, helping you quickly complete the tutorial and set up your first data flow task.
- Optimized the front-end display effects of operation buttons on the engine interface.
Bug Fixesβ
- Fixed an issue where an error occurred in MongoDB as a target during an INSERT operation when there was no shard key.
- Fixed an issue where MongoDB did not support REPLACE properly, and the fields deleted by REPLACE could not be properly removed.
2023-10-08β
New Featuresβ
- Introduced the Create Materialized View feature for swift construction of real-time data models.
- Added capability to fetch read-only access information of subscribed MongoDB Atlas.
- Kafka data source now supports settings for replication factor and partition count.
- For synchronization between MongoDB instances, added support for
$unset
operations.
Enhancementsβ
- During the task guidance process, when creating a connection for a fully managed Agent, instructions about the public IP address of the fully managed Agent have been added.
- Enabled rapid target node location through node search at the top of the data replication/data transformation configuration page.
Bug Fixesβ
- Fixed an issue where the wrong category of operation logs was recorded when restarting the Agent via the webpage.
2023-09-20β
New Featuresβ
- Added Python processing node, which supports customizing data processing logic through Python scripts. This offers improved performance compared to the JS processing node.
- Added a "Contact Us" entry point, making it easier for users to quickly reach out to technical support when faced with issues.
Feature Improvementsβ
- Enhanced error codes for data sources, covering more scenarios and providing solutions.
- While setting up email alert notifications, added guidance for binding email addresses.
- Improved reminders and easy upgrade guide for when the task count reaches its limit.
2023-08-28β
New Featuresβ
- Introduced the Primary-Secondary Merge Node, enabling quick construction and real-time updates of wide tables, assisting you in achieving better data analysis.
- Real-Time Data Hub now offer a storage instances for free trial, with more new specifications available, including M10, M20, and M30.
- Added support for connecting existing MongoDB Atlas instances as data storage for the Real-Time Data Hub.
Feature Improvementsβ
- Changed the display of help documentation on the right side during data source connection to embedded online documentation, assisting users in accessing the most recent help information.
- For core data sources (such as Oracle, PostgreSQL, etc.), improved the page parameter descriptions and guidance when creating connections.
Bug Fixesβ
- Fixed the issue where users couldn't view the monitoring page for previously run tasks.