TapData Enterprise Release Notes
TapData Enterprise can be deployed in your local data center, making it suitable for scenarios with strict requirements on data sensitivity or network isolation. It can serve to build real-time data warehouses, enable real-time data exchange, data migration, and more.3.18.0β
Enhancementsβ
- Added a built-in help documentation link at the top of the page, allowing quick access to usage guides in network-isolated environments.
- Improved data validation logic to support comparisons between Boolean values and numeric 0/1, ensuring accuracy in heterogeneous data source synchronization scenarios.
- Enabled copy all selected table names during task configuration, improving operational efficiency.
- Expanded the range of built-in error codes for better issue identification and diagnosis.
- Enhanced milestone tracking and display logic during task execution.
- Improved log viewing experience for script processing nodes by supporting split log display.
Bug Fixesβ
- Fixed an issue where syncing PostgreSQL to SQL Server failed to sync newly added partitioned child tables if the parent tableβs partitions were not created before task execution.
- Resolved an issue where MongoDB indexes were not correctly loaded, causing schema loading failures.
- Fixed an issue where data extraction tasks could get stuck at the table structure replication stage.
3.17.0β
New Featuresβ
- Added support for real-time synchronization of PostgreSQL partitioned tables to SQL Server.
Enhancementsβ
- Expanded the range of built-in error codes for faster issue identification and diagnosis.
Bug Fixesβ
- Fixed an issue where resetting tasks on the edit page failed, causing a βcurrent status not allowedβ error when saving the task.
- Resolved an issue where removing and re-adding a table being synchronized in a replication task failed to resume synchronization correctly.
3.16.0β
New Featuresβ
- Added HTTPS connection support for Elasticsearch data sources, enhancing data transmission security to meet more stringent data security and compliance requirements.
- Enabled support for synchronizing tables without primary keys by adding a hash field (default name:
_no_pk_hash
), ensuring data consistency and stable synchronization in non-primary key scenarios.
Enhancementsβ
- Enhanced data filtering logic in Row Filter nodes, ensuring that target data is updated to maintain consistency when data status changes from meeting to not meeting filter conditions.
- Improved support for handling changes in composite primary keys in Oracle.
Bug Fixesβ
- Fixed an issue preventing the display of all tables (completed, in progress, and not started) in full sync details.
- Corrected inaccuracies in time and milestone statistics.
- Resolved an issue with MongoDB Atlas functionality when DNS resolution fails.
3.15.0β
New Featuresβ
- Kafka-Enhanced and TiDB have passed the TapData certification testing process and have been upgraded to Certified Data Sources, providing more advanced features and enhanced production stability.
Enhancementsβ
- Added a Multi-threaded CT Table Polling option to improve incremental data collection performance for SQL Server environments with a large number of tables (over 500), significantly increasing synchronization efficiency.
- Optimized the cache management logic for processing nodes, enhancing resource usage efficiency and improving task execution speed.
- Introduced an automatic retry mechanism for Oracle LogMiner errors caused by exceeding PGA limits, improving fault tolerance.
Bug Fixesβ
- Fixed an issue where, after enabling the heartbeat table, tasks displayed no delay but data was not synchronized.
- Fixed an issue where not all tags could be viewed when setting tags.
- Fixed an issue where the task retry start time was incorrectly displayed as 1970.
- Fixed an issue where index creation failed when Elasticsearch was used as the target database.
- Fixed an issue in verification tasks where a full re-verification was triggered when there was a discrepancy in the comparison results.
- Fixed an issue where the detailed download was empty when verification tasks showed inconsistent results in the associated key verification.
3.14.0β
New Featuresβ
- Doris, ClickHouse, KingBaseES-R6, PostgreSQL, SQL Server, and MongoDB have passed the TapData certification testing process and have been upgraded to Certified Data Sources, providing more advanced features and enhanced production stability.
- Support for user login authentication via LDAP integration with Active Directory (AD), enabling unified user identity management.
- When using PostgreSQL as a source, it is now possible to specify the time point for incremental data in task settings.
Enhancementsβ
- When configuring an Elasticsearch data source, the task setup now allows you to select an update strategy for data writing.
- For data replication tasks, the source node's table selection defaults to primary key tables, with an added prompt message.
Bug Fixesβ
- Fixed an issue where tasks would encounter errors during the incremental phase after enabling the heartbeat table in new tasks.
- Fixed the issue where only one page of data validation tasks was displayed in the task list.
- Fixed the issue where tasks got stuck in the full phase and could not move to the incremental phase after a reset.
- Fixed the issue where the shared mining source node would automatically refresh and deselect tables after selection.
3.13.0β
New Featuresβ
- MySQL has passed the TapData certification testing process, upgrading it to a certified data source, providing more comprehensive features and enhanced production stability.
Enhancementsβ
- Optimized the performance of the API monitoring page, significantly increasing access speed.
- Added a new sorting feature for mining tasks based on today's mined volume, making task management and filtering more convenient.
Bug Fixesβ
- Fixed an issue where regular indexes were not properly synchronized after enabling the Sync Indexes on Table Creation option, ensuring data synchronization integrity.
- Fixed an issue where the
admin
user lost permissions after changing the username, ensuring proper permission management. - Fixed an issue where a success message was returned despite the failure of sending test emails, improving operational feedback accuracy.
3.12.0β
New Featuresβ
- Oracle, Dameng, and Db2 have passed the TapData certification testing process and have been upgraded to Certified Data Sources, offering richer features and higher production stability.
- When configuring alert recipient email, support for using proxy services has been added, allowing for timely alert notifications even in restricted network environments.
- For PostgreSQL data sources, incremental data synchronization is now supported using the walminer plugin, catering to more use cases.
- Data replication tasks now support reading from multiple tables simultaneously, improving parallel processing capabilities and task execution efficiency.
- Added support for batch API publishing, simplifying multi-interface management and enhancing publishing efficiency.
Feature Enhancementsβ
- Significantly enhanced data synchronization performance.
- Optimized the layout and structure of menu entries.
- Improved error messages and high-risk operation warnings.
- For data sources that do not support hash validation, hash validation is now disabled by default.
- After full sync tasks are completed, restarting the task will trigger a full resynchronization to ensure data consistency.
Bug Fixesβ
- Fixed an issue where some task monitoring metrics were lost after task completion.
- Fixed a query efficiency issue caused by missing necessary indexes in the intermediate database, reducing data scan volume.
- Fixed an issue where selecting "Show only different fields" when downloading data validation discrepancies resulted in downloading all fields.
- Fixed an issue where the old engine name still appeared in task settings after changing the engine name in cluster management.
- Fixed a problem where task editing could get stuck during model generation, improving the task editing experience.
- Fixed possible OOM error problems with Agents, enhancing memory management and stability.
- Fixed an issue where full sync tasks in the cloud version sometimes got stuck in a running state, improving task execution smoothness.
- Fixed an issue where editing an API showed a duplicate name warning.
- Fixed an issue where, after stopping a data replication task in the incremental phase and restarting it, the full completion time displayed incorrectly.
- Fixed an issue with TDengine where SQL statement length exceeded limits when writing to super tables with many fields.
- Fixed an error occurring in data transformation tasks using TDengine as a source when the table name contained Chinese characters.
- Fixed potential exceptions when running mining tasks on PostgreSQL data sources.
- Fixed an issue in Oracle to Doris shared mining tasks where source table DDL events could not be parsed.
- Fixed issues with inserting and deleting operations when syncing Oracle to PostgreSQL for tables without primary keys, enhancing synchronization reliability.
- Fixed specific exception issues during the incremental phase of MongoDB to Kafka data transformation tasks.
- Fixed an issue where an unexpected
_id
field appeared in the model when synchronizing MongoDB oplog to Kafka. - Fixed an issue where MongoDB oplog data replication tasks could not replicate properly during synchronization.
3.11.0β
New Featuresβ
- Added table name and API address display functionality in the Service Management List Page, supporting quick search and filtering by keywords.
- Enhanced Data Transformation Task Configuration to support reloading of single table models in the source node model preview area, improving loading efficiency.
- Introduced time detection functionality that automatically detects the time difference between the engine deployment server and the database server and displays it on the task monitoring page.
Enhancementsβ
- User-defined field business descriptions can now be directly displayed in the column name position of the table sample data.
Bug Fixesβ
- Fixed an "System error: null" issue that occurred after importing APIs.
- Fixed an issue where deleted APIs were still shown in the lineage graph.
- Fixed an issue where some table data counts in the real-time data platform were empty.
- Fixed an issue where the host was not displayed in the path when publishing APIs in the real-time data platform.
- Fixed an issue where MongoDB database cursor timeout prevented normal full synchronization.
- Fixed an issue where the custom SQL filter switch could not be turned on in the source node data filtering settings.
3.10.0β
New Featuresβ
- Data Verification feature now allows downloading detailed discrepancy data from the verification task details page for in-depth analysis.
- Added a Union Node to data replication tasks, enabling the merging (UNION) of multiple tables within the same database. This is useful for data integration and analysis scenarios.
- Doris data source now supports certificate-free HTTPS connections.
- MySQL, Oracle, OpenGauss, SQL Server, and PostgreSQL data sources now support enabling the Hash Sharding feature in the advanced settings of nodes during task configuration, significantly improving the full data sync speed for large tables.
- Added support for VastBase data source, with a maturity level of Beta, further enriching the variety of data sources.
Enhancementsβ
- Improved synchronization logic for time zone fields.
- In data sync/development task settings, users can now choose the scheduling method as random scheduling or priority scheduling of Agents, in addition to tag-based Agent scheduling.
- Optimized the display logic of continuous log mining settings, automatically hiding related buttons if the database version does not support this feature.
Bug Fixesβ
- Fixed an issue with incremental sync delay display in MongoDB sync tasks that used shared mining.
- Addressed the unclear error messages and lack of detailed information in the error codes when the source MySQL does not support incremental.
- Corrected the format of task warning alerts.
- Resolved an issue where imported tasks showed running records and the current running record status appeared as "deleting."
- Fixed an issue with incorrect display when renaming tables in FDM replication tasks.
- Addressed an issue where editing tasks incorrectly modified the association key when the target table association key was set.
- Fixed a potential failure when removing fields in Python nodes.
- Resolved an issue where deleting the primary node in master-slave merge operations caused configuration errors in the master-slave merge node, leading to task errors.
- Corrected an issue where creating an application in application management incorrectly prompted that the tag name already existed.
- Fixed garbled text issue with Chinese node names in tasks when a source-side DDL occurs and the engine server is not set to UTF character encoding.
3.9.0β
New Featuresβ
- Data Verification now includes differential data repair capabilities, enhancing data consistency and accuracy.
- Added a new button for using CDC log Caching when creating Live Cache, simplifying cache task configuration and improving the efficiency and flexibility of cache sharing.
Enhancementsβ
- Optimized features in the Real-Time Data Hub:
- The data processing layer now displays all models in the database.
- The platform cache layer and platform processing layer can be configured with different connections, which cannot be adjusted after setting.
- Added an API publishing entry.
- Improved the display of model details.
- Added field restriction configuration parameters for the ElasticSearch data source.
- Optimized exception handling logic when enabling the preimage capability for the MongoDB data source.
Bug Fixesβ
- Fixed an issue where some task event statistics might occasionally be missing when reported.
- Fixed an issue where shared cache tasks without shared mining might encounter errors due to exceeding the log time window if data does not change upon restarting or upgrading the engine.
- Fixed an issue where disabling the slave node under the MDM_model led to task startup failures.
- Fixed an issue where the lineage graph in the real-time data hub occasionally failed to display.
- Fixed an issue where the unset operation on the source table could cause task errors in scenarios where the write mode is updating sub-documents.
- Fixed an issue where the verification task search function was unavailable.
- Fixed an issue where joining collections with time types in MongoDB and MySQL caused errors.
- Fixed an issue where tasks created in the real-time data hub could not add master-slave merge nodes.
- Fixed an issue where incremental update events unexpectedly performed lookups in master-slave merge scenarios.
- Fixed conflict errors when modifying columns in master-slave merge nodes.
3.8.0β
New Featuresβ
- Enhanced TiDB data source capabilities with support for real-time incremental synchronization.
- Data Validation now supports automatic difference checking, allowing real-time tasks to automatically perform difference checks based on incremental delay.
Enhancementsβ
- Improved the display of primary keys and indexes in the task's table model.
- Enhanced the model deduction logic, supporting model deduction directly in the engine.
Bug Fixesβ
- Fixed an issue where some exceptions were ignored during data source error handling.
- Fixed an issue where aggregation tasks using time fields as join keys could not backtrack data.
- Fixed an issue with delayed times in mining tasks.
- Fixed an issue where MySQL as a source would consume a large amount of database memory during initial synchronization of large tables.
3.7.0β
New Featuresβ
- Added support for granting data verification permissions to users, enhancing permission management granularity.
- Introduced Mock Source and Mock Target data sources for data migration testing scenarios.
Enhancementsβ
- Improved the interaction logic for skipping errors when starting tasks.
- Improved the loading speed of the connection list.
Bug Fixesβ
- Fixed inconsistencies between the task runtime model and configuration model.
- Fixed inaccurate task event statistics after filtering source data.
- Fixed timezone handling issues in Oracle and PostgreSQL synchronization scenarios.
- Fixed an issue where heartbeat task reset failures could prevent related tasks from starting.
3.6.0β
New Featuresβ
- Added support for dynamically generating date suffixes for target table names when configuring data transformation tasks, suitable for daily batch processing scenarios.
- Added support for integrating with third-party platforms via Webhook to enable more alert notification channels.
- Added support for performing Hash validation between MySQL, Oracle, SQL Server, PostgreSQL, and GaussDB data sources when configuring data validation tasks, improving validation efficiency.
- Added support for setting partitions when configuring Doris data sources.
- Added support for the Oracle mode of OceanBase data sources, with the data source name OceanBase(Oracle).
Enhancementsβ
- Optimized data handling logic when syncing MongoDB to relational databases (e.g., MySQL).
- Enhanced the Dummy data source to support quickly adding large fields for performance testing scenarios.
Bug Fixesβ
- Fixed an issue where MariaDB could not write data in the
0000-00-00 00:00:00
format to the target. - Fixed an issue where heartbeat tasks could not automatically recover after the heartbeat table was mistakenly deleted.
- Fixed an issue where shared extraction tasks could not be serialized after an error occurred.
3.5.16β
New Featuresβ
- Support for bidirectional data synchronization between MySQL instances and between PostgreSQL instances, better meeting the needs of active-active and disaster recovery scenarios.
- Support for importing files from MongoDB Relmig version 1.3.0 and above, further enhancing ecosystem integration capabilities.
- Support for synchronizing MongoDB Oplog (operation log) data.
- Support for filtering the time field of tables in the source nodeβs Advanced Settings when configuring data transformation tasks (e.g., relative dates).
- Display milestone information for tasks on the Task List page, helping users quickly understand key progress statuses.
Enhancementsβ
- Improved Unwind Node functionality, allowing users to set expansion modes, such as Embedded Objects or Flatten Fields.
- Enhanced full synchronization detail page display, supporting quick table name filtering.
Bug Fixesβ
Fixed an issue where adjusting alarm settings could affect normal task operations in certain scenarios.
Fixed an issue where adding a new digging table caused the task to display digging task errors.
Fixed an issue where tasks would still retrieve data from deleted shared cache tasks after being restarted.
3.5.15β
New Featuresβ
- Data replication tasks now support table-level checkpoint resumption, allowing tasks to continue syncing from the last incomplete table upon restart.
- Added the ability to quickly set task labels by dragging and dropping.
- Added support for MySQL replica architecture, ensuring tasks continue to sync data normally after a failover event.
Bug Fixesβ
- Fixed an issue with Alibaba Cloud PolarDB MySQL data source, where tasks failed due to unsupported events.
- Fixed incorrect progress indicators in statistics for completed full replication tasks.
3.5.14β
New Featuresβ
- Support for assigning labels to sync governance services (Agents), allowing for subsequent assignment of tasks to agents with specific labels.
- Supported real-time log parsing for TiDB data sources, meeting the needs for incremental data synchronization.
- During the full synchronization phase from Oracle to MySQL, support for syncing unique indexes and regular indexes that do not utilize functions.
- Added the ability to skip errors encountered during the last run when starting tasks.
Enhancementsβ
- Optimized the data synchronization task scenario, allowing source nodes to configure DDL synchronization settings and specify DDL statements to ignore (based on regular expressions) in case of DDL errors.
- Enhanced data verification capabilities to support tasks that include processing nodes.
- Improved the data verification results page display, enabling quick filtering of consistent and inconsistent tables.
Bug Fixesβ
- Fixed an issue where MongoDB used as external storage and storing keys with dots (.) in strings, and values as maps, caused exceptions.
- Fixed a loop error when performing connection tests on Kafka data sources containing non-JSON topics.
- Fixed an error that occurred during trial runs of JS nodes in specific scenarios.
- Fixed incorrect data issues caused by changing key values in master-slave merge nodes.
- Fixed an issue where using RocksDB as cache storage could cause task errors.
3.5.13β
New Featuresβ
- When configuring data verification tasks, custom filtering based on time fields is now available for MongoDB aggregation queries.
- Supported hash verification for MySQL/Oracle homogeneous data source synchronization.
Bug Fixesβ
- Fixed an issue where DDL operations could not be synchronized normally in multi-threaded synchronization scenarios.
- Fixed an occasional issue where file handles were not released in time when JS nodes printed logs.
- Fixed an issue with the import of MongoDB RM files.
3.5.12β
New Featuresβ
- Support for sending email reminders one week before the license expires (once a day), which can be combined with configuring SMTP email services to enhance operational convenience.
- New options in DDL synchronization settings: Stop Task on DDL Error and Automatically Ignore All DDLs, catering to different business scenario needs.
- Added a time field injection node, allowing the addition of a custom timestamp field to data during synchronization. This provides a more flexible way to capture incremental changes from the source database.
- Support for setting the expiration time and size of engine logs, enabling automatic log cleanup.
Enhancementsβ
- Optimized task retry logic and interface prompt information.
- Enhanced the setting for incremental collection timing, supporting quick selection of the incremental time point from the last incremental run.
- Improved the interaction logic for using external storage with the master-slave merge node.
V3.5.11β
New Featuresβ
- Shared Mining functionality supports using RocksDB as local external storage for incremental log storage expansion.
- TDengine Connector supports using multiple databases as incremental sources.
Enhancementsβ
- Task Monitoring Page adds a time filter option for the incremental phase to quickly observe the RPS (Records Per Second) of the incremental phase.
- Added related prompt information for key operations that may affect the database (such as filtering source table data).
Bug Fixesβ
- Fixed the issue where the final data does not match the expectation when the primary and secondary table key conditions change in Primary-Secondary Merge Node.
V3.5.10β
New Featuresβ
- For data synchronization scenarios between MongoDBs, newly supports Capped Collections.
- Data replication/transformation tasks support import capability. You can design the data flow process on MongoDB Relational Migrator and export it, then directly import it in the TapData data pipeline, enhancing the convenience of data pipeline design.
Bug Fixesβ
- Fixed the issue where JS node model declaration settings showed an error in the task editing page.
- Fixed the issue where the DROP COLUMN operation was not properly synchronized when synchronizing from Oracle to MySQL.
- Fixed the problem of DDL errors when synchronizing from MySQL to ClickHouse.
- Fixed the issue of task instability caused by frequent reconnections of WebSocket.
- Fixed several UI interaction experience issues.
V3.5.9β
New Featuresβ
- For MongoDB data sources version 5.x and above, newly supports Time Series Collections.
- For MongoDB data sources version 6.x and above, newly supports preImage.
Bug Fixesβ
- Fixed the issue of inaccurate breakpoints in multiple table data replication scenarios.
- Fixed known UI interaction experience issues.
V3.5.8β
New Featuresβ
- Newly supports Azure Cosmos DB as a data source, capable of synchronizing full data from the source to help facilitate rapid data flow in the cloud.
Enhancementsβ
- Enhanced data source connection methods, SQL Server supports SSL connections to further enhance data security.
- Optimized the method of adjusting field types in data replication tasks; in addition to manual input, it now supports direct selection of common types from the target database.
- For the source node settings of the task, supports setting the number of records read per batch during the incremental phase to better meet the performance requirements of incremental synchronization.
Bug Fixesβ
- Fixed the issue where the enhanced JS node model declaration did not take effect or showed an exception in specific scenarios.
- Fixed several UI interaction experience issues.
V3.5.7β
New Featuresβ
- Supports loading table comments for Oracle data sources, which can be enabled in the advanced options during data source configuration, allowing quick identification of tables' business meanings through comments.
- Supports deployment of TapData on Windows platform, further expanding the range of supported deployment platforms.
- In the task operation monitoring page, supports viewing RPS (Records Per Second) information based on the dimension of event size.
Bug Fixesβ
- Fixed the issue where incremental information was not successfully cleared after resetting the task and rerunning it.
- Fixed the issue where the incremental time point was displayed in scenarios of full data synchronization for some SaaS data sources.
V3.5.6β
Enhancementsβ
- Optimized data source connections, with MySQL, PostgreSQL, Kafka, TiDB, MariaDB, etc., supporting SSL connections to further enhance data security.
- Enhanced the filtering function of data verification, supporting custom query and aggregation query filtering through SQL.
- Optimized interface interaction logic.
- For non-primary key update conditions, created a unique index to solve the problem of data duplication.
Bug Fixesβ
- Fixed the issue where the table name contained
.
could lead to data synchronization failure. - Fixed the problem where the task exception information did not include the table name.
V3.5.5β
New Featuresβ
- Newly supports Hive3 as a target.
- When MongoDB is the target, newly supports automatic creation of sharded collections.
- Newly added Unwind Processing Node, helping you efficiently "unwind" elements in an array, converting each element into a separate data row.
- When configuring tasks, newly supports the ability to disable nodes. Hovering over a node now offers this functionality, helping to reduce the cost of data flow in the process.
Enhancementsβ
- Optimized the setting of published API scope, allowing adjustments without needing to publish.
- When configuring data replication tasks, the selectable table range dropdown box allows quick filtering of tables with or without primary keys, where tables with primary keys include those without primary keys but with unique indexes.
Bug Fixesβ
- Fixed the issue where the MongoDB target encountered an error in INSERT operations when no sharding key was found.
- Fixed the problem where MongoDB does not support REPLACE, meaning that fields deleted by REPLACE could not be properly removed.
V3.5.4β
New Featuresβ
- Added building materialized views feature, enabling quick construction of real-time data models.
- Added support for configuring source nodes of shared mining tasks, including settings for enabling incremental multi-threaded writing and supplementing updated data with complete fields.
- Kafka data source added support for setting the number of replicas and partitions.
- Added support for the
$unset
operation during synchronization between MongoDB instances.
Enhancementsβ
- Data verification feature field filtering experience optimization.
- Supported quick node targeting at the top of the data replication/data transformation configuration page through node search.
V3.5.2β
New Featuresβ
- Added Python Processing Node, supporting custom data processing logic through Python scripts, offering performance improvements compared to JS processing nodes.
- Added support for data synchronization between Redis instances.
Enhancementsβ
- Enhanced data source error codes, covering more scenarios and providing solutions.
V3.5.1β
New Featuresβ
- Now when creating a role, it supports the granular granting of functional and data rights.
Enhancementsβ
- Enhanced the UI prompts and guidance when setting up core data sources like PostgreSQL, Redis, etc.
- Improved test scenarios when using MongoDB as external storage.
Bug Fixesβ
- Resolved an issue where users were unable to access the run-monitoring page for executed tasks.
V3.4β
New Featuresβ
- When task configurations are set for full + incremental sync, there's now support to turn on the scheduled periodic task feature. The task will automatically stop, reset, and run again at the set time.
- For the add/remove field node, field order adjustment is now supported.
- A new feature to dynamically adjust memory has been introduced (enabled by default). During the full synchronization phase, it identifies memory usage and auto-adjusts the memory queue, effectively preventing memory overflow scenarios.
- The data panel has been renamed to the Real-time Data Center, with added guidance on usage and task creation.
- Introduced a target write strategy, where if an update event does not exist, it can be written to a local log.
Enhancementsβ
- Data validation usability and UI interactions have been enhanced.
- Error code implementation for the MongoDB data source has been added.
- Optimized the incremental delay metric on the run-monitoring page, using the task's incremental delay warning threshold as the Y-axis data source.
- Enhanced the display of sample data.
Bug Fixesβ
- Fixed task count limits in the engine specs.
- Resolved an issue where MongoDB showed unsupported error prompts when used with custom SQL as the target.
- Fixed an issue where, after turning on automatic periodic scheduling for a task, if auto-reset fails, the task won't attempt a reset in the next cycle and won't be scheduled to run.
V3.3β
New Featuresβ
- Kafka data source now supports custom message body formats.
- Added the API interface documentation export feature to help teams quickly establish and enhance API usage documents.
- Shared mining functionality supports configuring task alerts, allowing alerts via system notifications or emails for better task monitoring.
- The data validation function allows setting data filters, enabling validation of specific conditional data only, reducing validation scope and increasing efficiency.
- In data service platform mode, when dragging a data table to the platform cache layer to generate a task, it supports setting the synchronization type of the task to be full or incremental.
Enhancementsβ
- Introduced rolling upgrades, which, compared to the downtime upgrade method, further reduces business impacts.
- Post-error in shared mining tasks, associated tasks now include alert prompts.
- In the row filter processing node, added usage examples when filtering with the DATE type.
- Time operation node now displays adjusted fields.
- Optimized algorithm for estimating remaining time for full synchronization.
- Field processing nodes now support one-click copy and paste for configurations.
Bug Fixesβ
- Fixed an issue where launching TM without setting the java environment variable would prevent it from starting, adding log output for this issue.
- Addressed a problem where the admin user, after changing the username in the personal center, could not view any menus.
- Fixed an issue in data replication where, during task creation, all data sources did not support DDL.
- Resolved a problem in data replication tasks, where during node configuration, adding a prefix or suffix would reload after each character input.
V3.2β
New Featuresβ
- In the data platform mode, it can directly display the relationship of table-level traceability, helping you to visually show the link relationship of data tables.
- In the data platform mode, it supports deleting tables from the platform processing layer.
- When configuring the target node of a task, it supports adjusting field length by a coefficient to avoid data write failures due to different character encodings.
- Data verification feature supports SelectDB data source.
- In scenarios where Redis is the target node, and data is stored in List or Hash format with a single key, it supports writing the source table schema into a Hash key (default name is
-schema-key-
). The value is used to store the source table's table name and column name information. - Added type filter processing node, which can quickly filter columns of the same type. Filtered fields will not be passed to the next node.
- Field editing processing node supports conversion between snake_case and camelCase naming.
- Data copy tasks, data conversion tasks, data panels, and caching creation support displaying table description information, defaulting to table comment information.
Enhancementsβ
- Product menu adjustments: data development is renamed to data conversion. Some functions have been moved to advanced settings (e.g., shared cache).
- Improved interaction for tables without primary keys, e.g., support for filtering non-primary key tables and adding primary key table identification when configuring data copy tasks.
- For external storage configurations of MongoDB data sources, connection testing capability has been added.
- When creating a new external storage and choosing MongoDB, it supports using SSL connections.
- Creating an HttpReceiver data source now supports script trial runs and access authentication functionality.
- Standard JS node capabilities adjusted, adding Linked HashMap data structure and context.global object.
- Field editing processing node's UI interaction has been improved.
- Redundant prompts for task startup and schema reload have been optimized.
- Data copy tasks support manually adding new tables. New tables can achieve full + incremental data synchronization.
- Improved user experience and interaction for data verification.
- Optimized task node configuration processing logic.
- In the data panel's platform cache layer and data processing layer, you can display connection and table information generated by data copy/transfer tasks.
- In the data directory mode of the data panel, you can add description information for tables and fields.
- Optimized deployment flow and prompts for TapData.
- TapData launcher optimization: restarting the service does not require re-registering the data source.
- When starting and stopping the Agent node, PDK registration is automatically stopped.
- Overall optimization of interaction for data copy and data conversion task configuration.
Bug Fixesβ
- Fixed 2 issues where specifying different external storage for Oracle data sources resulted in external storage not being user-specified after merging.
- Fixed an issue where turning on shared mining for import task data sources made external storage configuration display as id and unmodifiable.
- Fixed a task merging issue from the data source to the platform cache layer.
V3.1β
New Featuresβ
- Data panel functionality now supports table-level traceability capabilities. You can view data lineage relationships through table details.
- When configuring data copy tasks, you can view the table model in the processing node.
- Supports publishing API data services based on Doris data source Release API Data Services.
- Cluster management page allows downloading thread resource monitoring and data source usage data.
Enhancementsβ
- Shared mining task management improved, supporting starting/stopping mining tasks for individual tables.
- Shared cache, functions, API data services support import/export functions.
- Data verification supports configuring alert rules and notification methods.
- Auto-fill table logic for data verification has been optimized.
- Frontend added explanations for the distinction between standard JS and enhanced JS.
- JS processor standardization, JS usage, and trial run have been restructured.
- In all processing nodes supporting JS scripting, typing
record.
automatically prompts for the current model's field names. - Resolving timeout issues caused by clearing external storage data during a reset has been optimized.
- Support for modifying primary keys.
- Supports setting the default interval for incremental synchronization tasks via scripting.
- License optimization: Binding to IP addresses prevents hardware changes from invalidating the License.
- Enhanced usage prompts for Excel data sources.
- Performance enhancements:
- JS Node processing performance improved.
- Field processing node processing performance improved.
- Master-slave merge node performance improved.
- Field editing node frontend display optimization in multi-field scenarios.
- Optimized data type boundary prompts and handling logic.
- Connection management filter bar database type dropdown supports search and clear selection.
- Error code pop-ups now offer a one-click copy feature for the error stack.
Bug Fixesβ
- Fixed issues with incremental metrics lacking incremental time points for polling sources.
- Fixed an issue where model changes forcibly delete updated fields.
- Fixed node configuration issues for type modification, field addition/deletion, and field renaming; configurations are reset when loading the model.
- Fixed errors occurring when turning on the full custom switch and the target is MongoDB.
V3.0β
New Featuresβ
- Integrated GraphQL capability, enriching API query methods.
- Added application categorization capability for APIs, facilitating categorization based on business.
- Introduced time calculation processing node for flexible handling of discrepancies in source and destination database time zones.
- Introduced full-scale partitioning capability, currently only supported for MongoDB.
Enhancementsβ
- Shared cache function improved, offering an observable page to monitor mining progress and troubleshoot failures.
- Full custom query function relaxed the restriction of only using JS nodes, now allowing the addition of other processing nodes with the node model directly utilizing the source table's model.
- The field processing node supporting operations like adding/deleting fields, type modifications, and renaming fields now includes a field search function.
- Adjusted wording for Schema loading frequency configuration in connection settings.
- Optimization of table name modification logic in the Table Editing Node; removed the apply button for direct configuration effectiveness.
- During the startup of the management process (frontend), it now includes heapDump and stackTrace parameters, similar to the synchronization governance process.
- Introduced task editing versioning to prevent overwriting of higher-version configurations by lower versions when multiple users edit the same task.
- Documentation on the right side of the data source configuration now supports image enlargement.
- Implementation of error codes for Oracle data sources.