Publish-subscribe
Publish-Subscribe (Pub/Sub for short) of a database is a messaging model in which Publisher sends messages to one or more Subscribers, and Subscribers The message is received and processed. In this mode, publishers and subscribers are loosely coupled, and no direct communication is required between them, thus improving the scalability and flexibility of the application.
In databases, the publish-subscribe function is usually used in scenarios such as real-time data updates, cache synchronization, and business event notification. For example, when the data of a particular table in the database changes, the subscribers can be notified in real-time through the publish and subscribe function, to realize real-time data synchronization and processing. In addition, the notification of business events can also be recognized through the publish and subscribe function, such as an order being canceled, a certain inventory quantity is insufficient, and so on.
There can be a many-to-many relationship between publishers and subscribers; one publisher can publish messages to multiple subscribers, and one subscriber can also subscribe to various messages/data. Usually, the publish-subscribe function of the database consists of two parts: Publisher and Subscriber. Publisher is responsible for publishing messages, while Subscriber subscribes to corresponding messages to achieve data synchronization.
Application scenarios
The publish-subscribe function has many typical application scenarios:
-
Data Synchronization: When a database needs to be kept in sync with another database, the publish-subscribe feature can send data changes to the subscriber database. For example, when a website needs to transfer data from one geographic location to another, publish-subscribe functionality can ensure data synchronization between the two databases.
-
Business data distribution: The publish and subscribe function can distribute business data to different systems or processes. For example, when a bank needs to distribute customer account information to multiple business systems, the publish-subscribe function can distribute data to corresponding systems to ensure data consistency between various business processes.
-
Data backup: The publish-subscribe function can back up data. For example, when one database needs to be backed up to another database, the publish-subscribe part can be used to back up the data to the subscriber database so that the data can be recovered in the event of failure of the primary database.
-
Real-time data processing: The publish-subscribe function can be used to realize real-time data processing. For example, when a website needs to process data from different users, the publish-subscribe part can be used to transmit data to a processing program for processing, to realize real-time data analysis and decision-making.
Concepts
-
Publication: In a database, a publication often refers to the process of setting a database object to be accessible by other accounts. It is a crucial step in data sharing and replication, where the published objects can be subscribed to by other accounts, and their data can be accessed.
-
Subscription: A subscription refers to a database choosing to receive and replicate the data of a published database object.
-
Publisher (Pub): The Publisher is the database that performs the publishing operation. The Publisher is responsible for creating and managing the published objects, as well as managing the access permissions of databases subscribing to these published objects.
-
Subscriber (Sub): The Subscriber is the account that subscribes to the published objects.
-
Published Object: A published object is a database created by the Publisher and made available for publication, namely a database. The data of these objects can be accessed and replicated by the Subscriber.
-
Subscribed Object: A subscribed object is a published object replicated and stored on the Subscriber. The data of the subscribed object is updated according to the data on the Publisher.
Publication/Subscription Scope Explanation
Publish/Subscribe Application Scope
Both Publisher and Subscriber are accounts of MatrixOne.
Publishable/Subscribable Permissions
- Only ACCOUNTADMIN or MOADMIN role can create publications and subscriptions on the Publisher.
- Subscribers are controlled by ACCOUNTADMIN or MOADMIN roles to access subscription data permissions.
Publication/Subscription Data Scope
- A single Publication can only be associated with one database.
- Publications and subscriptions are only implemented at the database level, with no current support for direct publication and subscription at the table level.
- The Subscriber only has read access to the Subscribed database.
- If the Publisher adjusts the sharing scope of the publication, those accounts that are no longer within the new scope and have already created a subscribed database will find that their access to the Subscribed database is invalid.
- If the Publisher attempts to delete a database that has been published, the deletion will fail.
- If the Publisher deletes a Publication, but the corresponding object still exists in the subscribed database, an error will be triggered when the Subscriber attempts to access this object. The Subscriber will need to delete the corresponding Subscription.
- If the Publisher deletes a Published object, but the corresponding object still exists in the subscribed database, an error will be triggered when the Subscriber attempts to access this object. The Subscriber must delete the corresponding Subscribed object.
Examples
This chapter will give an example to introduce that there are currently three accounts in the MatrixOne cluster, sys, acc1, and acc2, and operate on the three accounts according to the order of operations:
-
Publisher: sys account creates database sub1 and table t1, and publishes pub1:
create database sub1; create table sub1.t1(a int,b int); create publication pub1 database sub;
-
Subscriber: both acc1 and acc2 create a subscription database syssub1, and thus get the shared table t1:
-- The SQL statements for acc1 and acc2 to create the subscription library are the same, so there will not repeat them create database syssub1 from sys publication pub1; use syssub1; show tables; mysql> show tables; +--------------------+ | Tables_in_syssub1 | +--------------------+ | t1 | +--------------------+ 2 rows in set (0.02 sec)
-
Publisher: sys account creates table t2:
create table sub1.t2(a text);
-
Subscribers: acc1 and acc2 get shared tables t1 and t2:
show tables; +--------------------+ | Tables_in_syssub1 | +--------------------+ | t1 | +--------------------+ | t2 | +--------------------+ 2 rows in set (0.02 sec)
-
Publisher: sys account creates database sub2 and table t2, and publishes pub2 to accounts acc1 and acc3:
create database sub2; create table sub2.t1(a float); create publication pub2 database sub2 account acc1,acc3;
-
Subscriber: both acc1 and acc2 create the subscription database syssub2, and acc1 gets the shared data table t1; acc2 fails to create the subscription database syssub2:
- acc1
create database syssub2 from sys publication pub2; use syssub2; mysql> show tables; +--------------------+ | Tables_in_syssub2 | +--------------------+ | t1 | +--------------------+ 2 rows in set (0.02 sec)
- acc2
create database syssub2 from sys publication pub2; > ERROR 20101 (HY000): internal error: the account acc3 is not allowed to subscribe the publication pub2
-
Publisher: The sys account modifies and publishes pub2 to all accounts:
alter publication pub2 account all;
-
Subscriber: acc2 successfully created the subscription database syssub2, and got the shared data table t1:
create database syssub2 from sys publication pub2; use syssub2; mysql> show tables; +--------------------+ | Tables_in_syssub2 | +--------------------+ | t1 | +--------------------+ 2 rows in set (0.02 sec)
-
Publisher: sys account deletes publication pub1:
drop publication pub1;
-
Subscriber: acc1 failed to connect to syspub1:
use syssub1; ERROR 20101 (HY000): internal error: there is no publication pub1
-
Subscriber: acc2 delete syspub1:
drop database syssub1;
-
Publisher: sys account recreates pub1:
create publication pub1 database sub;
-
Subscriber: acc1 connects to syspub1 successfully:
create database syssub1 from sys publication pub1; use syssub1; mysql> show tables; +--------------------+ | Tables_in_syssub1 | +--------------------+ | t1 | +--------------------+ 2 rows in set (0.02 sec)