Posts

Showing posts from November, 2021

Exploring the Functions of the Amazon S3 Data Lake

Image
 Amazon S3 (Simple Storage Service) is a cloud-based data storage service that stores unstructured, semi-structured, and structured data in their native formats. Data durability of S3 is at an amazing 99.999999999 (11 9s) with data protected in a highly optimized and secure environment. Data files having metadata and objects are stored in buckets. For uploading metadata or files, the object has to be uploaded to Amazon S3 after which permissions can be granted on the metadata or the related objects in the buckets. Several competencies are used for building an S3 data lake on Amazon S3. These include media data processing applications, Artificial Intelligence (AI), Machine Learning (ML), big data analytics, and high-performance computing (HPC). All these together help businesses get access to vital business intelligence and analytics from the S3 data lake and unstructured data sets. A benefit of S3 data lake is that computing and storage facilities are in different silos. Hence, all da

Oracle CDC Software and Technology

Image
 The Change Data Capture technology was first launched as a built-in feature of the Oracle database in the 9i version. It was very effective in tracking and recording all changes made in user tables in a database. These changes were stored in change tables for use in ETL applications. The data could then be processed and loaded into other data warehouses and databases. The first release of the Oracle Change Data Capture functioned through triggers placed in the source. However, this method did not find favor with the DBAs as it was too invasive.   Subsequently, a new and improved version of the Oracle Change Data Capture, named Oracle Streams,was introduced with the 10g version of the Oracle database. It leveraged the redo logs of the source database and the built-in replication tool of Oracle. Streams was a very optimized process to identify and move change data to a target database without adversely affecting the performance of the source database. Unfortunately, despite its efficie