Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Datastream adalah layanan replikasi dan pengambilan data perubahan (CDC) yang serverless serta mudah digunakan yang memungkinkan Anda menyinkronkan data dengan andal, dan dengan latensi minimal.
Datastream menyediakan replikasi data yang lancar dari database operasional ke BigQuery. Selain itu, Datastream mendukung penulisan aliran peristiwa perubahan ke Cloud Storage, dan menawarkan integrasi yang disederhanakan dengan template Dataflow untuk membangun alur kerja kustom guna memuat data ke berbagai tujuan, seperti Cloud SQL dan Spanner. Anda juga dapat menggunakan Datastream untuk memanfaatkan aliran peristiwa langsung dari Cloud Storage guna mewujudkan arsitektur berbasis peristiwa. Datastream mendukung sumber Oracle, MySQL, SQL Server, PostgreSQL (termasuk AlloyDB untuk PostgreSQL), MongoDB (Pratinjau), dan Salesforce (Pratinjau).
Manfaat Datastream meliputi:
Penyiapan pipeline ELT (Ekstraksi, Pemuatan, Transformasi) yang lancar untuk replikasi data latensi rendah guna mengaktifkan insight mendekati real-time di BigQuery.
Bersifat serverless sehingga tidak ada resource yang perlu disediakan atau dikelola, dan layanan ini dapat diskalakan secara otomatis, sesuai kebutuhan, dengan periode nonaktif minimal.
Pengalaman penyiapan dan pemantauan yang mudah digunakan untuk mencapai waktu pemerolehan manfaat yang sangat cepat.
Integrasi di seluruh portofolio layanan data terbaik untuk integrasi data di seluruh Datastream, Dataflow, Pub/Sub, BigQuery, dan lainnya. Google Cloud
Menyinkronkan dan menyatukan aliran data di berbagai database dan aplikasi yang heterogen.
Keamanan, dengan opsi konektivitas pribadi dan keamanan yang Anda harapkan dari
Google Cloud.
Akurat dan andal, dengan pelaporan status yang transparan dan fleksibilitas pemrosesan yang kuat dalam menghadapi perubahan data dan skema.
Mendukung beberapa kasus penggunaan, termasuk analisis, replikasi database, dan sinkronisasi untuk migrasi dan konfigurasi hybrid cloud, serta untuk membangun arsitektur berbasis peristiwa.
Kasus penggunaan
Kemampuan streaming Datastream memungkinkan berbagai kasus penggunaan:
Mereplikasi dan menyinkronkan data di seluruh organisasi Anda dengan latensi
minimal
Anda dapat menyinkronkan data di berbagai database dan aplikasi heterogen dengan andal, dengan latensi rendah, dan dengan dampak minimal pada performa sumber. Manfaatkan keandalan aliran data untuk analisis, replikasi database, migrasi cloud, dan arsitektur berbasis peristiwa di seluruh lingkungan hybrid.
Tingkatkan atau turunkan skala dengan arsitektur serverless secara lancar
Siapkan dan jalankan secara cepat dengan layanan tanpa server dan mudah digunakan yang dapat menskalakan secara lancar saat volume data Anda berubah. Fokuslah untuk mendapatkan insight terbaru dari data Anda dan menanggapi masalah prioritas tinggi, bukan mengelola infrastruktur, menyesuaikan performa, atau menyediakan resource.
Berintegrasi dengan Google Cloud kumpulan alat integrasi data
Hubungkan data di seluruh organisasi Anda dengan rangkaian produk integrasi data. Google Cloud Mengintegrasikan Datastream dengan template tugas Dataflow untuk membaca data dari bucket Cloud Storage dan memuatnya ke berbagai tujuan, seperti BigQuery, Spanner, dan Cloud SQL.
Elemen pengalaman
Ada tiga elemen utama di Datastream:
Konfigurasi konektivitas pribadi memungkinkan Datastream berkomunikasi dengan sumber data melalui jaringan pribadi (secara internal dalamGoogle Cloud, atau dengan sumber eksternal yang terhubung melalui VPN atau Interconnect). Komunikasi ini terjadi melalui koneksi peering Virtual Private Cloud (VPC).
Profil koneksi menunjukkan informasi konektivitas ke sumber dan tujuan. Informasi ini akan digunakan oleh aliran.
Aliran data menggunakan informasi dalam profil koneksi untuk mentransfer data CDC dan pengisian ulang dari sumber ke tujuan.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-08-12 UTC."],[[["\u003cp\u003eDatastream is a serverless change data capture (CDC) and replication service that synchronizes data from various operational databases, including Oracle, MySQL, SQL Server, PostgreSQL, and Salesforce, into BigQuery, Cloud Storage, and other destinations.\u003c/p\u003e\n"],["\u003cp\u003eThis service offers low-latency data replication, enabling near real-time insights, seamless scaling, and easy setup and monitoring without the need for manual resource management.\u003c/p\u003e\n"],["\u003cp\u003eDatastream integrates with Google Cloud's data services like Dataflow, Pub/Sub, and BigQuery to build ELT pipelines and is designed to unify data streams across heterogeneous databases and applications.\u003c/p\u003e\n"],["\u003cp\u003eThe platform supports a variety of use cases, such as analytics, database replication, migration and synchronization across hybrid-cloud environments, and building event-driven architectures with minimal latency.\u003c/p\u003e\n"]]],[],null,["# Datastream overview\n\nDatastream is a serverless and easy-to-use change data capture (CDC) and replication service that lets you synchronize data reliably, and with minimal latency.\n\nDatastream provides seamless replication of data from operational databases into BigQuery. In addition, Datastream supports writing the change event stream into Cloud Storage, and offers streamlined integration with Dataflow templates to build custom workflows for loading data into a wide range of destinations, such as Cloud SQL and Spanner. You can also use Datastream to take advantage of the event stream directly from Cloud Storage to realize event-driven architectures. Datastream supports Oracle, MySQL, SQL Server, PostgreSQL (including AlloyDB for PostgreSQL), MongoDB ([Preview](/products#product-launch-stages)) and Salesforce ([Preview](/products#product-launch-stages)) sources.\n\nBenefits of Datastream include:\n\n- Seamless setup of ELT (Extract, Load, Transform) pipelines for low-latency data replication to enable near real-time insights in BigQuery.\n- Being serverless so there are no resources to provision or manage, and the service scales up and down automatically, as needed, with minimal downtime.\n- Easy-to-use setup and monitoring experiences that achieve super-fast time-to-value.\n- Integration across the best of Google Cloud data services' portfolio for data integration across Datastream, Dataflow, Pub/Sub, BigQuery, and more.\n- Synchronizing and unifying data streams across heterogeneous databases and applications.\n- Security, with private connectivity options and the security you expect from Google Cloud.\n- Being accurate and reliable, with transparent status reporting and robust processing flexibility in the face of data and schema changes.\n- Supporting multiple use cases, including analytics, database replication, and synchronization for migrations and hybrid-cloud configurations, and for building event-driven architectures.\n\nUse cases\n---------\n\nThe streaming capabilities of Datastream enable a variety of use cases:\n\n- **Replicating and synchronizing data across your organization with minimal\n latency**\n\n You can synchronize data across heterogeneous databases and applications\n reliably, with low latency, and with minimal impact to the performance of\n your source. Unlock the power of data streams for analytics, database\n replication, cloud migration, and event-driven architectures across hybrid\n environments.\n- **Scale up or down with a serverless architecture seamlessly**\n\n Get up and running fast with a serverless and easy-to-use service that\n scales seamlessly as your data volumes shift. Focus on deriving up-to-date\n insights from your data and responding to high-priority issues, instead of\n managing infrastructure, performance tuning, or resource provisioning.\n- **Integrate with the Google Cloud data integration suite**\n\n Connect data across your organization with the Google Cloud data\n integration suite of products. Integrate Datastream with\n Dataflow job templates to read data from a Cloud Storage bucket\n and load it into a variety of destinations, such as BigQuery,\n Spanner, and Cloud SQL.\n\nExperience elements\n-------------------\n\nThere are three main elements in Datastream:\n\n- **Private connectivity configurations** enable Datastream to communicate with a data source over a private network (internally within Google Cloud, or with external sources connected over VPN or Interconnect). This communication happens through a Virtual Private Cloud (VPC) peering connection.\n- **Connection profiles** represent connectivity information to both a source and a destination. This information will be used by a stream.\n- **Streams** use the information in the connection profiles to transfer CDC and backfill data from the source to the destination.\n\nWhat's next\n-----------\n\n- Start replicating your data [from a source database to BigQuery datasets](/datastream/docs/quickstart-replication-to-bigquery).\n- Learn more about [key concepts and features](/datastream/docs/behavior-overview) of Datastream.\n- Find out how to create [private connectivity configurations](/datastream/docs/create-a-private-connectivity-configuration), [connection profiles](/datastream/docs/create-connection-profiles) and [streams](/datastream/docs/create-a-stream)."]]