Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
27 commits
Select commit Hold shift + click to select a range
5421756
feat: add akvelon logo
v-safiullin May 14, 2025
db5ea64
feat: add akvelon case study
v-safiullin May 14, 2025
29089d9
fix: remove white space
v-safiullin May 14, 2025
92332f4
feat: add akvelon to main page
v-safiullin May 14, 2025
ef3d284
feat: use new images
v-safiullin May 15, 2025
dec329d
fix: typos
v-safiullin May 15, 2025
e2160db
fix: change order of akvelon case-study
v-safiullin May 15, 2025
b933511
fix: update text
v-safiullin May 19, 2025
a21d027
fix: update mainPage text
v-safiullin May 19, 2025
da43db7
fix: update images
v-safiullin May 19, 2025
2f17f59
fix: about akvelon section update
v-safiullin May 19, 2025
a318704
Merge branch 'feat/akvelon-case-study' of https://github.com/bullet03…
v-safiullin May 19, 2025
7f23500
fix: update akvelon card
v-safiullin May 19, 2025
6680f79
fix: update akvelon header
v-safiullin May 19, 2025
3670dee
fix: update code tag
v-safiullin May 19, 2025
3d5d585
fix: update about akvelon
v-safiullin May 20, 2025
0c660d0
fix: update date and order
v-safiullin May 29, 2025
ca68daa
fix: add link and change img
v-safiullin May 30, 2025
c80643b
fix: change CDAP text
v-safiullin Jun 2, 2025
4249a50
fix: add bold weight
v-safiullin Jun 2, 2025
684573c
fix: solve conflicts
v-safiullin Jun 2, 2025
88904d2
fix: remove unused code
v-safiullin Jun 2, 2025
ee39495
Merge branch 'master' into feat/akvelon-case-study
bullet03 Jun 2, 2025
de41588
fix: delete whitespace
v-safiullin Jun 2, 2025
88dc848
Merge branch 'feat/akvelon-case-study' of https://github.com/bullet03…
v-safiullin Jun 2, 2025
e215949
fix: indents format
v-safiullin Jun 2, 2025
6097691
fix: add bold text
v-safiullin Jun 2, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
173 changes: 169 additions & 4 deletions website/www/site/content/en/case-studies/akvelon.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,17 @@
---
title: "Akvelon"
icon: /images/logos/powered-by/akvelon.png
hasNav: true
cardDescription: "<p><a href='https://akvelon.com/' target='_blank' rel='noopener noreferrer'>Akvelon</a> is a software engineering company that helps start-ups, SMBs, and Fortune 500 companies unlock the full potential of cloud, data, and AI/ML to empower their strategic advantage. Akvelon team has deep expertise in integrating Apache Beam with diverse data processing ecosystems and is an enthusiastic Apache Beam community contributor.</p>"
title: "Secure and Interoperable Apache Beam Pipelines by Akvelon"
name: "Akvelon"
icon: "/images/logos/powered-by/akvelon.png"
category: "study"
cardTitle: "Secure and Interoperable Apache Beam Pipelines by Akvelon"
cardDescription: "To support data privacy and pipeline reusability at scale, Akvelon developed Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates. Akvelon also built a CDAP Connector to integrate CDAP plugins with Apache Beam, enabling plugin reuse and multi-runtime compatibility."
authorName: "Vitaly Terentyev"
coauthorName: "Ashley Pikle"
authorPosition: "Software Engineer @Akvelon"
coauthorPosition: "Director of AI Business Development @Akvelon"
authorImg: /images/case-study/akvelon/terentyev.png
coauthorImg: /images/case-study/akvelon/pikle.png
publishDate: 2025-05-25T00:12:00+00:00
---
<!--
Licensed under the Apache License, Version 2.0 (the "License");
Expand All @@ -17,3 +26,159 @@ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->
<div class="case-study-opinion">
<div class="case-study-opinion-img">
<img src="/images/logos/powered-by/akvelon.png"/>
</div>
<blockquote class="case-study-quote-block">
<p class="case-study-quote-text">
“To support data privacy and pipeline reusability at scale, Akvelon developed Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates. Akvelon also built a CDAP Connector to integrate CDAP plugins with Apache Beam, enabling plugin reuse and multi-runtime compatibility.”
</p>
<div class="case-study-quote-author">
<div class="case-study-quote-author-img">
<img src="/images/case-study/akvelon/pikle.png">
</div>
<div class="case-study-quote-author-info">
<div class="case-study-quote-author-name">
Ashley Pikle
</div>
<div class="case-study-quote-author-position">
Director of AI Business Development @Akvelon
</div>
</div>
</div>
</blockquote>
</div>
<div class="case-study-post">

# Secure and Interoperable Apache Beam Pipelines by Akvelon

## Background

To meet growing enterprise needs for secure, scalable, and interoperable data processing pipelines, **Akvelon** developed multiple Apache Beam-powered solutions tailored for real-world production environments:
- Data tokenization and detokenization capabilities for **Protegrity** and a leading North American credit reporting company
- A connector layer to integrate **CDAP** plugins into Apache Beam pipelines

By leveraging [Apache Beam](https://beam.apache.org/) and [Google Cloud Dataflow](https://cloud.google.com/products/dataflow?hl=en), Akvelon enabled its clients to achieve scalable data protection, regulatory compliance, and platform interoperability through reusable, open-source pipeline components.

## Use Case 1: Data Tokenization for Protegrity and a Leading Credit Reporting Company

### The Challenge

**Protegrity**, a leading enterprise data-security vendor, sought to enhance its data protection platform with scalable tokenization support for batch and streaming data. Their goal: allow customers such as a major North American credit reporting company to tokenize sensitive data using Google Cloud Dataflow. The solution needed to be fast, secure, reusable, and compliant with privacy regulations (e.g., HIPAA, GDPR).

### The Solution

Akvelon designed and implemented a **Dataflow Flex Template** using Apache Beam that allows users to tokenize and detokenize sensitive data within both batch and streaming pipelines.

<div class="post-scheme">
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

minor: Beam Runner -> Dataflow Runner

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed in img

<a href="/images/case-study/akvelon/diagram-01.png" target="_blank" title="Click to enlarge">
<img src="/images/case-study/akvelon/diagram-01.png" alt="Protegrity & Equifax Tokenization Pipeline">
</a>
</div>

### Key features
- **Seamless integration with Protegrity UDFs**, enabling native tokenization directly within Beam transforms without requiring external service orchestration
- **Support for multiple data formats** such as CSV, JSON, Parquet, allowing flexible deployment across diverse data pipelines
- **Stateful processing with `DoFn` and timers**, which improves streaming reliability and reduces overall pipeline latency
- **Full compatibility with Google Cloud Dataflow**, ensuring autoscaling, fault tolerance, and operational simplicity through managed Apache Beam execution

This design provided both Protegrity and its enterprise clients with a reusable, open-source architecture for scalable data privacy and processing.

### The Results
- **Enabled data tokenization at** for regulated industries
- **Accelerated adoption of Dataflow templates** across Protegrity’s customer base
- **Delivered an [open-source Flex Template](https://github.com/apache/beam/blob/master/examples/java/src/main/java/org/apache/beam/examples/complete/datatokenization/README.md)** that benefits the entire Apache Beam community

<blockquote class="case-study-quote-block case-study-quote-wrapped">
<p class="case-study-quote-text">
In collaboration with Akvelon, Protegrity utilized a Dataflow Flex template that helps us enable customers to tokenize and detokenize streaming and batch data from a fully managed Google Cloud Dataflow service. We appreciate Akvelon’s support as a trusted partner with Google Cloud expertise.
</p>
<div class="case-study-quote-author">
<div class="case-study-quote-author-img">
<img src="/images/case-study/akvelon/chitnis.png">
</div>
<div class="case-study-quote-author-info">
<div class="case-study-quote-author-name">
Jay Chitnis
</div>
<div class="case-study-quote-author-position">
VP of Partners and Business Development @Protegrity
</div>
</div>
</div>
</blockquote>

## Use Case 2: CDAP Connector for Apache Beam

### The Challenge

**CDAP** had extensive plugin support for Spark but lacked native compatibility with Apache Beam. This limitation prevented organizations from reusing CDAP's rich ecosystem of data connectors (e.g., Salesforce, HubSpot, ServiceNow) within Beam-based pipelines, constraining cross-platform integration.

### The Solution

Akvelon engineered a **shim layer** (CDAP Connector) that bridges CDAP plugins with Apache Beam. This innovation enables CDAP source and sink plugins to operate seamlessly within Beam pipelines.

<div class="post-scheme">
<a href="/images/case-study/akvelon/diagram-02.png" target="_blank" title="Click to enlarge">
<img src="/images/case-study/akvelon/diagram-02.png" alt="CDAP Connector Integration with Apache Beam">
</a>
</div>

### Highlights

- Supports `StructuredRecord` format conversion to Beam schema (`BeamRow`)
- Enables CDAP plugins to run seamlessly in both Spark and Beam pipelines
- Facilitates integration testing across third-party data sources (e.g., Salesforce, Zendesk)
- Complies with Beam’s development and style guide for open-source contributions

The project included prototyping, test infrastructure, and Salesforce plugin pipelines to ensure robustness.

### The Results

- **Enabled seamless reuse of CDAP plugins in Beam**
- **30+ CDAP plugins** now work seamlessly with Beam pipelines
- **Integration time** reduced **from hours to just a few minutes**
- **Simplified execution and migration of CDAP pipelines to Beam**
- Enabled **seamless execution** of CDAP pipelines on the Beam runtime
- Simplified **migration of existing CDAP pipelines** to Beam with minimal changes
- **Accelerated delivery and validated performance for Google Cloud customers**
- Delivered **rapid development cycles** with standardized plugin configurations
- Successfully processed **5 million records** in end-to-end tests for **batch and streaming**

## Technology Stack

- Apache Beam
- Google Cloud Dataflow
- Protegrity Data Protection Platform
- CDAP (Cloud Data Fusion)
- BigQuery
- Salesforce, Zendesk, HubSpot, ServiceNow plugins

## Final words

Akvelon’s contributions to Apache Beam-based solutions - from advanced tokenization for Protegrity and its enterprise customers to enabling plugin interoperability through the CDAP Connector - demonstrate the value of open-source, cloud-native data engineering. By delivering reusable and secure components, Akvelon supports enterprises in modernizing and unifying their data infrastructure.

## Watch the Solution in Action

[Architecture Walkthrough Video ](https://www.youtube.com/watch?v=IQIzdfNIAHk)

## About Akvelon, Inc.

Akvelon guides enterprises through digital transformation on Google Cloud - applying deep expertise in data engineering, AI/ML, cloud infrastructure, and custom application development to design, deploy, and scale modern workloads.

At Akvelon, we’ve built a long-standing partnership with Google Cloud—helping software-driven organizations implement, migrate, modernize, automate, and optimize their systems while making the most of cloud technologies.

As a **Google Cloud Service** and **Build Partner**, we contribute actively to the ecosystem:
- Contributing code and guidance to **Apache Beam**—including Playground, Tour of Beam, and the Duet AI training set
- Improving project infrastructure and supporting the Apache Beam community—now with an official Apache Beam Committer on our team

Backed by deep expertise in data engineering, AI/ML, cloud architecture, and application development, our engineers deliver reusable, secure, and production-ready solutions on Google Cloud for enterprises worldwide.

- [Akvelon on Google Cloud](https://cloud.google.com/find-a-partner/partner/akvelon)
- [Akvelon Data and Analytics Accelerators](https://github.com/akvelon/DnA_accelerators)

{{< case_study_feedback "Akvelon" >}}

</div>
<div class="clear-nav"></div>
4 changes: 4 additions & 0 deletions website/www/site/data/en/quotes.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -81,6 +81,10 @@
logoUrl: /images/logos/powered-by/accenture.png
linkUrl: case-studies/accenture_baltics/index.html
linkText: Learn more
- text: Akvelon built Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates and reducing infrastructure and deployment complexity.
icon: icons/quote-icon.svg
logoUrl: /images/logos/powered-by/akvelon.png
linkUrl: case-studies/akvelon/index.html
- text: With Apache Beam and Dataflow, Credit Karma achieved a 99% uptime for critical data pipelines, a significant jump from 80%. This reliability, coupled with faster development (1 engineer vs. 3 estimated), has been crucial for enabling real-time financial insights for our more than 140 million members.
icon: icons/quote-icon.svg
logoUrl: images/logos/powered-by/credit-karma.png
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading