-
Notifications
You must be signed in to change notification settings - Fork 533
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Umbrella] InLong Transform feature #10022
Comments
This was referenced Apr 29, 2024
Closed
Merged
2 tasks
This was referenced May 6, 2024
Merged
Merged
This issue is stale because it has been open for 60 days with no activity. |
github-actions
bot
added
the
stage/stale
Issues or PRs that had no activity for a long time
label
Jul 7, 2024
This was referenced Jul 10, 2024
Closed
[Feature][SDK] Transform SQL support arithmetic functions(Including log10, log2, log and exp)
#10607
Closed
github-actions
bot
removed
the
stage/stale
Issues or PRs that had no activity for a long time
label
Jul 19, 2024
This was referenced Aug 8, 2024
Closed
2 tasks
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Motivation
InLong Transform empowers InLong to expand its access and distribution capabilities, adapting to a richer variety of data protocols and reporting scenarios on the access side, and adapting to complex and diverse data distribution scenarios on the distribution side. This improves data quality and collaboration, providing connection, aggregation, filtering, grouping, value extraction, sampling, and other computing capabilities that are decoupled from the computing engine. It simplifies users' pre-processing operations for reporting data, lowers the threshold for data usage, simplifies users' pre-processing operations before starting data analysis, and focuses on the business value of data.
Scenarios
Data Cleansing: During the data integration process, it is necessary to clean data from different sources to eliminate errors, duplicates, and inconsistencies. Transform capabilities can help companies perform data cleansing more effectively and improve data quality.
Data Fusion: Combining data from different sources for unified analysis and reporting. Transform capabilities can handle data in different formats and structures, enabling data fusion and integration.
Data Standardization: Converting data into a unified standard format for cross-system and cross-platform data analysis. Transform capabilities can help companies achieve data standardization and normalization.
Data Partitioning and Indexing: To improve the performance of data queries and analysis, data needs to be partitioned and indexed. Transform capabilities can dynamically adjust field values for partitioning and indexing, thereby improving the performance of the data warehouse.
Data Aggregation and Calculation: During the data analysis process, data needs to be aggregated and calculated to extract valuable information. Transform capabilities can perform complex data aggregation and calculations, supporting multi-dimensional data analysis.
Data Security and Privacy Protection: During the data integration process, it is essential to ensure data security and privacy. Transform capabilities can implement data de-identification, encryption, and authorization management to protect data security and privacy.
Cross-team Data Sharing: For data security reasons, only filtered subsets of data streams are shared; for data dependency decoupling considerations, data interfaces are agreed upon with collaborating teams, dynamically adjusting the merging of multiple streams into the data stream interface.
Feature list
Rich Data Protocols
Decoupling from the Computing Engine
Seamless and Lossless Changes
Automatic Scaling
Task list
InLong Component
Other for not specified component
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: