New Integration Request: Azure Table Storage #3066

Closed
geekwhocodes opened this issue Apr 26, 2021 · 7 comments
Labels
area/connectors Connector related issues new-connector

Comments

@geekwhocodes
Contributor

geekwhocodes commented Apr 26, 2021

Tell us about the new integration you’d like to have

  • Which source and which destination?
  • Do you need a specific version of the underlying data source e.g: you specifically need support for an older version of the API or DB?

We use Azure Table Storage extensively in our projects. It would be great to add it as a source and, eventually, as a destination.

Describe the context around this new integration

  • Which team in your company wants this integration, what for? This helps us understand the use case.
  • How often do you want to run syncs?
  • If this is an API source connector, which entities/endpoints do you need supported?

We don't have a specific use case for it at the moment, but in my opinion this is a very generic and widely used service.
REST APIs and a Python client library are available.

Describe the alternative you are considering or using

What are you considering doing if you don’t have this integration through Airbyte?
I came across this awesome project from YC. Currently, I don't have a use case in mind, but this would be good to have.

Note: I would like to work on this source.

┆Issue is synchronized with this Asana task by Unito

@geekwhocodes geekwhocodes added area/connectors Connector related issues new-connector labels Apr 26, 2021
@geekwhocodes
Contributor Author

Please let me know if anyone is already working on it :)

@michel-tricot
Contributor

Hey @geekwhocodes!!

I don't think anyone is working on it and you're most welcome to take a stab at it.

Do you need any help to get started?

@geekwhocodes
Contributor Author

@michel-tricot thanks for confirming. I went through the documentation and I think I am good for now, but
I am sure I will need help in the near future. I will ask on Slack :)

@geekwhocodes
Contributor Author

geekwhocodes commented May 5, 2021

Folks,

I have a few questions regarding the Azure Table Storage connector. I am seeking some guidance on how to proceed.

  1. It does not have anything like a table definition or a predefined schema.
  2. There is no map-reduce-like functionality to fetch all properties.

    You have to read all entities inside the table to find all available properties.
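
To illustrate the two points above, here is a minimal sketch of why a partial read can miss properties. It uses plain in-memory dictionaries as a stand-in for table entities and does not call any Azure SDK; `infer_properties`, `JSON_TYPES`, and the sample data are hypothetical names made up for this example.

```python
from itertools import islice
from typing import Any, Dict, Iterable, Optional

# Hypothetical mapping from Python types to JSON-schema type names.
JSON_TYPES = {bool: "boolean", int: "integer", float: "number", str: "string"}


def infer_properties(
    entities: Iterable[Dict[str, Any]], limit: Optional[int] = None
) -> Dict[str, str]:
    """Union of property names (with a guessed type) over the first
    `limit` entities. With limit=None every entity is scanned."""
    schema: Dict[str, str] = {}
    for entity in islice(entities, limit):
        for name, value in entity.items():
            # Keep the first type seen; unknown types fall back to "string".
            schema.setdefault(name, JSON_TYPES.get(type(value), "string"))
    return schema


# In-memory stand-in for a table whose entities carry different properties.
table = [
    {"PartitionKey": "p1", "RowKey": "1", "Name": "a"},
    {"PartitionKey": "p1", "RowKey": "2", "Name": "b", "Age": 30},
    {"PartitionKey": "p2", "RowKey": "1", "Price": 9.5},
]

sampled = infer_properties(table, limit=1)  # sees only the first entity
full = infer_properties(table)              # scans every entity
```

Here `sampled` never learns about `Age` or `Price`, while `full` does, at the cost of reading the whole table.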

Solutions:

  1. Fetch the first x entities to infer the schema (not reliable).

  2. Fetch/read all entities in the discover method/phase (not efficient).

  3. Table storage provides three system properties: PartitionKey, RowKey, and Timestamp. Can we define a static schema like:
    PartitionKey
    RowKey
    Timestamp
    DynamicProperties - This will be a JSON-serialized string containing all properties and values. (This breaks the original document schema.)

  4. Instead of supporting all tables in a storage account, can we support one table at a time and ask the end user to fill in the column names? (We would fetch the first x entities and infer the property types from them, just like the MongoDB connector does.)
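
As a rough sketch of option 3, the static schema and the per-entity conversion could look like the following. This is only an illustration under assumptions: entities are treated as plain dictionaries, `STATIC_SCHEMA` and `to_static_record` are hypothetical names, and leaving the three system properties out of `DynamicProperties` is a choice made here, not something decided in this thread.

```python
import json
from typing import Any, Dict

SYSTEM_PROPERTIES = ("PartitionKey", "RowKey", "Timestamp")

# Assumed JSON-schema shape for the static stream; all fields typed as strings.
STATIC_SCHEMA = {
    "type": "object",
    "properties": {
        "PartitionKey": {"type": "string"},
        "RowKey": {"type": "string"},
        "Timestamp": {"type": "string"},
        "DynamicProperties": {"type": "string"},
    },
}


def to_static_record(entity: Dict[str, Any]) -> Dict[str, Any]:
    """Keep the system properties as columns and pack everything else
    into one JSON-serialized string."""
    record = {name: entity.get(name) for name in SYSTEM_PROPERTIES}
    dynamic = {k: v for k, v in entity.items() if k not in SYSTEM_PROPERTIES}
    # default=str is a fallback for values json cannot serialize natively.
    record["DynamicProperties"] = json.dumps(dynamic, sort_keys=True, default=str)
    return record


record = to_static_record(
    {
        "PartitionKey": "p1",
        "RowKey": "1",
        "Timestamp": "2021-05-05T00:00:00Z",
        "Name": "a",
        "Age": 30,
    }
)
```

The schema stays fixed no matter which properties an entity carries, but consumers have to parse `DynamicProperties` themselves, which is the trade-off noted in option 3.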

@marcosmarxm
Member

Hey @geekwhocodes, sorry for the delayed reply. I'm not super familiar with Azure Table Storage, but in a way it is similar to S3/GCS storing JSON (slightly, maybe). Can I suggest you take a look at the File source connector implementation? https://docs.airbyte.io/integrations/sources/file
I'll take a look at it later today.

@geekwhocodes
Contributor Author

Thanks @marcosmarxm, I will look into the File connector's implementation.

@marcosmarxm
Member

Implemented #7095


3 participants