Airbyte CDK (Deprecated)
Note: AirbyteCDKLoader
is deprecated. Please use AirbyteLoader
instead.
Airbyte is a data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. It has the largest catalog of ELT connectors to data warehouses and databases.
A lot of source connectors are implemented using the Airbyte CDK. This loader allows to run any of these connectors and return the data as documents.
Installation
First, you need to install the airbyte-cdk
python package.
%pip install --upgrade --quiet airbyte-cdk
Then, either install an existing connector from the Airbyte Github repository or create your own connector using the Airbyte CDK.
For example, to install the Github connector, run
%pip install --upgrade --quiet "source_github@git+https://github.com/airbytehq/airbyte.git@master#subdirectory=airbyte-integrations/connectors/source-github"
Some sources are also published as regular packages on PyPI