Local Ingestion Agent
The Local Ingestion Agent is designed to facilitate metadata ingestion for hybrid deployments, allowing organizations to securely push metadata from their infrastructure into the Collate platform without exposing their internal systems. It provides a secure and efficient way to run ingestion workflows while keeping data processing within your network.
The main difference between the Local Ingestion Agent and the Hybrid Runner (described on the Hybrid SaaS docs page) is that the Local Ingestion Agent can be set up quickly on your own laptop, without relying on any other infrastructure.
Overview
The Collate Ingestion Agent is ideal for scenarios where connectors must run on-premises, because metadata is processed within your own infrastructure. This reduces data privacy concerns and streamlines the ingestion process. With the Collate Ingestion Agent, you can:
- Set up ingestion workflows easily from a UI.
- Handle the end-to-end metadata extraction workflows: metadata, profiling, lineage, usage, dbt, auto classification and data quality.
Setting Up the Collate Ingestion Agent
1. Prepare Your Environment
You need to be able to run Docker images on your laptop. Run the following commands:
- Log in to Docker with the credentials provided by Collate. You can reach out to [email protected] to get your credentials.
- Run the Docker image to start the Local Agent:
2. Configure the Agent
Access the Local Agent UI:
- Open your browser and navigate to http://localhost:8001
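If the page does not load, it can help to confirm that something is actually listening on that address before debugging further. The following is a minimal sketch, not part of the agent itself: the function name `agent_ui_reachable` is made up for illustration, and it only assumes the agent serves plain HTTP at the URL given above.

```python
import http.client
import urllib.error
import urllib.request

# Address from the step above; change it if you mapped the agent to another port.
AGENT_UI_URL = "http://localhost:8001"


def agent_ui_reachable(url: str = AGENT_UI_URL, timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a 2xx status, so it is up.
        return True
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, DNS failure, malformed reply, and similar.
        return False
```

Calling `agent_ui_reachable()` with no arguments checks the default address. A `False` result usually means the container is not running or the port is not published to the host.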
Set Up the Connection:
- Enter your Collate platform URL (e.g., https://(your-company).getcollate.io/api).
- Add the ingestion bot token, which you can find in Collate under Settings > Bots > Ingestion Bot.
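Most connection failures at this step come from a mistyped URL or a token pasted with stray whitespace. The sketch below checks both values before you enter them. It is illustrative only: the helper names are hypothetical, and it assumes the URL must follow the pattern of the example above (HTTPS, ending in `/api`), which the source shows as an example rather than a stated rule.

```python
from urllib.parse import urlparse


def collate_api_url(company: str) -> str:
    """Build the platform URL in the form shown above for a given subdomain."""
    company = company.strip().lower()
    if not company or not company.replace("-", "").isalnum():
        raise ValueError(f"unexpected company subdomain: {company!r}")
    return f"https://{company}.getcollate.io/api"


def check_connection_settings(api_url: str, bot_token: str) -> list[str]:
    """Return the problems found in the two values; an empty list means none."""
    problems = []
    parsed = urlparse(api_url)
    if parsed.scheme != "https":
        problems.append("URL should start with https://")
    if not parsed.netloc:
        problems.append("URL has no host name")
    # Assumption: the /api suffix from the example is required.
    if parsed.path.rstrip("/") != "/api":
        problems.append("URL should end with /api")
    if not bot_token.strip():
        problems.append("ingestion bot token is empty")
    elif bot_token != bot_token.strip():
        problems.append("ingestion bot token has leading or trailing whitespace")
    return problems
```

For example, `check_connection_settings(collate_api_url("acme"), token)` returns an empty list when both values look well-formed. It does not contact the platform, so a well-formed but revoked token still passes.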
3. Add a New Service
- Navigate to the Database Services section in the Ingestion Agent UI.
- Click Add New Service and select the database type.
- Enter the necessary service configuration.
- Test the connection to ensure the service is properly configured.
4. Run Metadata Ingestion
- After creating the service, navigate to the Ingestion tab and click Add Ingestion.
- Select the ingestion type (e.g., metadata) and specify any additional configurations for the workflow.
- Run the ingestion and monitor the logs in real time to track its progress.
5. Verify Ingested Data
- Return to the Collate platform at https://(your-company).getcollate.io and refresh the database services page. Your new service should now appear there.
- Verify that the ingested metadata, including schemas, tables, and column details, is available.
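The same check can be scripted instead of done in the browser. The sketch below is an assumption-heavy illustration rather than a documented Collate procedure: it assumes the platform exposes the OpenMetadata-style REST endpoint `/v1/services/databaseServices` under the `/api` URL, that it accepts the ingestion bot token as a Bearer token, and that the response lists services under a `data` key. All function names are hypothetical.

```python
import json
import urllib.request


def build_list_services_request(api_url: str, bot_token: str) -> urllib.request.Request:
    """Prepare a GET request for the database services list (endpoint is assumed)."""
    url = api_url.rstrip("/") + "/v1/services/databaseServices"
    headers = {"Authorization": f"Bearer {bot_token}", "Accept": "application/json"}
    return urllib.request.Request(url, headers=headers)


def service_names(payload: dict) -> list[str]:
    """Extract service names from a response shaped like {"data": [{"name": ...}]}."""
    return [item["name"] for item in payload.get("data", [])]


def service_is_listed(api_url: str, bot_token: str, service_name: str,
                      timeout: float = 10.0) -> bool:
    """Return True if `service_name` appears in the first page of results."""
    request = build_list_services_request(api_url, bot_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    # Only the first page is inspected; long service lists may need paging.
    return service_name in service_names(payload)
```

A call such as `service_is_listed("https://(your-company).getcollate.io/api", token, "my_service")`, with the placeholder replaced by your own subdomain, would then confirm that the service created by the agent is visible on the platform.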
Additional Features
The Collate Ingestion Agent supports various ingestion workflows, allowing you to:
- Generate YAML Configurations: Download YAML files for external scheduling.
- Manage Ingestion Types: Run metadata, profiling, lineage, usage, and other workflows as needed.
- Monitor Progress: View logs and monitor real-time ingestion activity.
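As an illustration of external scheduling, a downloaded YAML file could be run from a cron job or an orchestrator task through a small wrapper like the one below. This is a sketch under stated assumptions: it assumes the downloaded file is compatible with the OpenMetadata ingestion CLI (`metadata ingest -c <file>`) and that the CLI is installed on the scheduling host. Neither is confirmed by this page, and the function names are hypothetical.

```python
import subprocess
from pathlib import Path


def ingest_command(config_path: Path) -> list[str]:
    """Command line for a metadata workflow (assumes the OpenMetadata CLI)."""
    return ["metadata", "ingest", "-c", str(config_path)]


def run_downloaded_workflow(config_path, runner=subprocess.run) -> int:
    """Run one downloaded YAML workflow and return the process exit code."""
    config_path = Path(config_path)
    if config_path.suffix not in {".yaml", ".yml"}:
        raise ValueError(f"expected a YAML file, got {config_path.name!r}")
    if not config_path.is_file():
        raise FileNotFoundError(config_path)
    # check=False so the scheduler can decide how to react to a non-zero exit.
    completed = runner(ingest_command(config_path), check=False)
    return completed.returncode
```

The `runner` parameter exists only so the wrapper can be exercised without the CLI installed. A scheduler would call `run_downloaded_workflow("path/to/workflow.yaml")` and treat a non-zero return value as a failed run.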