Documentation Index
Fetch the complete documentation index at: https://docs.getcollate.io/llms.txt
Use this file to discover all available pages before exploring further.
Local Ingestion Agent
The Local Ingestion Agent lets you securely push metadata from your own infrastructure into Collate — without exposing your internal systems or relying on additional server infrastructure. It’s the right choice when you need to run connectors on-premises and want to be up and running from your local system.
Unlike the Hybrid Ingestion Runner, the Local Ingestion Agent doesn’t require a Kubernetes cluster or any cloud infrastructure — just Docker.
With the Local Ingestion Agent, you can:
- Set up ingestion workflows from a UI.
- Run end-to-end metadata extraction workflows: metadata, profiling, lineage, usage, dbt, auto-classification, and data quality.
Before you start: Contact Collate support to get your Docker credentials before setting up the Local Ingestion Agent.
Setting Up the Collate Ingestion Agent
Follow these steps to install, connect, and configure the Local Ingestion Agent on your machine.Step 1: Install and Start the Agent
You need Docker installed on your laptop. Run the following commands:-
Authenticate with the Docker registry using the credentials provided by Collate support. For example, the following command shows the registry URL specific to AWS provided by Collate:
-
Run the Docker image to start the Local Agent:
Replace
<collate-version>with your Collate instance version. For example,1.11.1.
Step 2: Connect the Agent to Collate
Generate a JWT token from your Collate instance and use it to authenticate the agent.- In your web browser, navigate to
http://localhost:8001to open the local agent UI. - Enter your Collate platform URL (for example,
https://<your-company>.getcollate.io/api). - In your Collate instance, navigate to Settings > Bots and click Add Bot.
- Enter Email, Display Name, Token Expiration, and Description, then click Create.
- Assign the appropriate roles – typically DefaultBotPolicy and Ingestion Bot Policy.
- Copy and save the JWT token for later use.

Step 3: Add a New Service
Add a data service in the Local Collate UI and verify the connection before running ingestion.- Navigate to Settings > Services and select your service type (for example, Databases).
- Click + Add New Service and choose your connector type, then click Next.
- Enter the unique Name and Description, then click Next.
- Fill the connection details.
- Click Test Connection to verify the connection is successful.
Step 4: Verify Ingested Data
Confirm that metadata from your data source is now visible in the Collate platform.- Return to the Collate platform at
https://<your-company>.getcollate.ioand refresh the database services page. Your new service will appear there now. - Verify that the ingested metadata, including schemas, tables, and column details, is available.
Step 5: Configure Metadata Ingestion
After your agent is connected and your service is set up, proceed to Step 2 - Configure Metadata Ingestion to schedule your first ingestion pipeline.Additional Features
The Collate Ingestion Agent supports various ingestion workflows, allowing you to:- Generate YAML Configurations: Download YAML files for external scheduling.
- Manage Ingestion Types: Run metadata, profiling, lineage, usage, and other workflows as needed.
- Monitor Progress: View logs and monitor real-time ingestion activity.