Lugal — Managed Data Pipeline Platform

Private Preview

Submit Your Pipeline.
We Handle Everything Else.

Lugal runs large-scale, long-running data pipelines on cloud compute you never have to manage. Send us a config, get back results and a transparent bill — no orphaned instances, no surprise costs.

Talk to Us About Your Pipeline Request a Demo

From Config to Results — No DevOps Required

Lugal abstracts away every layer of infrastructure so researchers and data teams can focus entirely on their science.

Submit

Upload your data to S3 and send a pipeline config via API or CLI. Lugal validates the spec and queues it immediately.

Execute

Jobs launch in sequence on dedicated cloud compute. Each job passes its output to the next — no manual handoffs.

Monitor

Heartbeats and structured logs flow back in real time. If a job fails, Lugal records the halt point and waits for your instruction.

Deliver

Your output lands in S3. Compute shuts down the moment the pipeline completes. You receive results and an itemised bill.

Use Cases

Lugal Is Built for These Workloads

Long-running, data-intensive pipelines that need reliable execution, automatic cost tracking, and full traceability.

Bioinformatics & Genomics

The challenge

Sequencing workflows involve many tools — FastQC, alignment, variant calling, annotation — each needing different compute and producing outputs that feed the next step. Running dozens of samples in parallel while containing per-sample failures requires significant pipeline infrastructure.

How Lugal helps

one_per_input mode runs each sample through the full pipeline independently. Failed samples retry without touching clean ones. Outputs flow between steps automatically via ${taskName} references — no manual path wiring between tools.

HIPAA-compatibleGxP audit trail

Research Computing

The challenge

Large-scale analyses — feature computation, statistical workflows, data transformations — need more compute than local machines can provide. Manually managing cloud VMs is slow, and forgetting to terminate instances leads to surprise charges.

How Lugal helps

Lugal provisions dedicated compute per job and destroys it the moment the job finishes. Failed steps restart from the exact failure point. Every run produces a complete execution record — inputs, outputs, timing — for reproducibility and peer review.

Automatic teardownReproducible runs

Data Science Batch Workloads

The challenge

Feature engineering, model training, and evaluation pipelines have multiple interdependent steps that need different compute profiles. Managing intermediate data between steps adds fragile handoffs that are hard to monitor and debug.

How Lugal helps

Each pipeline step is a discrete job with defined inputs and outputs. Data flows automatically between steps via S3. You receive a per-step cost breakdown — not just a total bill — so you can see exactly which step consumed what compute.

Per-step cost trackingAutomatic data handoffs

Simulation & Parameter Sweeps

The challenge

Running the same simulation across hundreds of parameter configurations requires tracking which inputs produced which outputs, identifying failed runs, and reproducing exact results months later without rebuilding the environment.

How Lugal helps

Each configuration runs as a discrete job. Inputs, outputs, and timing are captured in the execution record. Failed configurations retry independently, and the plan.json maps every output back to its exact input set.

Parallel executionInput/output traceability

Compliance-Sensitive Data Operations

The challenge

Regulated workflows — clinical data processing, GxP research, HIPAA-covered pipelines — need data lineage, encryption, access controls, and audit trails as engineering requirements. Most pipeline platforms weren't designed to provide this.

How Lugal helps

Audit trails are first-class outputs. Input data expires automatically after 30 days. Every resource carries tags mapping to client, project, and run. The execution record provides timestamped evidence of exactly what ran, when, and on what input.

21 CFR Part 11HIPAA-compatibleGDPR-friendly

Full traceability. No Surprises.

Lugal was designed around a single principle: every unit of compute must be traceable to a billable job. There are no silent retries, no orphaned instances, and no untracked resources left running after a pipeline finishes.

✓

Every AWS resource created is recorded before it starts

✓

Resources are tagged to client → project → pipeline → job

✓

Cleanup runs automatically on success or failure

✓

Billing exports are a first-class output, not a separate report

✓

Restart from the failed job — not from the beginning

What You Get at the End of Every Run

📦

Pipeline Outputs

Final results in your S3 bucket — files, reports, or whatever your jobs produce.

📋

Full Execution Record

A plan.json capturing every task, its inputs, outputs, timing, and exit state.

📊

Itemised Bill

Every AWS cost driver mapped to your pipeline, job, and task. No line item goes unexplained.

🔍

Logs & Audit Trail

Structured logs and heartbeat history for every job, queryable long after the compute is gone.

Transparent Billing

Every Dollar Traced to a Job

Lugal is designed so every unit of compute maps back to a specific job, pipeline, client, and execution record. The goal is to eliminate surprise cloud usage and make long-running workloads financially traceable.

🧾

Per-Job Billing Records

Each pipeline run generates a billing record tied to exactly the jobs that ran. Compute costs accumulate per job — not per pipeline as a flat unit — so you can see which step cost what.

🔍

Failed Runs Are Visible

A job that fails mid-run still records the compute it consumed. Retries are logged separately. You never receive a bill for invisible work — every charge has a corresponding execution record.

🛑

No Orphaned Compute

Lugal tracks every AWS resource it creates before it starts. Cleanup runs automatically on success or failure — no EC2 instances left running, no EBS volumes silently accruing charges.

🏷️

Tagged to Client and Project

Every resource carries tags for client, project, pipeline, and job. This makes cost allocation straightforward whether you're running a single project or billing across multiple clients.

Pricing Is Custom, Not Hidden

Lugal does not publish a fixed rate card because pipeline costs depend on your compute requirements — instance types, job duration, data volume, and retry behaviour all vary by workload. We scope pricing per engagement and provide a cost estimate before any work begins.

There is no invisible markup. The itemised bill you receive maps directly to the AWS resources Lugal provisioned on your behalf.

✓

Cost estimate provided before work starts

✓

No surprise line items — every charge is explained

✓

Job-level detail in every final bill

✓

Retries and failures reported separately

✓

Billing exports available after every run

Execution Model

What Happens During a Run

Submit a config. Lugal handles everything — from provisioning to teardown — and delivers a complete record when the pipeline finishes.

Submit

Upload data to S3 and send a job.json via API or CLI. Lugal validates the schema and queues the pipeline immediately.

Validate

Task names, input paths, execution modes, and dependency references are verified before any compute starts.

Provision

A dedicated EC2 instance launches per job. Every resource is tagged: client → project → pipeline → job.

Execute

Tasks run in sequence. Each task reads inputs from S3, runs your code or tools, and writes outputs back to S3.

Monitor

Heartbeats and structured logs flow back in real time. Failures trigger retries up to the configured limit — each attempt recorded.

Teardown

Compute is destroyed the moment the pipeline completes — success or failure. No orphaned instances, no lingering costs.

Deliver

Outputs arrive in your S3 bucket. Lugal sends a plan.json execution record and an itemised billing report covering every job that ran.

job.json

{
  "name": "variant-calling-run-042",
  "tasks": [
    {
      "name": "fastqc",
      "mode": "one_per_input",
      "restarts": 1
    },
    {
      "name": "align",
      "mode": "one_per_input",
      "inputs": [
        {
          "path": "${fastqc}",
          "filter": "*.fastq.gz"
        }
      ],
      "restarts": 2
    },
    {
      "name": "call_variants",
      "inputs": [
        {
          "path": "${align}",
          "filter": "*.bam"
        }
      ]
    },
    {
      "name": "annotate",
      "args": { "genome": "hg38" }
    }
  ]
}

one_per_inputRuns one task instance per input file — if fastqc receives 6 samples, 6 tasks execute concurrently.

"path": "${fastqc}"References the output directory of the fastqc task. No manual path wiring required.

restarts: 2Lugal retries this task up to 2 times on failure. Each attempt is recorded separately in plan.json.

Pipelines Are Simple to Define

A Lugal pipeline is a JSON config — a named list of jobs, each with its own tools, inputs, and outputs. No YAML sprawl, no DAG frameworks to learn.

Pipeline

The top-level unit. Has a name, a client, and an ordered list of jobs. Submitted once via API.

Client & project attribution
Ordered job sequence
Resource quotas & retry policy

Job

A single unit of work that runs on its own EC2 instance. Reads from S3, writes back to S3.

Dedicated compute per job
Defined inputs & outputs
Configurable parallelism

Task

The individual steps within a job — your actual tools and scripts, run in sequence.

Your code, your tools
Automatic input/output wiring
Per-task retry & failure handling

Deployment Model

Deployment and Security Model

Lugal runs in a Telos Sphere-managed AWS account. You bring data and a pipeline config — we own every layer of infrastructure.

Where Your Workload Runs

Lugal runs in a Telos Sphere-managed AWS account. You do not need to set up, configure, or share any cloud infrastructure. You bring your data and a pipeline config — everything else is on our side of the account boundary.

We provision the VPC, S3 buckets, EC2 instances, and IAM roles per engagement. When a pipeline run completes, compute is destroyed. Input data carries a 30-day automatic expiry; you retrieve outputs and records directly from S3.

✓

Data uploads to S3 over HTTPS — unencrypted transport is rejected at the bucket level

✓

Job instances run in a private VPC — S3 traffic routes through a VPC Gateway endpoint, never the public internet

✓

No API keys or long-lived credentials are shared between parties

✓

Every provisioned resource is tagged client → project → pipeline → job and destroyed after the run

Private VPC

Job instances have no inbound ports. Access is managed exclusively via AWS Systems Manager — no SSH exposure.

Encrypted at rest

All S3 buckets use AES-256 server-side encryption. Public access is blocked at the account and bucket level.

IAM instance profiles

EC2 instances authenticate to AWS via IAM instance profile. No hardcoded credentials exist on any compute node.

Automatic teardown

EC2 instances are terminated immediately when the pipeline finishes — on success or failure. Nothing persists between runs.

What Is Retained After a Run

📋

Execution record

A plan.json file capturing every task, inputs, outputs, timing, exit code, and retry history — delivered to your S3 output bucket.

📊

Itemised billing report

A per-job cost breakdown mapping every AWS charge to the specific pipeline, job, and task that incurred it.

📡

Per-job metrics

Heartbeats, structured logs, and resource metrics are labeled per job and retained in Lugal's monitoring stack for post-run diagnostics.

Private Preview

Available for Early Technical Conversations

Lugal is currently available for selected early technical conversations and private-preview use cases.

If you run long-running, data-heavy workloads and want better cost visibility, auditability, and operational control — contact us for a pipeline review.

Request a Pipeline Review Get in Touch

Submit Your Pipeline.We Handle Everything Else.

From Config to Results — No DevOps Required

Submit

Execute

Monitor

Deliver

Lugal Is Built for These Workloads

Bioinformatics & Genomics

Research Computing

Data Science Batch Workloads

Simulation & Parameter Sweeps

Compliance-Sensitive Data Operations

Full traceability. No Surprises.

What You Get at the End of Every Run

Every Dollar Traced to a Job

Per-Job Billing Records

Failed Runs Are Visible

No Orphaned Compute

Tagged to Client and Project

Pricing Is Custom, Not Hidden

What Happens During a Run

Pipelines Are Simple to Define

Deployment and Security Model

Where Your Workload Runs

What Is Retained After a Run

Available for Early Technical Conversations

Submit Your Pipeline.
We Handle Everything Else.