Job VC
Middle/Senior Data Engineer
Technologies
Description
Project:
The client is developing an AI-powered investment intelligence platform to help venture capital firms and angel investors streamline startup discovery and deal-flow sourcing. The product aggregates and structures data from multiple sources, including accelerator programs, SEC Form D filings, and company profiles, allowing users to identify relevant investment opportunities more efficiently.
The platform includes features such as natural language alerts, automated sourcing workflows, and AI-driven insights. The solution is built with Next.js, Vercel, Clerk, and Stripe. It is already live with paying customers, and the team continues to expand its automation and agentic UI capabilities.
Cooperation:
Long-term.
Stage:
Existing product, early-stage / actively growing.
Position:
New role.
Tech Stack:
Python, Supabase, GCP Cloud Run, OpenAI API, LLM Pipelines, Web Scraping, ETL.
Timezone Requirements:
Possible 50/50 overlap between Kyiv and New York (EST) working hours.
Location Requirements:
Remote.
English:
Advanced — strong spoken English required.
Requirements:
4+ years of experience in data engineering
Strong Python skills
Experience with web scraping at scale (Playwright, Scrapy or similar)
Hands-on experience with LLM API integration (OpenAI, Anthropic or similar)
Experience building ETL / data pipelines
GCP experience (Cloud Run or managed services)
Supabase or PostgreSQL — data storage and schema design
Ability to work independently and own projects end-to-end
Advanced spoken English
Responsibilities:
Build new datasets — structured scraping of accelerators, Form D filings, company profiles
Parse and extract content using LLMs, define schemas in Supabase
Create eval frameworks to benchmark LLM performance across pipeline calls
Design and improve scalable pipeline architecture on GCP
Work independently within 2-week sprints with daily check-ins
Benefits from 8allocate:
Team & Culture: Team events, offsites, and a culture that keeps people connected.
Learning & Development: Budget for courses, certifications, and conferences.
Wellbeing: Flexible support in line with company policy, with options to support your physical and mental wellbeing (sport, mental health, or medical insurance).
Rest & Recovery: Paid vacation and sick leave.