Member of Security (Corporate Technology)
RemoteMember of Engineering (Pre-training / Data Research)
RemotePoolside is hiring a Data Quality Engineer to join their data team focused on improving pretraining datasets for their coding agents and AI models aimed at accelerating software development. You'll own dataset quality through synthetic data generation and mix optimization, running experiments to identify what data best improves model capabilities, while collaborating closely with Pretraining, Posttraining, Evals, and Product teams. You need strong ML and Python engineering skills with hands-on experience building trillion-scale pretraining datasets, understanding of LLM training dynamics (transformers, scaling laws, data curation, deduplication), and familiarity with large GPU clusters and distributed data pipelines; research papers or published work in deep learning/LLMs is a nice-to-have. Poolside is a well-funded AGI-focused startup founded in the US with a distributed team across Europe and North America that does mandatory in-person collaboration in Paris 3 days each month (Monday-Wednesday), plus annual longer off-sites. This is a hands-on technical role with access to massive GPU infrastructure and the chance to define what data fuels frontier models, ideal if you're deeply curious about dataset design and LLM pretraining mechanics.
Member of Engineering (Pre-training / Data Acquisition)
Remote · London, United KingdomPoolside is hiring its first dedicated data acquisition engineer to build and operate large-scale systems for collecting pre-training data for frontier LLMs focused on software development. You'll design and run web crawlers at massive scale, develop specialized deep crawlers for high-value sources, build monitoring and debugging tooling, and collaborate with pre-training and research teams to align data sourcing with model training needs. Required experience includes a strong distributed systems background with proven large-scale infrastructure work, proficiency in Python with performance optimization skills, hands-on web crawling or data extraction experience, and familiarity with AWS and container orchestration (Kubernetes/Docker). The company is a well-funded AI startup with 15+ people distributed across Europe and North America, holding mandatory monthly in-person collaboration in Paris (Mon-Wed) plus annual off-sites. This is an unusually foundational role at a company betting everything on becoming an AGI leader through developer-focused AI agents, with the unique constraint that you'll be the first person dedicated to this critical upstream function.
Member of Engineering (Design Engineer, Full Experience)
RemoteMember of Engineering (Design Engineer, Product)
RemoteMember of Engineering (Technical Support Engineer)
RemoteMember of Engineering (Reinforcement Learning Infrastructure)
RemoteMember of Engineering (Reinforcement Learning)
RemoteMember of Engineering (Post-training)
RemoteMember of Engineering (Evaluations / Engineering)
RemoteMember of Engineering (Evaluations)
RemoteMember of Engineering (Pre-training / Data Engineering)
Remote · London, United KingdomMember of Engineering (Pre-training / Synthetic Data)
RemoteMember of Engineering (Scalability)
Remote
Want Poolside roles matched to you?
Swoopd scores fresh postings against your résumé so you only see the matches that matter.