Machine Learning Data Engineer
Nagish
Description
As a Machine Learning Data Engineer at Nagish, you will build and maintain the data systems that power our video-based ML research. You’ll handle everything from acquiring and cleaning datasets to managing cloud pipelines and ensuring smooth, reproducible workflows for our researchers.
Day to day, you will:
- Source, scrape, ingest, clean, and catalog datasets
- Maintain data pipelines for pose estimation, segmentation, video masking, quantization, metadata, standardization, etc.
- Manage cloud infrastructure including storage, servers, GPU clusters, and backups
- Interface with ML researchers to ensure data pipeline reproducibility
- Handle ingestion from new data recording systems
Requirements
- 3+ years of experience in Python and data infrastructure
- Hands-on experience working with video data (advantage)
- Proven experience managing cloud-based data pipelines and data ingestion workflows
- Experience with SQL databases and cloud buckets
- A PhD in a related field (advantage)
- Independent research capabilities and strong scripting skills
Benefits:
😁 Work on a fulfilling, life-changing product (literally)
🗝️ Join as a key player at an early stage, and receive generous options
🏖 Unlimited time off and sick days
👯‍♂️ Annual company get-together
🐶 Bring your pet to work
About Us:
Nagish makes communication accessible for people who are Deaf or hard of hearing.
Our team is passionate about making the world more accessible with state-of-the-art technology built for consumers and enterprises.
We are backed by some of the best investors out there: Comcast, Techstars, Vertex, Precursor, Contour, Cardumen, and more.