Data Engineer
We are seeking versatile, experienced Data Engineers to support the design, development, and optimization of our modern enterprise data platforms. This role is critical for building scalable data pipelines, implementing robust lakehouse architectures, and ensuring high-quality data is available for analytics and reporting across the organization. The ideal candidate is a collaborative problem-solver with a strong technical background in ETL/ELT processes, dimensional modeling, and cloud-based data ecosystems.
About First Factory
We are a software development company with over two decades of experience, boasting a dynamic team of 175+ professionals actively engaged in diverse projects across various industries. We invite you to join us on this journey as we thrive and embrace fresh challenges.
Key Responsibilities
Pipeline Development: Design, build, and maintain optimal ETL/ELT pipelines to ingest data from diverse sources, including APIs, flat files, and relational databases.
Architecture Management: Develop and manage lakehouse medallion architectures (Bronze, Silver, Gold layers) to support analytics and data science use cases.
Data Modeling: Build and optimize star schemas, conformed dimensions, and fact tables to enable performant reporting and business intelligence.
Data Quality & Governance: Implement built-in data quality checks, automated testing, and governance practices, including data lineage and security mechanisms for sensitive data.
Stakeholder Collaboration: Partner with cross-functional teams, including product owners, data analysts, and architects, to translate business requirements into technical solutions.
Operational Excellence: Monitor system performance, troubleshoot pipeline failures, and promote best practices in platform reliability and CI/CD workflows.
Requirements
3-5+ years of professional experience in data engineering or data warehouse development.
Proficiency in SQL (complex transformations, window functions) and Python or PySpark for data processing.
Hands-on experience with at least one major cloud service provider, such as AWS, Microsoft Azure, or Microsoft Fabric.
Data Technologies: Strong understanding of medallion architectures and 'big data' technologies like Apache Spark.
Database Knowledge: Experience working with relational database platforms (e.g., SQL Server, MySQL, Oracle).
Soft Skills: Excellent written and verbal English communication skills with the ability to work in a fast-paced, dynamic environment.
Nice to have
Experience with Databricks (Mosaic AI, Lakeflow) or legacy Microsoft stacks (SSIS, SSAS, SSRS).
Familiarity with Power BI semantic models or stream-processing systems like AWS Kinesis and Azure Stream Analytics.
Knowledge of Apache Kafka, Hadoop, Hive, or NoSQL databases like MongoDB and Cassandra.
Background in Financial Services, Retail, or QSR (Quick Service Restaurant) data ecosystems.
Familiarity with agentic coding tools (e.g., Claude Code, OpenAI Codex) and modernization of legacy tools into cloud-based architectures.
- Department
- Software Engineering
- Role
- Data Warehouse - Data Engineer
- Locations
- Heredia
- Remote status
- Hybrid
About First Factory
For over 25 years, First Factory has been a place where collaborative excellence meets modern technologies. We’re a strong team building exceptional software solutions from Costa Rica and LATAM for primarily US-based clients. With industry-low turnover, top eNPS globally, and 5 consecutive Inc. 5000 awards, we foster an environment where talented engineers thrive on challenging projects using modern tech stacks.