Senior Data Engineer
We are seeking a highly skilled Senior Data Engineer to join a mission-critical client project, acting as a bridge between complex business requirements and scalable technical execution. In this role, you will collaborate closely with Data Architects, Engineering leads, and Stakeholders to design and maintain sophisticated data models that enhance accessibility and governance. Your expertise will be the foundation of a high-performance data mesh environment, driving measurable business impact and ensuring data quality across the entire enterprise ecosystem.
About First Factory
We are a software development company with over two decades of experience, boasting a dynamic team of 175+ professionals actively engaged in diverse projects across various industries. We invite you to join us on this journey as we thrive and embrace fresh challenges.
Key Responsibilities
Design and develop efficient and scalable data pipelines between enterprise transactional systems, third-party sources, and analytics platforms
Must be an expert programmer in one of the following: Go, Node.js, or Python
Must be proficient in writing and refactoring efficient SQL queries.
Must be able to explain features of good data model design.
Build and maintain a data environment for speed, accuracy, consistency, and uptime
Support analytics and data science by building a world-class data mesh environment that empowers analysts to derive insights into revenue and power products across the organization.
Integrate third-party data sources and APIs into the Bright data mesh ecosystem
Work closely with the Data Science team and participate in the development of feature engineering pipelines.
Design and develop data products with modern AWS cloud technologies such as S3, Redshift, EMR, Hive, Presto, Flink, and Spark
Work with the machine learning engineering team to build a data ecosystem that supports AI products at scale.
Design and deploy an enterprise data warehouse that supports internal and market-facing analytics products at scale
Ensure data governance principles are adopted and that data quality checks and data lineage are implemented at each hop of the data flow
Partner with adjacent organizations to ensure proper integration and adherence to standards
Stay current with emerging trends in data management and cloud technologies, and participate in the evaluation of new technologies
Ensure compliance through the adoption of enterprise standards and promotion of best practices / guiding principles aligned with the organization's standards
Requirements
8+ years of experience as a Data Engineer at an innovative organization
4+ years of hands-on experience in implementing data lake systems using AWS cloud technologies such as S3, Redshift, EMR, Hive, Kafka, and Spark
Expertise in managing AWS services (EC2, S3, Route 53, ELB, VPC, CloudWatch, Lambda) in a multi-account production environment
Experience with development frameworks as well as data and integration technologies such as Informatica, Python, and Scala
Ability to create new ETLs in AWS Glue with Python or Node.js as the scripting language
Ability to create AWS Lambdas using Python or Node.js as the scripting language
Ability to modify existing ETLs to fix issues where that approach is appropriate
Ability to use Glue for ETLs within AWS, to and from all types of AWS data sources
Ability to support the migration of data into S3, Redshift, DynamoDB, and AWS RDS
Experience with machine learning libraries and frameworks (TensorFlow, MLlib) is an added advantage
Exposure to R, sparklyr, and other R packages is a plus
Expert knowledge of Agile approaches to software development and the ability to put key Agile principles into practice to deliver solutions incrementally
Ability to monitor industry trends and directions, and to develop and present substantive technical recommendations to senior management
Excellent analytical thinking, interpersonal, and oral and written communication skills, with a strong ability to influence both IT and business partners
Ability to prioritize and manage work to critical project timelines in a fast-paced environment
Advanced knowledge of Microsoft SQL Server for future migration to an AWS Database Platform
Nice to have
Previous experience with cloud development (AWS, GCP)
Previous experience in design and deployment of data lakes, data mesh, data warehouses, and streaming platforms
Previous experience with data quality projects and public records
Previous experience with AWS DynamoDB, AWS Elastic MapReduce, AWS Lambda, AWS Step Functions, AWS Redshift, AWS RDS, Terraform, or CloudFormation
AWS Architect Certification is a plus
- Department: Software Engineering
- Role: Data Warehouse - Data Engineer
- Locations: Heredia
- Remote status: Hybrid
About First Factory
For over 25 years, First Factory has been a place where collaborative excellence meets modern technologies. We’re a strong team building exceptional software solutions from Costa Rica and LATAM for primarily US-based clients. With industry-low turnover, top eNPS globally, and 5 consecutive Inc. 5000 awards, we foster an environment where talented engineers thrive on challenging projects using modern tech stacks.