Data Engineer

TETRIXX PTE. LTD.
Hey YOU!
At Tetrixx, when it comes to improving the environment and building sustainable solutions, no challenge is too big to take on. Tetrixx is where your years of intense learning come to make sense, giving you real purpose, immense pride in what you do and, more importantly, clarity on why you do it.
We are a data-obsessed, cloud-native startup, swimming intently in the Logistics and Supply Chain waters. Our mission is to revolutionize the way businesses manage their logistics operations through cutting-edge technology solutions. We are seeking a self-driven Data Engineer to join our dynamic team and play a crucial role in shaping the future of Logistics. So, if you are passionate about being an instrumental part of our journey and delivering world-class end-to-end solutions that engage our clients, generate tangible value for them and help make their respective industries a little bit smarter and waste-free, then stop what you are doing and get in touch. We would love to hear from you!
Job Summary:
We are seeking a skilled and motivated Data Engineer to join our growing team. In this role, you will be responsible for designing, building, and maintaining our data infrastructure, with a focus on supporting machine learning initiatives and optimizing our OCR-based data pipeline. You will collaborate with data scientists, machine learning engineers, and other stakeholders to ensure data is accessible, reliable, and efficiently processed.
Key Responsibilities:
- Data Pipeline Development and Maintenance:
  - Design, develop, and maintain robust and scalable data pipelines for ingesting, processing, and transforming large datasets.
  - Optimize and troubleshoot existing data pipelines, including those relying on OCR technology, to improve performance and reliability.
  - Implement data quality checks and monitoring systems to ensure data accuracy and consistency.
  - Utilize tools like Apache Spark, Airflow, or similar to automate data workflows.
- OCR Pipeline Optimization:
  - Analyze and optimize the performance of our OCR-based data extraction pipeline.
  - Implement strategies to improve OCR accuracy and efficiency.
  - Integrate OCR output with downstream data processing and machine learning workflows.
  - Work with text data and document processing.
- Machine Learning Support:
  - Build and maintain data infrastructure to support machine learning model development and deployment.
  - Design and implement data pipelines for feature engineering and model training.
  - Collaborate with data scientists to optimize data access and processing for machine learning tasks.
  - Deploy and maintain ML models.
- Data Storage and Management:
  - Design and implement efficient data storage solutions, including data warehouses and data lakes.
  - Manage data security and access control.
  - Ensure data compliance with relevant regulations.
- Collaboration and Communication:
  - Work closely with data scientists, machine learning engineers, and other stakeholders to understand data requirements.
  - Document data pipelines and processes.
  - Communicate technical concepts effectively to both technical and non-technical audiences.
Qualifications:
Required:
- Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
- Proven experience as a Data Engineer, with a focus on data pipeline development and maintenance.
- Strong proficiency in SQL and Python.
- Experience with distributed data processing frameworks such as Apache Spark.
- Experience with workflow management tools such as Apache Airflow.
- Experience working with Cloud platforms (AWS, Azure, GCP).
- Experience with OCR technologies and pipelines.
- Experience with ML model deployment.
Preferred:
- Experience with containerization and orchestration tools (Docker, Kubernetes).
- Knowledge of machine learning concepts and techniques.
- Experience with data visualization tools.
- Experience with text processing and NLP.
Skills:
- Data pipeline development and optimization.
- OCR pipeline optimization.
- Machine learning support.
- Data warehousing and data lakes.
- SQL, Python, and other relevant programming languages.
- Cloud computing (AWS, Azure, GCP).
- Distributed data processing (Spark).
- Workflow management (Airflow).
- Data security and compliance.
- Strong problem-solving and analytical skills.
- Excellent communication and collaboration skills.
What We Offer:
- Opportunity to work on cutting-edge projects in data science, ML, and OR with real-world impact.
- Opportunity to see firsthand how a highly ambitious startup works and make an impact.
- Mentorship from experienced data scientists and researchers.
- Access to advanced computing resources and tools.
- A dynamic, collaborative, fast-paced and inclusive work environment.
- Potential for future employment opportunities based on performance and availability.
Location: Singapore
Employment Type: Full-time