Data Engineer - Remote

DeepSource · Remote (India)
Software Development

WFA Digital Insight

The demand for skilled data engineers has skyrocketed in recent years, with a 25% increase in job postings in 2025 alone. As the remote job market continues to evolve, professionals with skills such as Python, PySpark, and SQL are in high demand. DeepSource, a cutting-edge company, is now seeking a talented Data Engineer to join its team. With the rise of big data and cloud computing, this role offers an opportunity to work on complex data pipelines and infrastructure. Before applying, candidates should be aware of the company's focus on innovation and collaboration, as well as the need for strong problem-solving skills and attention to detail.

Job Description

About the Role

As a Data Engineer at DeepSource, you will be responsible for designing, building, and maintaining the company's data architecture and infrastructure. This will involve working closely with the data science team to develop and deploy data solutions that drive business growth. The ideal candidate will have a strong background in data engineering, with experience in tools such as Python and PySpark.

The role is remote, allowing you to work from anywhere in India, and offers a unique opportunity to work with a talented team of professionals who are passionate about data and technology. You will be part of a dynamic and fast-paced environment, where innovation and collaboration are encouraged.

The team you will work with is responsible for developing and maintaining the company's data pipelines and infrastructure. You will report to a Senior Data Engineer and will have the opportunity to work on a wide range of projects, from data warehousing to machine learning.

What You Will Do

  • Design and implement data architecture and infrastructure to meet the company's growing data needs
  • Develop, test, and deploy data solutions using Python, PySpark, and SQL
  • Collaborate with the data science team to develop and deploy data models and algorithms
  • Work with the engineering team to integrate data solutions with other systems and applications
  • Develop and maintain data pipelines and workflows using tools such as Databricks and Azure Data Factory
  • Optimize data storage and retrieval systems for better performance and scalability
  • Ensure data quality and integrity by implementing data validation and testing procedures
  • Collaborate with cross-functional teams to identify and prioritize data needs and projects
  • Stay up to date with emerging trends and technologies in data engineering and make recommendations for implementation

What We Are Looking For

  • 3+ years of experience in data engineering or a related field
  • Strong proficiency in Python, PySpark, and SQL
  • Experience with data manipulation and analysis using Pandas and other libraries
  • Knowledge of distributed data processing using PySpark and other frameworks
  • Familiarity with Databricks and Azure Synapse Spark for big data workloads
  • Experience with Azure Data Factory for data integration workflows
  • Strong understanding of data modeling and database design principles
  • Experience with data warehousing and ETL processes
  • Familiarity with cloud-based data platforms such as Azure and AWS

Nice to Have

  • Experience with Terraform for Azure and other cloud-based infrastructure management tools
  • Knowledge of NoSQL databases and graph databases
  • Experience with machine learning and deep learning frameworks such as TensorFlow and PyTorch
  • Familiarity with containerization and orchestration using Docker and Kubernetes
  • Experience with agile development methodologies and version control systems such as Git

Benefits and Perks

  • Competitive salary and benefits package
  • Opportunity to work with a talented team of professionals
  • Flexible working hours and remote work options
  • Professional development and training opportunities
  • Access to the latest tools and technologies
  • Recognition and reward for outstanding performance
  • Comprehensive health insurance and wellness programs
  • Generous PTO and holiday policy
  • Remote stipend and equipment allowance

How to Stand Out

  • Tip: Highlight your experience with Python, PySpark, and SQL in your resume and cover letter, as these are essential skills for the role.
  • Tip: Be prepared to discuss your experience with data pipelines and infrastructure, and how you have optimized data storage and retrieval systems in previous roles.
  • Tip: Showcase your ability to work collaboratively with cross-functional teams and to communicate complex technical concepts to non-technical stakeholders.
  • Tip: A strong understanding of data modeling and database design principles is key, so be prepared to discuss your experience with data warehousing and ETL processes.
  • Tip: If you have experience with Terraform or other cloud-based infrastructure management tools, be sure to highlight this in your application, as it is a desirable skill for the role.
  • Tip: Prepare to discuss your experience with agile development methodologies and version control systems, and how you have used these in previous roles to manage and collaborate on projects.
  • Tip: Be prepared to provide examples of how you have optimized data solutions for better performance and scalability, and how you have ensured data quality and integrity in previous roles.

This is a remote position listed on WFA Digital, the platform for professionals who work from anywhere.