Job Openings Senior Data Engineer (HCM)

About the job Senior Data Engineer (HCM)

About Us
We're a world-leading smart mobility SaaS tech company with almost 2,000,000 active users. Our teams are collaborative, vibrant, and fast-growing, and all team members are empowered with the freedom to influence our products and technology.

Are you curious, innovative, and passionate?
Do you take ownership, embrace challenges, and love problem-solving?

We are looking for a Senior Data Engineer who will help us build robust pipelines and infrastructure to process and analyze audio and video data, revolutionizing the way our customers use connected technology.

Your Role

  • Design and develop efficient pipelines to ingest audio and video data from various sources, such as microphones, cameras, and cloud storage.
  • Implement techniques for cleaning and preprocessing data, including noise reduction, normalization, and video frame extraction.
  • Extract meaningful features from audio and video data, including acoustic, visual, and temporal attributes.
  • Oversee labeling and annotation of datasets for machine learning models.
  • Build and maintain scalable data pipelines using tools like Apache Airflow and Apache Spark.
  • Design efficient storage solutions for large-scale data, including data lakes and object storage.
  • Collaborate with machine learning engineers to integrate models into production environments.
  • Continuously monitor and optimize data pipelines for performance and scalability.

Your Qualifications

  • Minimum 5 years of relevant experience.
  • Strong experience in Python and data engineering tools (e.g., Pandas, NumPy, SQL).
  • Hands-on experience with audio and video processing libraries (e.g., Librosa, OpenCV).
  • Knowledge of cloud platforms (AWS, GCP, Azure) and their data services.
  • Familiarity with data warehousing and data lake architectures.
  • Excellent problem-solving and analytical skills.
  • Strong ownership mindset and willingness to learn new technologies.
  • Knowledge of machine learning frameworks (e.g., TensorFlow, PyTorch) is a plus.