A hybrid Data & ML role at Brillio.
How Sydicom helps: we read this listing’s requirements and tune your CV and cover letter to the keywords its ATS (Lever) is scanning for, for candidates in India, then help you apply.
Original listing text, shown exactly as published by the company.
1. Data Engineering & Pipeline Development
Design and develop scalable ETL/ELT pipelines using PySpark and Spark SQL
Build and maintain Databricks workflows (Jobs) for orchestration
Implement Delta Live Tables (DLT) for declarative pipeline development
Develop and manage batch and streaming data pipelines
2. Lakehouse Architecture Implementation
Design and implement Medallion Architecture (Bronze, Silver, Gold layers)
Build curated datasets for analytics and reporting
Optimize storage using Delta Lake best practices
3. Data Ingestion & Integration
Databases (RDBMS, NoSQL)
APIs and streaming platforms (Kafka, Event Hubs)
Files (CSV, JSON, Parquet)
Handle structured and semi-structured data efficiently
4. Delta Lake & Performance Optimization
ACID transactions
Schema enforcement and evolution
Change Data Capture (CDC)
Partitioning strategies
Caching and broadcast joins
File compaction and indexing (Z-ORDER)
5. Data Quality & Governance
Implement data validation and quality checks
Ensure compliance with data governance standards
Use Unity Catalog for access control and data lineage
Maintain auditability and data traceability
6. Monitoring & Reliability
Build logging, monitoring, and alerting for pipelines
Troubleshoot failures and optimize performance
Ensure high availability and fault tolerance
7. Collaboration & Delivery
Work closely with Data Analysts, Data Scientists, and stakeholders
Participate in Agile ceremonies (Sprint planning, stand-ups, retrospectives)
Bachelor’s degree in Computer Science, Engineering, or related field
4–8+ years of experience in Data Engineering
Databricks platform
PySpark and Spark SQL
Delta Lake
ETL/ELT pipeline development
Distributed data processing
Data modeling (Star/Snowflake schemas)
Data warehousing concepts
Experience with Delta Live Tables (DLT)
Knowledge of CI/CD pipelines (Azure DevOps, GitHub Actions)
Experience with streaming frameworks (Kafka, Spark Streaming)
Azure / AWS / GCP
Experience with MLflow and MLOps workflows
Domain experience (e.g., Healthcare, Finance, Retail)
🛠️ Technical Skills
Languages: Python, SQL
Frameworks: Apache Spark
Tools: Databricks, Delta Lake, MLflow
Data Formats: Parquet, Delta, JSON, Avro
Orchestration: Databricks Workflows / Airflow
Version Control: Git
💡 Soft Skills
Strong problem-solving and analytical thinking
Excellent communication skills
Ability to work in a collaborative environment
Attention to detail and data quality
Experience with real-time analytics
Exposure to data governance tools
Certification in Databricks or Cloud platforms
Specialization
Brillio
Data & ML
99 open roles on Sydicom
Brillio is an Indian-owned company focused on digital technologies and big data analytics headquartered in Santa Clara, California, United States.
Source: Wikipedia