Dodge Construction Network is HIRING A

Senior ML & Automation Engineer

πŸ“ India 🌐 Fully Remote ⏰ Full Time
POSTED May 22, 2026

Please mention you found this job on TestDev Jobs. It helps us get more people to hire on our site. Thanks and good luck!


Dodge Construction Network (Dodge) is looking for a Senior ML & Automation Engineer to design and operate AI-powered pipelines that transform how we acquire, enrich, and validate construction project intelligence. This role blends machine learning, document automation, and conversational AI β€” spanning internal operational tooling today and expanding into automated outbound AI communications to the field. You will partner closely with Data Engineering and Operations to move pipelines from prototype to production and maintain them rigorously at scale.

This is a full-time position and reports directly to the Director, Data Operations.

Preferred Location

This is a remote, home-office based role and candidates located in India will be considered.

Travel Requirements

Occasional travel to the Kochi office is required.

Essential Functions

Document Intelligence & Extraction

  • Design, develop, and evaluate machine learning models to automate data enrichment, classification, and validation across structured and unstructured project documents
  • Implement OCR, NLP, and layout recognition pipelines to extract metadata, contacts, deadlines, and technical requirements from plan sets, specifications, and bid documents
  • Build Python-based classification microservices to categorize documents by type and extract structured fields (e.g., bid dates, scope of work, discipline sheets, spec sections)
  • Integrate LLM APIs (AWS Bedrock, Anthropic Claude, or equivalent) for intelligent extraction and classification tasks; optimize prompts and model calls for accuracy and cost efficiency
  • Own model performance monitoring for one or more document domains β€” tracking accuracy drift, false positives, and cost per-document over time

Conversational AI & AWS Connect

  • Design and implement conversational AI solutions using Amazon Connect and Amazon Lex, including contact flows, IVR design, and agent assist integrations for internal operational tooling
  • Build and iterate on automated outbound AI calling workflows to collect project updates from contractors, subcontractors, and field contacts β€” capturing structured responses and routing them into data pipelines
  • Ensure all outbound communication automation is implemented in compliance with applicable regulations (TCPA, Do Not Call rules, B2B communication standards); partner with Legal and Compliance ahead of any production deployment
  • Design conversation scripts with dynamic branching logic and fallback handling; continuously improve containment rates and data capture quality
  • Monitor call performance, intent recognition accuracy, and fallback rates; iterate on Lex models and contact flows based on outcomes

Data Pipelines & Entity Resolution

  • Build pipelines that integrate scraped and API-sourced project data with external enrichment sources (ZoomInfo, LinkedIn, government open data APIs) to enrich company, contact, and project records
  • Implement entity resolution and record deduplication logic β€” including fuzzy name matching, license number anchoring, and cross-source reconciliation β€” to maintain a clean entity master
  • Develop automation scripts and microservices to reduce manual effort in project matching, contact discovery, and quality checks
  • Collaborate with Data Engineers to ensure ML pipelines integrate seamlessly with existing data warehouses (Redshift) and meet latency and cost targets
  • Partner with data specialists to design feedback loops that validate and continuously improve model outputs

Education Requirement

Bachelor’s degree in a related field or equivalent education and work experience.

Required Experience, Knowledge and Skills

  • 5+ years of experience in machine learning, automation engineering, or a closely related discipline
  • Proficiency in Python with hands-on experience using ML libraries (scikit-learn, spaCy, TensorFlow, or PyTorch) and production API integration
  • Hands-on experience with OCR frameworks β€” Tesseract, PaddleOCR, AWS Textract, or Google Document AI
  • Demonstrated experience implementing AWS Connect solutions β€” including contact flow design, Amazon Lex bot development, and IVR configuration
  • Practical knowledge of LLM APIs (AWS Bedrock, OpenAI, Anthropic, or equivalent) for production extraction or classification workloads
  • Familiarity with document layout analysis tools (LayoutLM, Donut, DocTR, or similar)
  • Strong knowledge of entity extraction, NER, regex-based parsing, and rules-based approaches
  • Experience with entity resolution, deduplication, or fuzzy record matching at scale
  • Strong knowledge of data pipelines and ETL frameworks; experience deploying and monitoring ML models in production
  • Solid understanding of relational databases and SQL; experience with large-scale warehouses (Redshift, Snowflake, or similar)
  • Awareness of outbound communication compliance (TCPA, Do Not Call regulations) in automated or AI-driven calling contexts
  • Strong problem-solving skills with the ability to translate operational business needs into ML and automation solutions

Preferred Experience, Knowledge and Skills

  • Experience with outbound AI calling automation β€” proactive conversational agents, dynamic call scripting, and structured data capture from voice interactions
  • Familiarity with government and open data APIs (Socrata, ArcGIS, Legistar, Granicus, or similar) for large-scale public data ingestion
  • Experience with AWS analytics and ML services β€” SageMaker, Comprehend, Lambda, Step Functions
  • Awareness of inference cost optimization and batch processing strategies for LLM-powered production pipelines
  • Exposure to CI/CD and MLOps tooling β€” MLflow, Git, Docker, Kubernetes
  • Prior experience with sales intelligence or contact enrichment data (ZoomInfo, LinkedIn, or comparable sources)
  • Experience in Agile delivery environments using Jira or Confluence

About Dodge Construction Network

Dodge Construction Network exists to deliver the comprehensive data and connections the construction industry needs to build thriving communities. Our legacy is deeply rooted in empowering our customers with transformative insights, igniting their journey towards unparalleled business expansion and success. We serve decision-makers who seek reliable growth and who value relationships built on trust and quality. By combining our proprietary data with cutting-edge software, we deliver to our customers the essential intelligence needed to excel within their respective landscapes. We propel the construction industry forward by transforming data into tangible guidance, driving unparalleled advancement.

Dodge is the catalyst for modern construction.

Please mention you found this job on TestDev Jobs. It helps us get more people to hire on our site. Thanks and good luck!