Job Openings
NLP Engineer _ Machine Learning Engineer|5+ years| Remote
About the job NLP Engineer _ Machine Learning Engineer|5+ years| Remote
Job description:
NLP Engineer / Machine Learning Engineer Document Understanding & Knowledge Graphs
- Overview Were looking for a hands-on NLP/ML engineer to lead the development of an intelligent document understanding pipeline for extracting structured data from complex, unstructured RFQ documents (40100+ pages, in German and English).
- You will be responsible for building scalable systems that combine document parsing, layout analysis, entity extraction, and knowledge graph construction ultimately feeding downstream (e.g. Analytics and LLM applications.)
- Key Responsibilities - - - - - -
- Design and implement document hierarchy and section segmentation pipelines using layout-aware models (e.g., DocLayout-YOLO, LayoutLM, Donut).
- Build multilingual entity recognition and relation extraction systems across both English and German texts.
- Use tools like NLTK, transformers, and spaCy to develop custom tokenization, parsing, and information extraction logic.
- Construct and maintain knowledge graphs representing semantic relationships between extracted elements using graph data structures and graph databases (e.g. Neo4j) Integrate outputs into structured LLM-friendly formats (e.g., JSON, Mark Down) for downstream extraction of building material elements.
- Collaborate with product and domain experts to align on information schema, ontology, and validation methods. What Were Looking For - - - -
- Strong experience in NLP, document understanding, and information extraction from unstructured/multilingual documents.
- Proficiency in Python, with experience using libraries such as transformers, spaCy, and NLTK. Hands-on experience with layout-aware models like DocLayout-YOLO, LayoutLM, Donut, or similar.
- Familiarity with knowledge graphs and graph databases such as Neo4j, RDF