Machine Learning Engineer

AiLogic Neural Network Pvt Ltd

HyderabadFull-timeMid LevelOn-site

Job Description

About Us We are building advanced transformer models focused on multilingual translation and custom transformer architectures. Our team works on large-scale NLP and transformer-based models with a strong focus on research, experimentation, model optimization, and production-grade AI systems. We are looking for a highly skilled Machine Learning Engineer who is passionate about designing, developing, and optimizing AI/ML models, working with transformer architectures, and contributing to cutting-edge NLP research and development.

This is a core ML engineering role β€” not a prompt engineering or API integration role. Role Overview As a Machine Learning Engineer, you will be responsible for developing, training, fine-tuning, evaluating, and deploying transformer-based NLP models and Large Language Models (LLMs). You will work closely with AI researchers, data scientists, and software engineering teams to build scalable multilingual AI solutions.

The ideal candidate should have strong hands-on experience with ML model development pipelines, data preprocessing, tokenization techniques, transformer architectures, GPU-based training environments, and model optimization techniques. A strong interest in experimentation, improving model performance, and deploying AI systems at scale is essential. Key Responsibilities Design, develop, and optimize transformer-based AI/ML models for NLP and multilingual applications.

Build and maintain end-to-end machine learning pipelines for data processing, model training, evaluation, and deployment. Train and fine-tune transformer models on large-scale custom datasets. Work with Seq2Seq, encoder-decoder architectures, attention mechanisms, and modern LLM architectures.

Optimize models using techniques such as LoRA, QLoRA, PEFT, quantization, and distributed training approaches. Develop tokenization pipelines using BPE, SentencePiece, and other subword tokenization methods. Evaluate model performance using BLEU, perplexity, accuracy, and custom evaluation benchmarks.

Collaborate with infrastructure teams to manage GPU environments, distributed training, and scalable AI deployments. Implement efficient model serving, inference optimization, and production-ready ML solutions. Work closely with researchers and engineers to experiment with new architectures, improve existing models, and enhance system performance.

Required Qualifications 2+ years of experience in Machine Learning, Deep Learning, NLP, or AI model development. Strong hands-on experience with Transformer architectures, LLMs, Seq2Seq models, and attention mechanisms. Proven experience in training and fine-tuning models using custom datasets.

Strong programming skills in Python and experience with ML development practices. Hands-on experience with PyTorch or TensorFlow and Hugging Face Transformers. Experience working with vector databases such as Pinecone, Milvus, or similar technologies for embeddings and semantic search.

Experience with GPU acceleration, distributed training, and NLP evaluation methodologies. Understanding of software engineering principles, version control, and production deployment workflows. Strongly Preferred Skills Experience with LoRA, QLoRA, PEFT, and model quantization techniques.

Knowledge of DeepSpeed, FSDP, or other distributed training frameworks. Experience with Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF). Experience building multilingual NLP systems and machine translation models.

Knowledge of speech models, sequence modeling, or other advanced AI architectures. Contributions to AI/ML research papers, open-source projects, or publications. This Role Is NOT Focused On Prompt engineering as the primary responsibility.

LangChain-only application development. OpenAI/GPT API integrations without hands-on model training. Building only RAG pipelines without core model development.

Low-code or no-code GenAI workflows. Tech Stack Python PyTorch / TensorFlow Hugging Face Transformers DeepSpeed / PEFT CUDA & GPU Training Environments AWS / Cloud Infrastructure Docker & Kubernetes Vector Databases (Pinecone, Milvus) NLP & ML Tooling What We’re Looking For Strong ownership and problem-solving mindset. Ability to independently develop and optimize ML models.

Strong analytical and experimentation skills. Deep understanding of transformer architectures and modern NLP systems. Ability to bridge AI research with scalable production-grade machine learning systems.

Passion for building next-generation multilingual AI solutions.

Posted 1 weeks ago

Related Jobs

Related Searches

Apply Now