As a Generative AI Architect, you will play a pivotal role in designing and deploying cutting-edge AI solutions for global enterprises. You will be instrumental in translating complex business challenges into scalable AI architectures, focusing on the latest advancements in Generative AI, LLMs, and NLP. From fine-tuning foundational models to integrating them seamlessly into enterprise ecosystems, your work will bridge innovation with real-world impact.
Key Responsibilities
- Collaborate with clients to understand business objectives and translate them into robust, scalable AI/ML architecture.
- Design end-to-end AI solutions leveraging transformer-based models such as BERT, GPT, LLaMA, and others.
- Architect cloud-native AI deployments using AWS, Azure, or GCP, with a strong emphasis on scalability, performance, and reliability.
- Develop and optimize prompt engineering workflows and Retrieval-Augmented Generation (RAG) pipelines for enhanced LLM performance.
- Fine-tune and operationalize large language models using frameworks such as Hugging Face Transformers, LangChain, or similar.
- Design and implement multi-agent AI systems and collaborative model workflows.
- Work cross-functionally with data scientists, engineers, and product teams to integrate AI into enterprise-grade applications and APIs.
- Define and enforce architectural best practices, including scalability, security, extensibility, and maintainability.
- Evaluate and guide decisions regarding trade-offs between different AI models, tools, and deployment strategies.
- Oversee the implementation of MLOps pipelines, CI/CD for AI models, and ongoing monitoring for production environments.
- Conduct technical reviews, lead architectural discussions, and mentor engineering teams on AI best practices.
- Stay current on industry trends and ethical considerations in AI, contributing to Nagarro’s thought leadership.
Required Skills & Experience
- 10+ years of experience in software architecture with a strong focus on AI/ML systems.
- In-depth understanding of Generative AI fundamentals, transformer architectures, and NLP.
- Hands-on experience working with leading LLMs like GPT, BERT, LLaMA, or equivalents.
- Demonstrated experience in prompt engineering, RAG methodologies, and real-world LLM integration.
- Deep expertise in cloud platforms (AWS, Azure, or GCP), especially in deploying and scaling AI applications.
- Strong knowledge of Python and experience with libraries like PyTorch, TensorFlow, Hugging Face, LangChain, etc.
- Familiarity with multi-agent systems and large-scale distributed AI systems.
- Experience integrating AI/ML models with enterprise systems, APIs, and microservices architectures.
- Solid grounding in MLOps, model lifecycle management, CI/CD pipelines, and infrastructure as code.
- A background in NLP, Knowledge Engineering, and ethical AI development.
- Excellent problem-solving, communication, and collaboration skills.
Nice to Have
- Exposure to domain-specific AI (e.g., healthcare, finance, retail).
- Experience in real-time data streaming or edge AI deployments.
- Knowledge of vector databases and semantic search techniques.
Qualifications
-
Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
Why Nagarro?
- A truly global, remote-friendly work culture that values flexibility and autonomy.
- Opportunities to work on cutting-edge projects with top-tier clients.
- An innovation-driven environment that rewards curiosity, initiative, and continuous learning.
- Flat hierarchies, agile processes, and a collaborative, non-corporate work culture.
- Competitive compensation, benefits, and career advancement paths.
Join Us
If you’re passionate about AI, thrive in a solution-oriented environment, and are ready to architect the next generation of AI-powered platforms, we’d love to hear from you. Help us shape the future, one intelligent solution at a time.