GENERATIVE AI ENGINEER (MID/SR.)
Position Overview
We are seeking a highly skilled Generative AI Engineer (or AI Solution Leader) to join our advanced technology division. The ideal candidate will have hands-on experience in LLM (Large Language Model)-based system development, Retrieval-Augmented Generation (RAG) architectures, and AI conversational agent implementation.
This position plays a crucial role in designing, prototyping, and leading the implementation of cutting-edge AI solutions that leverage knowledge graphs, vector databases, and natural language understanding to build intelligent, scalable enterprise systems.
Responsibilities
- Design and Develop Generative AI Systems
- Build and optimize RAG architectures integrating LLMs with structured/unstructured data sources.
- Develop knowledge graph-based information retrieval systems and implement vector search solutions (e.g., FAISS, Milvus, Pinecone, VertexAI Vector Search).
- Research & Architecture
- Research state-of-the-art models (e.g., GPT, Claude, Gemini, Llama) and integrate them into enterprise use cases.
- Define the AI system architecture from data ingestion to inference and evaluation.
- AI Conversational Agent Development
- Design and implement intelligent chatbots capable of contextual understanding and domain-specific reasoning.
- Integrate conversational flows with backend systems and APIs.
- Project Leadership
- Collaborate with clients and internal teams to define user requirements and translate them into technical design specifications.
- Lead a small AI team (2–5 engineers) to execute design, implementation, and testing phases.
- Ensure technical quality, scalability, and performance of AI-driven applications.
- Collaboration & Communication
- Work closely with product managers, data engineers, and backend/frontend teams.
- Contribute to proposal creation and proof-of-concept (PoC) activities for client projects.
Job Qualifications
- Core Skills
- Hands-on experience with LLMs and RAG system design.
- Proficiency in Python and frameworks such as LangChain, LlamaIndex, Transformers (Hugging Face), or VertexAI SDK.
- Experience with vector databases (e.g., Pinecone, Weaviate, Milvus, Chroma).
- Understanding of knowledge graph design and semantic search.
- Experience building AI chatbots or conversational systems (Dialogflow, Rasa, custom LLM pipelines).
- Preferred Experience
- Familiarity with cloud AI platforms (Google VertexAI, Azure OpenAI, AWS Bedrock).
- Experience with MLOps, prompt engineering, or model fine-tuning.
- Practical experience deploying LLM applications in production environments.
- Soft Skills
- Strong analytical thinking and research capability.
- Ability to design and lead technical projects independently.
Working time
- 8:00 AM – 12:00 PM | 1:00 PM – 5:00 PM
Benefits
- Compensation & Well-being:
- Probation: insurance covered
- 13th-month salary
- Performance & Salary review twice per year
- 19 days of paid time off per year
- Premium healthcare insurance for yourself and family members
- Annual health check-ups.
- Career Development:
- Sponsored certificates
- English course
- Career roadmap and growth opportunities
- Dynamic working environment:
- Free snacks in the pantry
- Happy hour, Party celebration, Team building, Company trip,…
- Diversified activities: Football, Badminton,…
- Hybrid working model
- Kozocom’s environment: happy, equal, and open-minded
Send your CV to: nhuht@kozo-japan.com