Job Description
Accenture is a leading global professional services company, providing a broad range of services in strategy and consulting, interactive, technology, and operations, with digital capabilities across all of these services. We combine unmatched experience and specialized capabilities across more than 40 industries – powered by the world’s largest network of Advanced Technology and Intelligent Operations centers. With 505,000 people serving clients in more than 120 countries, Accenture brings continuous innovation to help clients improve their performance and create lasting value across their enterprises.
An LLMaaS (Large Language Model as a Service) Infrastructure role is crucial in an AI Cloud project, as it involves setting up and managing the underlying infrastructure to support the deployment, training, and inference of large language models
Key Responsibilities of an LLMaaS Infrastructure Role:
- Hardware: Provisioning and configuring servers with high-performance GPUs (GPUs) to handle the compute-intensive nature of LLMs.
- Software: Installing and configuring necessary software, including operating systems, deep learning frameworks (TensorFlow, PyTorch), and LLM libraries (Hugging Face Transformers).
- Cloud Platform: Setting up and managing cloud infrastructure on platforms like AWS, Azure, or GCP, leveraging their AI-specific services.
- Training Pipeline: Designing and implementing efficient training pipelines, including data preprocessing, model training, and evaluation.
- Model Deployment: Deploying trained models to production environments, ensuring scalability and low latency.
- Model Optimization: Optimizing models for inference, including quantization, pruning, and knowledge distillation.
- Data Ingestion: Ingesting and cleaning large datasets for model training.
- Data Storage: Managing data storage solutions (e.g., object storage, data lakes) to efficiently store and retrieve data.
- Data Security: Implementing robust security measures to protect sensitive data.
- Performance Monitoring: Monitoring the performance of LLM models and infrastructure, identifying bottlenecks and optimizing performance.
- Cost Optimization: Implementing cost-saving strategies, such as spot instances and scheduling.
- Model Retraining: Regularly retraining models to improve accuracy and performance.
- Data Privacy: Ensuring compliance with data privacy regulations (e.g., GDPR, CCPA).
- Security Measures: Implementing security measures to protect sensitive data and prevent unauthorized access.
- Auditing: Conducting regular security audits and vulnerability assessments.
Minimum Qualifications
Required Skills and Experience:
- Experience in DevOps
- Strong understanding of cloud computing platforms (AWS, Azure, GCP)
- Proficiency in machine learning frameworks (TensorFlow, PyTorch)
- Experience with distributed computing systems and containerization (Docker, Kubernetes)
- Knowledge of data engineering and data pipelines
- Understanding of AI ethics and responsible AI practices
- Strong problem-solving and troubleshooting skills
- Good command in English
Perks and Benefits
- Mobile Phone Discount
- Special Leave Benefits for Women
- Single Parent Leave
- Gym Membership
- Life Insurance
- Medical / Health Insurance
- Paid Sick Leave
- Employee Discounts
- Performance Bonus
- Paid Bereavement/Family Leave
- Paid Holidays
- Paid Vacation Leave
- Transportation Allowances
- Maternity & Paternity Leave
- Medical, Prescription, Dental, or Vision Plans
Jobs Summary
- Job Level
- Associate / Supervisor
- Job Category
- IT and Software
- Educational Requirement
- Bachelor's degree graduate
- Office Address
- Wisma 46 Kota BNI—18th floor, Jl. Jendral Sudirman Kav. 1, Jakarta, DKI Jakarta 10220
- Industry
- Management Consulting
- Vacancy
- 2 openings