Infrastructure Engineer Intern (Compute Infrastructure - Cloud-Native) - 2026 Start (PhD)
- ByteDance
- San Jose, California
- Full Time
About Team: The Compute Infrastructure team uses Kubernetes and Serverless technologies to build a large, reliable, and efficient compute infrastructure. This infrastructure powers hundreds of large-scale clusters globally, with millions of online containers and offline jobs running daily, including AI and LLM workloads. The team is dedicated to building cutting-edge, industry-leading infrastructure that empowers AI innovation, ensuring high performance, scalability, and reliability to support the most demanding AI/LLM workloads. The team is also dedicated to open-sourcing key infrastructure technologies, including projects in the K8s portfolio such as kubewharf (KubeBrain, Katalyst, Godel, etc.).
We are looking for talented individuals to join us for an internship in 2026. Internships at ByteDance aim to offer students industry exposure and hands-on experience. Turn your ambitions into reality as your inspiration brings infinite opportunities at ByteDance.
Internships at ByteDance aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths. A vibrant blend of social events and enriching development workshops will be available for you to explore. Here, you will utilize your knowledge in real-world scenarios while laying a strong foundation for personal and professional growth.
PhD internships at ByteDance provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies. Our dynamic internship experience blends hands-on learning, enriching community-building and development events, and collaboration with industry experts. Applications will be reviewed on a rolling basis, so we encourage you to apply early. Please state your availability clearly in your resume (start date, end date).
Responsibilities: This internship gives students the opportunity to work on one of many innovation projects supporting diverse Cloud-Native applications, including containers, VMs, microservices, big data, and AI/LLM. Projects include, but are not limited to:
- Ultra-large-scale Kubernetes cluster management platform
- Next-Gen AI-Native Godel K8s scheduler with AI intelligence built-in
- Intelligent node-level management & scheduling system for heterogeneous resources (CPU/GPU, memory bandwidth, network bandwidth, power, etc.)
- Performance optimization for container runtimes and container image distribution
- K8s control/data plane stability & reliability with automatic & intelligent observability tools