Cloud Intelligence/AIOps R&D Intern

Job Type: Full-time Intern
Location: Beijing/Shanghai

Group Introduction:
Pioneer cloud intelligence with us! The Microsoft DKI (Data, Knowledge, Intelligence) team leads interdisciplinary research in AIOps and Cloud Intelligence, focusing on critical scenarios including failure prediction, anomaly detection, intelligent diagnosis, capacity planning, and incident management. Our innovations power Microsoft's mission-critical services, such as Azure, M365, and Copilot, through cutting-edge AI technologies. Our recent work, published at OSDI, FSE, ICSE, AAAI, and WWW, has significantly improved cloud service quality.

Job Responsibilities (Choose either Engineering or Research):

Engineering Intern:

  • Analyze cloud service architectures (IaaS/PaaS/SaaS) through code analysis techniques
  • Develop AIOps tools for cloud service management and optimization
  • Build multi-modal big data pipelines integrating logs, traces, and telemetry data
  • Implement capacity planning tools with predictive resource scaling algorithms
  • Implement reliability engineering solutions for Microsoft's core services (Azure, M365, Copilot, etc.)

Research Intern:

  • Innovate AI models for cloud incident management (e.g., incident prediction, diagnosis, root cause analysis)
  • Advance time-series anomaly detection
  • Research autonomous cloud operations with AI
  • Publish breakthroughs at software engineering, systems, and AI conferences

Qualifications:

  • Strong programming skills in Python/Java/C# with cloud development experience
  • Experience in either:
    • Cloud systems (K8s/Docker/Azure) and distributed architectures
    • AI frameworks (PyTorch/TensorFlow) and big data pipeline construction
  • Understanding of cloud service paradigms and operational challenges
  • Publications at software engineering, systems, or AI conferences are preferred
  • Must be available for a full-time internship for 6 months or more.

Internship Duration Requirements:
Must obtain permission from your academic advisor and commit to at least 6 months of internship.

Please send your English or Chinese resume (in PDF/Word format) to: [email protected]. Please include “Cloud Intelligence/AIOps R&D Intern” in the email subject line.

 
岗位名称:智能云计算/AIOps研发实习生 
工作性质:全职实习生 
工作地点:北京/上海

团队介绍: 
加入智能云计算技术前沿!微软DKI(数据、知识、智能)团队致力于人工智能、软件分析与智能运维的跨学科研究。我们与微软产品团队(如Azure、M365、Copilot)深度合作,将一系列创新技术应用在云系统的故障预测、异常检测、智能诊断、容量规划、事故管理等诸多实际应用场景中。研究成果持续发表于人工智能、系统、软件工程领域顶会,并支撑微软云核心服务的智能化演进。

岗位职责(方向可选):

工程实习生:

  • 运用代码分析技术解析云服务架构(IaaS/PaaS/SaaS)
  • 开发面向云服务管理与优化的智能运维(AIOps)工具
  • 构建集成日志、追踪与遥测数据的多模态大数据工程
  • 实现基于预测性资源伸缩算法的容量规划工具
  • 开发微软核心服务(Azure/M365/Copilot等)可靠性工程方案

研究实习生:

  • 研发云事故管理AI模型(如事故预测、智能诊断、根因分析等)
  • 推进时序异常检测算法创新
  • 研究AI驱动的自主云运维技术
  • 在软件工程/系统/人工智能领域顶会发表突破性成果

岗位要求:

  • 扎实的Python/Java/C#开发能力,具备云平台经验
  • 满足以下任一领域专长:
    • 云系统(K8s/Docker/Azure)与分布式架构
    • 人工智能框架(PyTorch/TensorFlow)与数据工程
  • 理解云服务范式与运维挑战
  • 在系统/软件工程/编程语言/AI领域顶会有论文发表者优先
  • 每周全职工作5天,能够保证实习6个月以上

实习时间要求: 
需获得导师许可,并保证至少6个月的实习周期。

请将完整的中/英文简历(PDF/Word形式)发送至:[email protected]。邮件标题中请注明“智能云计算/AIOps研发实习生”。