| 社招官网

基础设施事业部-天基-集群和服务器智能运维算法专家-Data Center Predictive Maintenance

发布时间: 2019-08-13 工作地点: 杭州 工作年限: 五年以上
所属部门: 阿里集团 学   历: 本科 招聘人数: 若干

岗位描述:

Description
Alibaba is the world's leading e-commerce company, and is also China's leading cloud computing service provider. Alibaba Infrastructure Service Group is Ali e-commerce platform, cloud computing platform, financial platform and logistics platform's infrastructure provider. We are now trying to build a team with talents who are experienced in machine learning and AI-related areas, as well as distributed systems, to intelligently process and analyze massive amounts of data from data centers and clouds with high performance and accuracy. Our vision is to build a "DC and Cloud Brain" to support intelligent data center and cloud design, maintenance, management and control.


Responsibilities
• Work with large scale, multi-dimensional time-series and sequence data sets captured from all aspects of data centers. Applying cut-edge machine learning and AI techniques to solve non-deterministic, intelligent analysis problems in data centers, enabling accurate, efficient cluster and server maintenance.

• Conduct end-to-end solution design and implementation, start from requirement understanding, data gathering and pre-processing, modeling, online deployment to solution updating based on feedbacks.

• Build capabilities in predictive data center maintenance area, take promotion activities to make world-wide technical impact.


阿里巴巴集团是全球领先的电子商务企业,同时也是中国最大的云计算服务提供商。阿里巴巴基础设施服务部门是阿里电商平台,云计算平台,金融平台和物流平台的基础设施提供者。当前,我们致力于招聘有丰富机器学习、AI和分布式系统研究及工程经验的人才,打造精准、高效的数据中心和云计算海量数据智慧化分析服务能力。 我们的愿景是构建“数据中心和云计算大脑”,使能智慧化数据中心和云计算的架构设计、整体运维、高效管理和精准控制。


岗位职责
• 对数据中心和云计算各个维度大规模时间序列、序列和日志等数据进行分析处理。应用前沿技术创造性的解决各种分析和预测问题,积累大规模数据中心智慧化运维能力。

• 负责端到端的算法和应用解决方案设计和部署实现,从需求分析,数据收集,数据预处理,建模,在线部署到反馈更新形成AI闭环等。

• 在数据中心智慧化运维领域,用先进技术和创新解决方案,构筑技术品牌和全球影响力。


岗位要求:

Basic Qualifications
• MS or PhD in computer science, statistics or equivalent research/practical experience

• Knowledge of or experience in DevOps, data driven data center design, maintenance, management and control

• Strong programming skills in one or more of the following programming languages: Python, R, Scala, C++, Java, etc.

• Creative thinking; eager to learn and willing to turn new ideas into reliable tools and products 

• Excellent communication skills


Preferred Qualifications
• Experience in statistical learning and AI related research and engineering projects, especially in multi-dimensional time-series analysis, sequential pattern mining

• Experience in large scale data center hardware and software intelligent analysis, including anomaly detection, failure prediction, fault localization, etc., by leveraging distributed systems like Spark, Storm, etc..

• Strong publication records in machine learning, AI and distributed systems venues, including but not limited to KDD, WWW, NIPS, ICDE, SIGMOD, ACL, SIGIR, ICML, VLDB, SIGCOMM, NSDI, OSDI/SOSP, ATC, EuroSys, FAST, etc.


优先考虑
• 有丰富的统计机器学习和AI相关研究和工程实践项目背景,尤其擅长多维时间序列分析、序列模式挖掘等算法和分布式系统应用者优先。

• 具有大规模数据中心软硬件智能分析经验,利用分布式集群和系统,如Spark、Storm等,对海量数据进行异常检测、故障预测、故障定界定位等。 

• 具有深厚的研究背景,有高质量的论文发表在顶级学术会议,包括但不限于:KDD, WWW, NIPS, ICDE, SIGMOD, ACL, SIGIR, ICML, VLDB, SIGCOMM, NSDI, OSDI/SOSP, ATC, EuroSys, FAST等。

 

 

申请此职位表明您已阅读并同意阿里巴巴及关联公司的《申请工作机会须知》。

推荐岗位

职位名称 职位类别 工作地点 招聘人数 更新时间
智能服务事业部-算法专家-数据智能 算法 杭州 若干 2019-09-11
智能服务事业部-算法专家-阿里小蜜 算法 杭州 若干 2019-09-11
智能服务事业部-算法专家-智能助理-北京 算法 北京 若干 2019-09-11
业务平台事业部-数据服务-高级算法专家(NLP&知识图谱)-杭州 算法 杭州 若干 2019-07-30
阿里云智能事业群-监控运维开发专家-GTS 算法 杭州 若干 2019-07-25