AI Research Scientist
100,000-150,000 yuan
Hong Kong
3+ years of experience
Ph.D.
- Supplementary medical insurance
- Startup
- MPF (Mandatory Provident Fund)
Job Description
Position Overview
We are seeking an experienced AI Research Scientist to lead foundation model development initiatives. The ideal candidate will have hands-on experience in training large-scale models at major tech companies and a proven track record in advancing the state-of-the-art in foundation models.
Key Responsibilities
Lead the architecture design and training of large-scale foundation models
Develop and optimize model training pipelines for distributed systems
Drive research initiatives in model scaling, efficiency, and performance
Implement innovative approaches to improve model capabilities and training efficiency
Collaborate with the engineering team to productionize research breakthroughs
Guide technical decisions related to model architecture and training strategies
Mentor junior researchers and contribute to building our research culture
Required Qualifications
Ph.D. in Computer Science, Machine Learning, or related field
3+ years of experience in training large-scale models at major tech companies, including:
International tech leaders (e.g., Google, Meta, Microsoft, OpenAI, Anthropic) OR
Leading Chinese tech companies (e.g., ByteDance, Alibaba, Baidu, Tencent, SenseTime, Huawei)
Proven experience with distributed training systems and large-scale model optimization
Deep understanding of transformer architectures and their variants
Strong track record in developing and training foundation models
Extensive experience with PyTorch and/or JAX
Publication record in top-tier conferences (NeurIPS, ICML, ICLR)
Preferred Qualifications
Experience with both Chinese and international AI ecosystems
Familiarity with Chinese AI infrastructure (e.g., ModelArts, PAI, ByteMLab)
Background in scaling laws and efficient training strategies
Experience with video generation models or multimodal architectures
Track record of open-source contributions to major ML frameworks
Experience with ML infrastructure design and implementation
Familiarity with mixed-precision training and model parallelism
Experience with custom CUDA kernels and optimization
Technical Expertise
Large-Scale Training: Distributed training frameworks, model parallelism strategies
Infrastructure:
International cloud platforms (AWS/GCP)
Chinese cloud platforms (Alibaba Cloud, Tencent Cloud, Huawei Cloud)
Languages: Python, CUDA, C++ (optional)
Frameworks:
Standard: PyTorch, JAX, DeepSpeed, Megatron-LM
Chinese ecosystem: PaddlePaddle, MindSpore (a plus)
Development Tools: Git, Docker, Kubernetes
Monitoring: Weights & Biases, MLflow, or similar tools
What We Offer
Opportunity to shape the future of foundation models in video generation
Leadership role in technical decision-making
Access to substantial computing resources and infrastructure
Competitive compensation package including equity
Regular collaboration with top researchers in the field
Support for conference attendance and research publication
International exposure and collaboration opportunities
Location
Hong Kong (on-site, Hong Kong Science and Technology Park)
Expected Impact
Drive the development of next-generation foundation models
Lead research initiatives that push the boundaries of model capabilities
Build and mentor a world-class research team
Work Location
Address: 317-318, Building 10W, Hong Kong Science Park, Sha Tin District, Hong Kong
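Job Poster
Mr. Zhang, HR
Video Rebirth Limited
- Computer software
- 11-20 employees
- Wholly foreign-owned enterprise / foreign company representative office
- 317-318, Building 10W, Hong Kong Science Park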