Skip to main content

Showing 1–5 of 5 results for author: Sidahmed, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.10704  [pdf, other

    cs.LG cs.AI cs.CL

    PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Authors: Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has proven to be a strong method to align Pretrained Large Language Models (LLMs) with human preferences. But training models with RLHF is computationally expensive, and an overall complex process. In this work, we study RLHF where the underlying models are trained using the parameter efficient method of Low-Rank Adaptation (LoRA) introduced by Hu… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  2. arXiv:2312.10007  [pdf, other

    cs.CL cs.LG

    Faithful Persona-based Conversational Dataset Generation with Large Language Models

    Authors: Pegah Jandaghi, XiangHai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed

    Abstract: High-quality conversational datasets are essential for developing AI models that can communicate with users. One way to foster deeper interactions between a chatbot and its user is through personas, aspects of the user's character that provide insights into their personality, motivations, and behaviors. Training Natural Language Processing (NLP) models on a diverse and comprehensive persona-based… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  3. arXiv:2305.07961  [pdf, other

    cs.IR cs.CL cs.LG

    Leveraging Large Language Models in Conversational Recommender Systems

    Authors: Luke Friedman, Sameer Ahuja, David Allen, Zhenning Tan, Hakim Sidahmed, Changbo Long, Jun Xie, Gabriel Schubiner, Ajay Patel, Harsh Lara, Brian Chu, Zexi Chen, Manoj Tiwari

    Abstract: A Conversational Recommender System (CRS) offers increased transparency and control to users by enabling them to engage with the system through a real-time multi-turn dialogue. Recently, Large Language Models (LLMs) have exhibited an unprecedented ability to converse naturally and incorporate world knowledge and common-sense reasoning into language understanding, unlocking the potential of this pa… ▽ More

    Submitted 16 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

  4. arXiv:2110.03450  [pdf, ps, other

    cs.LG

    Efficient and Private Federated Learning with Partially Trainable Networks

    Authors: Hakim Sidahmed, Zheng Xu, Ankush Garg, Yuan Cao, Mingqing Chen

    Abstract: Federated learning is used for decentralized training of machine learning models on a large number (millions) of edge mobile devices. It is challenging because mobile devices often have limited communication bandwidth and local computation resources. Therefore, improving the efficiency of federated learning is critical for scalability and usability. In this paper, we propose to leverage partially… ▽ More

    Submitted 7 November, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: V2: minor wording and format improvement

  5. arXiv:2102.03448  [pdf, other

    cs.LG cs.DC

    Federated Reconstruction: Partially Local Federated Learning

    Authors: Karan Singhal, Hakim Sidahmed, Zachary Garrett, Shanshan Wu, Keith Rush, Sushant Prakash

    Abstract: Personalization methods in federated learning aim to balance the benefits of federated and local training for data availability, communication cost, and robustness to client heterogeneity. Approaches that require clients to communicate all model parameters can be undesirable due to privacy and communication constraints. Other approaches require always-available or stateful clients, impractical in… ▽ More

    Submitted 27 April, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021). Code: https://github.com/google-research/federated/tree/master/reconstruction