Profile Picture
  • All
  • Search
  • Images
  • Videos
    • Shorts
  • Maps
  • News
  • More
    • Shopping
    • Flights
    • Travel
  • Notebook
Report an inappropriate content
Please select one of the options below.
  • Length
    AllShort (less than 5 minutes)Medium (5-20 minutes)Long (more than 20 minutes)
  • Date
    AllPast 24 hoursPast weekPast monthPast year
  • Resolution
    AllLower than 360p360p or higher480p or higher720p or higher1080p or higher
  • Source
    All
    Dailymotion
    Vimeo
    Metacafe
    Hulu
    VEVO
    Myspace
    MTV
    CBS
    Fox
    CNN
    MSN
  • Price
    AllFreePaid
  • Clear filters
  • SafeSearch:
  • Moderate
    StrictModerate (default)Off
Filter
Fast Fine Tuning and DPO Training of LLMs using Unsloth
40:55
Fast Fine Tuning and DPO Training of LLMs using Unsloth
5.4K viewsMar 25, 2024
YouTubeAI Anytime
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained
Direct Preference Optimization: Your Language Model is Secretly …
37.5K viewsDec 22, 2023
YouTubeAI Coffee Break with Letitia
Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs dir…
27.7K viewsJun 21, 2024
YouTubeSerrano.Academy
Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math
48:46
Direct Preference Optimization (DPO) explained: Bradley-Terry m…
32.1K viewsApr 14, 2024
YouTubeUmar Jamil
Fine-tuning LLMs on Human Feedback (RLHF + DPO)
28:53
Fine-tuning LLMs on Human Feedback (RLHF + DPO)
18.1K views9 months ago
YouTubeShaw Talebi
Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model Explained
36:25
Direct Preference Optimization (DPO): Your Language Model is S…
18.9K viewsAug 10, 2023
YouTubeGabriel Mongaras
Reinforcement Learning, RLHF, & DPO Explained
19:39
Reinforcement Learning, RLHF, & DPO Explained
13.3K viewsJun 12, 2024
YouTubeMark Hennings
21:15
DPO直接偏好优化算法 (动画讲解)
8K viewsOct 26, 2024
bilibili数源创域
23:17
完全从零开始实现DPO算法,不依赖trl库,已经实现预训练、SFT、DP…
18.4K viewsDec 12, 2024
bilibili偷星九月333
37:40
DPO Pay by Network x Odoo: Levelling up digital payments in A…
1.2K views5 months ago
YouTubeOdoo
See more videos
Static thumbnail place holder
More like this
Feedback
  • Privacy
  • Terms