All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
LLM Reinforcement Learning Fine-Tuning DeepSeek Method GRPO
40 views
11 months ago
git.ir
24:05
ORPO: NEW DPO Alignment and SFT Method for LLM
4.9K views
Mar 24, 2024
YouTube
Discover AI
0:28
The Truth About LLM Alignment: SFT, RLHF, and DPO
277 views
3 months ago
YouTube
Ryan Banze
12:30
How does DPO improve the LLM's performance? | Simple Explanation
198 views
Jan 29, 2025
YouTube
MLWorks
LLM Fine-Tuning Mastery: Basic to Advanced & Cloud Deploy
8 months ago
git.ir
0:14
Aligning LLMs with Human Preferences
9 views
1 month ago
YouTube
The AI Opus
58:07
Aligning LLMs with Direct Preference Optimization
34.1K views
Feb 8, 2024
YouTube
DeepLearningAI
39:41
ORPO Explained: Superior LLM Alignment Technique vs. DPO/RLHF
3K views
Apr 9, 2024
YouTube
AI Anytime
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs dir
…
31.5K views
Jun 21, 2024
YouTube
Serrano.Academy
6:26
NEW WizardLM-2 8x22B: Fine-tune & Stage-DPO align
2.5K views
Apr 15, 2024
YouTube
Discover AI
41:28
LLMs | Alignment of Language Models: Contrastive Learning | Le
…
1.6K views
Sep 26, 2024
YouTube
LCS2
40:55
Fast Fine Tuning and DPO Training of LLMs using Unsloth
5.9K views
Mar 25, 2024
YouTube
AI Anytime
12:55
DPO Coding | Direct Preference Optimization (DPO) Code impleme
…
404 views
Mar 19, 2025
YouTube
AILinkDeepTech
IBM experts break down LLM benchmarks and best practices | I
…
Sep 17, 2024
ibm.com
3:36:14
LLM Fine-Tuning Crash Course: Finetune model on PDFs, Instructi
…
7.6K views
3 months ago
YouTube
Sunny Savita
4:46
136.LLM Post-Training专题:DPO的微调流程
1.4K views
3 months ago
bilibili
文言AI
5:08
LLM Alignment Methods - DPO vs IPO vs KTO vs PCL
1.6K views
Jan 27, 2024
YouTube
Fahd Mirza
1:20:54
LLM Alignment (RLHF, DPO, ORPO) + Hands-on Project
9.7K views
4 months ago
YouTube
BrainOmega
1:44:33
LLM Alignment|综述及RLHF、DPO、UNA的深入分析
1.7K views
Nov 19, 2024
bilibili
你到这干嘛来了
45:24
[UCLA RL-LLM] Chapter 3.1: Reinforcement learning from hum
…
2.2K views
8 months ago
YouTube
Ernest Ryu
3:28
Enhancing Song Generation in LLMs using DPO-based Multi-Pref
…
7 views
2 months ago
YouTube
Quang Phạm Việt
13:42
GRPO 2.0? DAPO LLM Reinforcement Learning Explained
6.1K views
1 year ago
YouTube
AI Papers Academy
19:39
Reinforcement Learning, RLHF, & DPO Explained
16.7K views
Jun 12, 2024
YouTube
Mark Hennings
22:35
Train LLM Easily with Llama Factory LORA, SFT, DPO, etc. | GUI Traini
…
752 views
Feb 11, 2024
YouTube
Code Port
11:12
LLM实时在线DPO微调教程 - 实战演示
195 views
Sep 5, 2024
bilibili
比特光锥_BightCone
1:16:38
LLM Marathon series : PPO vs DPO: Understanding RLHF and Large L
…
262 views
May 29, 2024
YouTube
Lingo Research Group, IITGN
36:25
Direct Preference Optimization (DPO): Your Language Model is S
…
19.2K views
Aug 10, 2023
YouTube
Gabriel Mongaras
8:35
[Transformers] LLM Transformers - The Essential LLM technical guide
…
202 views
4 months ago
YouTube
AI Podcast Series. Byte Goose AI.
4:58
构建大语言模型,DPO训练方法,原理和实现
16K views
Nov 1, 2023
bilibili
蓝斯诺特
32:51
Interactive: Let’s Learn about llm-d ft Christopher Nuland | Demo Dee
…
474 views
5 months ago
YouTube
OpenShift
See more videos
More like this
Feedback