Learn about the HumanEval LLM benchmark with Empirical
593 views
Apr 4, 2024
YouTube
Arjun Attam
8:13
#22. LLM Benchmarks Explained | Top Open-Source LLMs & How to
…
6 views
2 months ago
YouTube
Tech With Mala
1:10
BEST AI MODEL FOR CODING : 2023-2026 (HumanEval Benchmark)
1.1K views
1 month ago
YouTube
Learn AI / ML
11:02
LLM benchmarks
1.2K views
Mar 24, 2024
YouTube
Vivek Haldar
What Are LLM Benchmarks? | IBM
Jan 29, 2024
ibm.com
4:18
LLM Benchmarks: What You MUST Know Before Creating AI Agents!
…
1.5K views
Feb 25, 2025
YouTube
GetGenerative
19:54
LLM Evaluation Basics Part 2: Understanding Three Key Approa
…
2.6K views
9 months ago
YouTube
Business Data Science with Delali
5:50
7 Popular LLM Benchmarks Explained [OpenLLM Leaderboar
…
27K views
Jan 9, 2024
YouTube
bycloud
16:30
Optimize Coding LLM for Reasoning or Tools?
1.9K views
8 months ago
YouTube
Discover AI
19:14
Learn to Evaluate LLMs and RAG Approaches
25.6K views
Nov 5, 2023
YouTube
AI Anytime
AutoCoder Code Interpreter can install external library
May 30, 2024
reddit
randommagnet1234
32:15
Reza Shabani - How Replit Trained Their Own LLMs (LLM Bootcamp)
11.8K views
May 25, 2023
YouTube
The Full Stack
15:12
[Dafny'25] Dafny as Verification-Aware Intermediate Language for
…
319 views
10 months ago
YouTube
ACM SIGPLAN
26:19
Evaluate LLMs with Language Model Evaluation Harness
8.6K views
May 12, 2024
YouTube
AI Anytime
16:15
Task-Aware LLM Council with Adaptive Decision Pathways for D
…
24 views
3 weeks ago
YouTube
AI Papers Podcast Daily
16:14
The NEW BEST Base LLM??? (DeepSeek LLM)
6.4K views
Nov 29, 2023
YouTube
1littlecoder
1:38
CodeQwen 1.5: Advanced Coding LLM with Impressive 7B Paramete
…
137.7K views
May 3, 2024
TikTok
techfren
9:39
Phind-70B: BEST Coding LLM Outperforming GPT-4 Turbo + Ope
…
13.5K views
Feb 23, 2024
YouTube
WorldofAI
0:25
🔍 Benchmarks: – Chatbot Arena (LMSYS), Hallucination tests ,Hum
…
101 views
2 months ago
YouTube
Hello-Wereld
3:31:24
Deep Dive into LLMs like ChatGPT
5.6M views
Feb 5, 2025
YouTube
Andrej Karpathy
0:47
State-of-the-art results (100%!!) on widely used academic benchmark
…
6.3K views
Sep 25, 2023
TikTok
rajistics
2:15:40
Codex: Evaluating Large Language Models Trained on Code
3.7K views
Jul 28, 2022
YouTube
Samuel Albanie
6:28
First local LLM to Beat GPT-4 on Coding | Codellama-70B
23K views
Jan 30, 2024
YouTube
Prompt Engineering
11:18
OpenCI: NEW Opensource Code Interpreter Model On Par with GP
…
7.9K views
Feb 24, 2024
YouTube
WorldofAI
1:30:41
Webinar: AI System Design, from idea to a scalable LLM-
…
773 views
10 months ago
YouTube
Codex Town Club
0:55
Is Recursion the Frontier for LLM Reasoning
1.9K views
2 months ago
YouTube
Trelis Research
23:02
Evaluating Biases in LLMs using WEAT and Demographic Diversity
…
7.4K views
Nov 5, 2023
YouTube
AI Anytime
9:12
NEW AutoCoder LLM Beats GPT-4o! Best Opensource Coding LLM!
16.5K views
May 30, 2024
YouTube
WorldofAI
13:33
DeepSeek Engram: Conditional Memory via Scalable Lookup: A N
…
147 views
1 month ago
YouTube
MillionScope
4:04
GPT-OSS Evaluated: 20B vs 120B LLMs
120 views
6 months ago
YouTube
AI Research Roundup