Daniel Huynh PRO

dhuynh95

Articles

Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper

Automatic Hallucination detection with SelfCheckGPT NLI

StarCoder Memorization Experiment Highlights Privacy Risks of Fine-Tuning On Code

Introducing BlindChat, an open-source and privacy-by-design Conversational AI fully in-browser

AI Total Cost of Ownership Calculator: Evaluate the cost of in-house AI deployment vs AI APIs

dhuynh95's activity

upvoted a paper 1 day ago

From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond

Paper • 2411.03590 • Published 4 days ago • 9

upvoted 4 papers about 1 month ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3 • 34

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53

Can Models Learn Skill Composition from Examples?

Paper • 2409.19808 • Published Sep 29 • 8

Attention Prompting on Image for Large Vision-Language Models

Paper • 2409.17143 • Published Sep 25 • 7

upvoted 2 papers about 2 months ago

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18 • 36

A Controlled Study on Long Context Extension and Generalization in LLMs

Paper • 2409.12181 • Published Sep 18 • 43

upvoted a paper 2 months ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 87

upvoted 8 papers 3 months ago

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Paper • 2408.13257 • Published Aug 23 • 25

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 117

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 40

Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Paper • 2408.06663 • Published Aug 13 • 15

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 33

OmniParser for Pure Vision Based GUI Agent

Paper • 2408.00203 • Published Aug 1 • 23

Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models

Paper • 2407.19474 • Published Jul 28 • 22

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Paper • 2407.20183 • Published Jul 29 • 37

upvoted 4 papers 4 months ago

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Paper • 2407.15711 • Published Jul 22 • 9

InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

Paper • 2407.05700 • Published Jul 8 • 9

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7 • 6

Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages

Paper • 2407.03321 • Published Jul 3 • 15