Skip to content
View ac-alpha's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@ihp-lab

Block or report ac-alpha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ac-alpha/README.md

Hey, I'm Ashutosh! πŸ‘‹

CS PhD student at USC's Institute for Creative Technologies, working with Prof. Mohammad Soleymani at the Intelligent Human Perception Lab. Bronze Medallist from IIT Roorkee (2021).

πŸ”¬ What I work on

I build and improve multimodal LLMs (audio/video/omni) β€” specifically using post-training techniques like preference optimization to give models better social and emotion understanding. I also work on video generation for social behaviors.

πŸ“„ Recent publications

Paper Venue
MoD-DPO β€” Mitigating cross-modal hallucinations in Omni LLMs CVPR 2026 πŸ”οΈ Denver
AVERE β€” Audiovisual emotion reasoning with preference optimization ICLR 2026 πŸ‡§πŸ‡· Rio
Face-LLaVA β€” Facial expression understanding via instruction tuning WACV 2026 🌡 Tucson
DiTaiListener β€” Controllable listener video generation ICCV 2025 🌺 Hawai'i

πŸ› οΈ Areas of interest

Multimodal LLMs Post-training & RLHF Emotion Understanding Social AI Video Generation Audio/Visual Reasoning

πŸ”— Find me

Website Scholar LinkedIn Email


Currently looking for Research/Applied Scientist internships on multimodal LLMs and video generation β€” feel free to reach out!

Pinned Loading

  1. ihp-lab/Face-LLaVA ihp-lab/Face-LLaVA Public

    [WACV 2026] Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning

    Python 11 2

  2. ihp-lab/LibreFace ihp-lab/LibreFace Public

    [WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis

    Python 211 28

  3. ihp-lab/AVERE ihp-lab/AVERE Public

    [ICLR 2026] Official Codebase for AVERE: Improving Audiovisual Emotion Reasoning with Preference Optimization

    Python 1