About
Contact & Profiles
Research Areas
- Video Analysis and Summarization
- Generative Adversarial Networks and Image Synthesis
- Human Motion and Animation
- Multimodal Machine Learning Applications
- Robotics and Automated Systems
Google (United States)
2024
10.1109/cvpr52733.2024.01835
article
EN
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
2024-06-16
Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen have made significant progress in generating high-resolution images based on text descriptions. However, many generated images still suffer from issues such as artifacts/implausibility, misalignment with text descriptions, and low aesthetic quality. Inspired by the success of Reinforcement Learning with Human Feedback (RLHF) for large language models, prior works collected human-provided scores as feedback and trained a reward model to improve T2I...
10.48550/arxiv.2312.10240
preprint
EN
cc-by
arXiv (Cornell University)
2023-01-01