Event | September 20, 2024

European Conference on Computer Vision (ECCV) 2024

Apple is presenting new research at the European Conference on Computer Vision (ECCV), which takes place in person in Milan, Italy, from September 29 to October 4. We are proud to again sponsor the biennial conference, which brings together the scientific and industrial research communities around ML and computer vision. Below is an overview of Apple’s participation at ECCV 2024.

Schedule

Stop by Apple booth #34 in the Allianz MiCo Convention Center during exhibition hours (all times GMT+2):

Tuesday, October 1 — Thursday, October 3: 09:00-18:30
Friday, October 4: 09:00-12:30

Sunday, September 29

WORKSHOP
AVGenL: Audio-Visual Generation and Learning 2024
14:00 - 18:00, Suite 9
- POSTER
- AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
- 15:40 - 16:20
- Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko

Monday, September 30

WORKSHOP
2nd workshop on Vision-based InduStrial InspectiON (VISION)
09:00 - 13:00, Tower Lounge
- POSTER
- Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
- 10:40 - 11:40
- Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi

Tuesday, October 1

POSTER
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
16:30 - 18:30, Poster Session 2
Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan

POSTER
CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models
16:30 - 18:30, Poster Session 2
Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Angel Bautista Martin, Björn Ommer

POSTER
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
10:30 - 12:30, Poster Session 1
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Wednesday, October 2

POSTER
VeCLIP: Improving CLIP Training via Visual-enriched Captions
16:30 - 18:30, Poster Session 4
Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao

Accepted Papers

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko

CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Angel Bautista Martin, Björn Ommer

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization

Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi

VeCLIP: Improving CLIP Training via Visual-enriched Captions

Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao

Acknowledgements

Stephan Richter is a main conference area chair.

Alaa El-Nouby, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Raviteja Vemulapalli, and Yusu Qian are main conference reviewers.

For the 2nd workshop on Vision-based InduStrial InspectiON (VISION):

Alexander Wong, C Thomas, Carrie Yu, Javad Shafiee, Jeff Lai, and Tatiana Likhomanenko are co-organizers.

Shiyu Li and Yuxuan Liu are workshop chairs.

Raviteja Vemulapalli is a keynote speaker.

Vimal Thilak is a workshop reviewer.
