Apple is presenting new research at the European Conference on Computer Vision (ECCV), which takes place in person in Milan, Italy, from September 29 - October 4. We are proud to again sponsor the biennial conference, which brings together the scientific and industrial research communities around ML and computer vision. Below is an overview of Apple’s participation at ECCV 2024.
Schedule
Stop by the Apple booth #34, in the Allianz MiCo Convention Center during exhibition hours (all times GMT+2):
- Tuesday, October 1 — Thursday, October 3: 09:00-18:30
- Friday, October 4: 09:00-12:30
Sunday, September 29
- AVGenL: Audio-Visual Generation and Learning 2024
- 14:00 - 18:00, Suite 9
-
- AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
- 15:40 - 16:20
- Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko
Monday, September 30
- 2nd workshop on Vision-based InduStrial InspectiON (VISION)
- 09:00 - 13:00, Tower Lounge
-
- Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
- 10:40 - 11:40
- Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi
Tuesday, October 1
- Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
- 16:30 - 18:30, Poster Session 2
- Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan
- CTRLorALTer: Conditional LoRAdapter for Efficient Zero-Shot Control & Altering of T2I Models
- 16:30 - 18:30, Poster Session 2
- Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Angel Bautista Martin, Björn Ommer
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
- 10:30 - 12:30, Poster Session 1
- Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang
Wednesday, October 2
- VeCLIP: Improving CLIP Training via Visual-enriched Captions
- 16:30 - 18:30, Poster Session 4
- Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao
Accepted Papers
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko
CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models
Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Angel Bautista Martin, Björn Ommer
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-Training
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang
Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation Using Bi-level Optimization
Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi
VeCLIP: Improving CLIP Training via Visual-Enriched Captions
Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao
Acknowledgements
Stephan Richter is a main conference area chair.
Alaa El-Nouby, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Raviteja Vemulapalli, and Yusu Qian are main conference reviewers.
For the 2nd workshop on Vision-based InduStrial InspectiON (VISION):
Alexander Wong, C Thomas, Carrie Yu, Javad Shafiee, Jeff Lai, and Tatiana Likhomanenko are co-organizers.
Shiyu Li and Yuxuan Liu are workshop chairs.
Raviteja Vemulapalli is a keynote speaker.
Vimal Thilak is a workshop reviewer.
Related readings and updates.
Empirical Methods in Natural Language Processing (EMNLP) 2024
Apple is presenting new research at the Empirical Methods in Natural Language Processing (EMNLP) conference, which takes place in person in Miami, Florida, from November 12 - 16. We are proud to again sponsor the conference, which brings together the scientific and industrial research communities around natural language processing and artificial intelligence. Below is an overview of Apple’s participation at EMNLP 2024.
International Conference on Learning Representations (ICLR) 2024
Apple sponsored the International Conference on Learning Representations (ICLR), which took place in person from May 7 to 11 in Vienna Austria. ICLR brings together professionals dedicated to the advancement of deep learning.