Apple is presenting new research at the European Conference on Computer Vision (ECCV), which takes place in person in Milan, Italy, from September 29 - October 4. We are proud to again sponsor the biennial conference, which brings together the scientific and industrial research communities around ML and computer vision. Below is an overview of Apple’s participation at ECCV 2024.

Schedule

Stop by the Apple booth #34, in the Allianz MiCo Convention Center during exhibition hours (all times GMT+2):

  • Tuesday, October 1 — Thursday, October 3: 09:00-18:30
  • Friday, October 4: 09:00-12:30

Sunday, September 29

Monday, September 30

Tuesday, October 1

Wednesday, October 2

Accepted Papers

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko

CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Angel Bautista Martin, Björn Ommer

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-Training

Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Synth4Seg - Learning Defect Data Synthesis for Defect Segmentation Using Bi-level Optimization

Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi

VeCLIP: Improving CLIP Training via Visual-Enriched Captions

Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao

Acknowledgements

Stephan Richter is a main conference area chair.

Alaa El-Nouby, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Raviteja Vemulapalli, and Yusu Qian are main conference reviewers.

For the 2nd workshop on Vision-based InduStrial InspectiON (VISION):

Alexander Wong, C Thomas, Carrie Yu, Javad Shafiee, Jeff Lai, and Tatiana Likhomanenko are co-organizers.

Shiyu Li and Yuxuan Liu are workshop chairs.

Raviteja Vemulapalli is a keynote speaker.

Vimal Thilak is a workshop reviewer.

Related readings and updates.

Empirical Methods in Natural Language Processing (EMNLP) 2024

Apple is presenting new research at the Empirical Methods in Natural Language Processing (EMNLP) conference, which takes place in person in Miami, Florida, from November 12 - 16. We are proud to again sponsor the conference, which brings together the scientific and industrial research communities around natural language processing and artificial intelligence. Below is an overview of Apple’s participation at EMNLP 2024.

See event details

International Conference on Learning Representations (ICLR) 2024

Apple sponsored the International Conference on Learning Representations (ICLR), which took place in person from May 7 to 11 in Vienna Austria. ICLR brings together professionals dedicated to the advancement of deep learning.

See event details