Roei Herzig

I'm Roei, a first-year Ph.D. student in Computer Science at Tel Aviv University, working with Prof. Amir Globerson and Prof. Trevor Darrell.

I'm also a Machine Learning & Deep Learning Researcher, I have worked at Nexar and Trax Image Recognition in the last 5 years. I graduated Magna Cum Laude from Tel Aviv University with MSc (CS), BSc (CS) and BSc (Physics).

Email  /  Twitter  /  Github  /  LinkedIn  /  Google Scholar

profile photo
Research

I mainly focus on machine learning models and deep learning methods for video and image understanding using structure (e.g. Structured Prediction). I believe our world is compositional and humans don't perceive the world as raw pixels. Moreover, structured models can enjoy the properties of generalization and inductive-bias, which I find critical.

Research Interest:

  • Vision & Language: Object Detection, Scene Understanding, Visual Reasoning.
  • Machine Learning & Deep Learning: Semi-Supervised Learning, Self-Supervised Learning, Unsupervised Learning, Generative Models, Graph Neural Networks.

fast-texture Differentiable Scene Graphs fast-texture
Moshiko Raboh* , Roei Herzig*, Gal Chechik, Jonathan Berant, Amir Globerson
arXiv preprint , 2019
bibtex

We propose an intermediate “graph-like” representation (DSGs) that can be learned in an end-to-end manner from the supervision for a downstream visual reasoning task, which achieves a new state-of-the-art results on Referring Relationships task.

fast-texture Classifying Collisions with Spatio-Temporal Action Graph Networks fast-texture
Roei Herzig*, Elad Levi* , Huijuan Xu*, Eli Brosh, Amir Globerson , Trevor Darrell
arXiv preprint , 2019
bibtex

We propose a latent inter-object graph representation for activity recognition that explores the visual interaction between the objects in a self-supervised manner.

fast-texture Accurate Visual Localization for Automotive Applications
Eli Brosh*, Matan Friedmann*, Ilan Kadar*, Lev Yitzhak Lavy*, Elad Levi*, Shmuel Rippa*, Yair Lempert, Bruno Fernandez-Ruiz, Roei Herzig, Trevor Darrell
Workshop on Autonomous Driving at CVPR , 2019
blog / code / dataset / bibtex

We propose a hybrid coarse-to-fine approach that leverages visual and GPS location cues with on a new large-scale driving dataset based on video and GPS data.

fast-texture Precise Detection in Densely Packed Scenes
Eran Goldman*, Roei Herzig*, Aviv Eisenschtat* , Jacob Goldberger, Tal Hassner
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) , 2019
code / dataset / bibtex

We collect a new SKU-110K dataset which takes detection challenges to unexplored territories, and propose a novel mechanism to learn deep overlap rates for each detection.

fast-texture Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction
Roei Herzig*, Moshiko Raboh* , Gal Chechik, Jonathan Berant, Amir Globerson
Advanced in Neural Information Processing Systems (NeurIPS) , 2018
code / bibtex

We propose a novel invariant graph network for mapping images to scene graphs using the permutation invariant property, which achieves a new state-of-the-art results on Visual Genome dataset.


Webside template credits