Idan Schwartz
The Department of Computer Science at Bar-Ilan University
Verified email at biu.ac.il
Title
Cited by
Year
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Y Tewel, Y Shalev, I Schwartz, L Wolf
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 295; Year: 2022
Factor graph attention
I Schwartz, S Yu, T Hazan, AG Schwing
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 138; Year: 2019
High-order attention models for visual question answering
I Schwartz, AG Schwing, T Hazan
Advances in Neural Information Processing Systems 30, 2017
Cited by: 131; Year: 2017
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
I Gat, I Schwartz, A Schwing, T Hazan
Advances in Neural Information Processing Systems 33, 2020
Cited by: 124; Year: 2020
A simple baseline for audio-visual scene-aware dialog
I Schwartz, AG Schwing, T Hazan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 97; Year: 2019
Diverse and aligned audio-to-video generation via text-to-video model adaptation
G Yariv, I Gat, S Benaim, L Wolf, I Schwartz, Y Adi
Proceedings of the AAAI Conference on Artificial Intelligence 38 (7), 6639-6647, 2024
Cited by: 74; Year: 2024
Optimizing relevance maps of vision transformers improves robustness
H Chefer, I Schwartz, L Wolf
Advances in Neural Information Processing Systems 35, 33618-33632, 2022
Cited by: 56; Year: 2022
Perceptual score: What data modalities does your model perceive?
I Gat, I Schwartz, A Schwing
Advances in Neural Information Processing Systems 34, 21630-21643, 2021
Cited by: 51; Year: 2021
Zero-shot video captioning with evolving pseudo-tokens
Y Tewel, Y Shalev, R Nadler, I Schwartz, L Wolf
British Machine Vision Conference, 2023
Cited by: 46; Year: 2023
Identifying anomalous activities in a cloud computing environment
Y Shen, A Benameur, AX Ough, I Schwartz
US Patent App. 18/344,664, 2024
Cited by: 41; Year: 2024
Audiotoken: Adaptation of text-conditioned diffusion models for audio-to-image generation
G Yariv, I Gat, L Wolf, Y Adi, I Schwartz
arXiv preprint arXiv:2305.13050, 2023
Cited by: 40; Year: 2023
Latent space explanation by intervention
I Gat, G Lorberbom, I Schwartz, T Hazan
Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 679-687, 2022
Cited by: 23; Year: 2022
Video and Text Matching with Conditioned Embeddings
A Ali, I Schwartz, T Hazan, L Wolf
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Cited by: 19; Year: 2022
Ordered attention for coherent visual storytelling
T Braude, I Schwartz, A Schwing, A Shamir
Proceedings of the 30th ACM International Conference on Multimedia, 3310-3318, 2022
Cited by: 17*; Year: 2022
Ensemble of MRR and NDCG models for Visual Dialog
I Schwartz
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Cited by: 14; Year: 2021
Discriminative class tokens for text-to-image diffusion models
I Schwartz, V Snæbjarnarson, H Chefer, S Belongie, L Wolf, S Benaim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 12; Year: 2023
Search system for providing web crawling query prioritization based on classification operation performance
I Guy, I Schwartz, K Radinsky
US Patent 11,636,164, 2023
Cited by: 7; Year: 2023
Interruption Predictions for Cloud Compute Instances
I Schwartz, O Muchnik, J Cohen, K Mcgrath, A Shachar
US Patent 20,220,129,322, 2022
Cited by: 7; Year: 2022
Iterative object count optimization for text-to-image diffusion models
O Zafar, L Wolf, I Schwartz
arXiv preprint arXiv:2408.11721, 2024
Cited by: 6; Year: 2024
Improving Visual Commonsense in Language Models via Multiple Image Generation
G Yariv, I Schwartz, Y Adi, S Benaim
arXiv preprint arXiv:2406.13621, 2024
Cited by: 2; Year: 2024