Refereed Journal Papers and Book Chapters
Refereed Conference and Workshop Papers
- Sicheng Zhao, Shangfei Wang, Mohammad Soleymani, Dhiraj Joshi, Qiang Ji. "Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey". ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2019.
- Michele Merler, Khoi-Nguyen C. Mac, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogerio Feris, "Automatic Curation of Sports Highlights using Multimodal Excitement Features", IEEE Transactions on Multimedia 21(5), 1147-1160, 2019.
- Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Xin Lu, Quang-Tuan Luong, James Z. Wang, Jia Li, and Jiebo Luo, “On Aesthetics and Emotions in Images: A Computational Perspective”, book-chapter, Scene Vision, MIT Press, (Eds. Kestas Kveraga and Moshe Bar), 2014.
- Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Joshi, and Jiawei Han, “Reinforced Similarity Integration in Image-Rich Information Networks”, IEEE Transactions on Knowledge and Data Engineering, 25(2):448-460, 2013.
- Dhiraj Joshi, Andrew Gallagher, Jie Yu, and Jiebo Luo, “Inferring Photographic Location using Geotagged Web Images”, Multimedia Tools and Applications - Springer, Special Issue: Social Media Mining and Search, 56(1):131-153, 2012.
- Dhiraj Joshi, Jiebo Luo, Jie Yu, Phoury Lei, Andrew Gallagher, “Using Geotags to Derive Rich Tag-clouds for Image Annotation”, book-chapter, Social Media Modeling and Computing, Springer-Verlag, (Eds. Steven Hoi, Jiebo Luo, Susanne Boll, Dong Xu, Rong Jin, and Irwin King), 2011.
- Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Quang-Tuan Luong, James Z. Wang, Jia Li, and Jiebo Luo, “Aesthetics and Emotions in Images: A Computational Perspective”, IEEE Signal Processing Magazine – Featured Article, vol. 28, no. 5, pp. 94-115, 2011.
- Jiebo Luo, Dhiraj Joshi, Jie Yu, and Andrew Gallagher, “Geotagging in Multimedia and Computer Vision - A Survey”, Multimedia Tools and Applications - Springer, Special Issue: Survey Papers in Multimedia by World Experts, 51(1):187–211, 2011.
- Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z. Wang, “Image Retrieval: Ideas, Influences, and Trends of the New Age”, ACM Computing Surveys, vol. 40, no. 2, article 5, pp. 1-60, 2008.
- Dhiraj Joshi, James Z. Wang, and Jia Li, “The Story Picturing Engine – A System for Automatic Text Illustration”, ACM Transactions on Multimedia Computing, Communications and Applications (TOMCCAP), vol. 2, no. 1, pp. 68-89, 2006.
- Dhiraj Joshi, Jia Li, and James Z. Wang, “A Computationally Efficient Approach to the Estimation of Two- and Three-dimensional Hidden Markov Models, IEEE Transactions on Image Processing, vol. 15, no. 7, pp. 1871-1886, 2006.
- Kalyanmoy Deb, Ashish Anand, and Dhiraj Joshi, “A Computationally Efficient Evolutionary Algorithm for Real-Parameter Optimization”, Evolutionary Computation Journal (MIT Press), vol. 10, no. 4, pp. 371-395, 2002.
Refereed Conference and Workshop Papers
- Shengcao Cao, Dhiraj Joshi, Liangyan Gui, Yu-Xiong Wang, "HASSOD: Hierarchical Adaptive Self-Supervised Object Detection", accepted to NeuRIPS 2023.
- Shengcao Cao, Dhiraj Joshi, Liangyan Gui, Yu-Xiong Wang, "Contrastive Mean Teacher for Domain Adaptive Object Detectors", CVPR 2023.
- Hanjing Wang, Dhiraj Joshi, Shiqiang Wang, Qiang Ji, "Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning", CVPR 2023.
- Andrew Rouditchenko, Angie Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass, "AVLnet: Learning Audio-Visual Language Representations from Instructional Videos", arXiv:2006.09199. Interspeech 2021.
- Khoi-Nguyen C. Mac, Dhiraj Joshi, Raymond A. Yeh, Jinjun Xiong, Rogerio Feris, Minh N. Do, "Learning Motion in Feature Space: Locally- Consistent Deformable Convolution Networks for Fine Grained Action Detection", ICCV 2019 (oral - 4.3% of all submitted papers).
- Angie Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogerio Feris, Dan Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James Glass, "Grounding Spoken Words in Unlabeled Video", Sight and Sound Workshop, (CVPR) 2019.
- Michele Merler, Dhiraj Joshi, Khoi-Nguyen C. Mac, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogerio Feris, "The Excitement of Sports: Automatic Highlights using Audio-Visual Cues", Sight and Sound Workshop, (CVPR) 2018.
- John R. Smith, Dhiraj Joshi, Benoit Huet, Winston Hsu, and Jozef Kota, "Harnessing A.I. for Augmenting Creativity: Application to Movie Trailer Creation", ACM Multimedia, 2017 (Best Brave New Ideas Paper Award 2017, IBM Pat Goldberg Memorial Best Paper Award 2017).
- Dhiraj Joshi, Michele Merler, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogerio Feris, "IBM High-Five: Highlights From Intelligent Video Engine", ACM Multimedia, 2017 (demo).
- Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogerio Feris, "Auto Curation of Golf Highlights using Multimodal Excitement Features", Int. Workshop on Computer Vision in Sports (with CVPR), 2017.
- Ryosuke Shigenaka, Yan-Ying Chen, Francine Chen, Dhiraj Joshi, Yukihiro Tsuboshita, "Image-based User Profiling of Frequent and Regular Venue Categories", IEEE Int. Conference on Multimedia Expo (ICME), 2017 (finalist - World's First 10K Best Paper Award at IEEE ICME 2017).
- Francine Chen, Dhiraj Joshi, Yasuhide Miura, and Tomoko Ohkuma, "Social Media based Profiling of Business Locations", Fuji Xerox Technical Report, March 2016.
- Yan-Ying Chen, Francine Chen, Matthew Cooper, and Dhiraj Joshi, "Using Business-Aware Latent Topics for Image Captioning in Social Media", IEEE Int. Conference on Multimedia Expo (ICME), 2016.
- Bor-Chun Chen, Yan-Ying Chen, Francine Chen, and Dhiraj Joshi, "Business-Aware Visual Concept Discovery from Social Media for Multimodal Business Venue Recognition", AAAI Conference on Artificial Intelligence (AAAI-16), 2016.
- Bokai Cao, Francine Chen, Dhiraj Joshi, and Phillip Yu, "Inferring Crowd-Sourced Venues for Tweets", IEEE Int. Conf. on Big Data (IEEE BigData), 2015.
- Dhiraj Joshi, Matthew Cooper, Francine Chen, and Yan-Ying Chen, "Building User Profiles from Shared Photos", ACM Multimedia Workshop on Multimedia Commons, in conjunction with ACM Multimedia, 2015.
- Francine Chen, Dhiraj Joshi, Yasuhide Miura, and Tomoko Ohkuma, "Social Media based Profiling of Business Locations", ACM Multimedia Workshop on Geotagging and its Applications, in conjunction with ACM Multimedia, 2014.
- Huizhong Chen, Matthew Cooper, Dhiraj Joshi, and Bernd Girod, "Multi-modal Language Models for Lecture Video Retrieval", ACM International Conference on Multimedia, short paper, 2014.
- Dhiraj Joshi, Francine Chen, and Lynn Wilcox, "Finding Selfies of Users in Microblogged Photos", ACM International Workshop on Social Media Retrieval and Analysis (SoMeRa), in conjunction with ACM SIGIR, short paper, 2014.
- Junjie Cai, Qiong Liu, Francine Chen, Dhiraj Joshi, and Qi Tian, "Scalable Image Search with Multiple Index Tables", ACM International Conference on Multimedia Retrieval (ICMR), short paper, 2014.
- Minwoo Park, Dhiraj Joshi, and Alexander Loui, "TagCloud++ - Scalable Tag-clouds for Arbitrary Layouts", IEEE Symposium on Multimedia (ISM), 2012.
- Hua Wang, Dhiraj Joshi, Jiebo Luo, Heng Huang, and Minwoo Park, "Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Structured Multi-Task Learning", IEEE Symposium on Multimedia (ISM), 2012.
- Charles Parker, Dhiraj Joshi, Phoury Lei, Jiebo Luo, “Characterizing Geographically Sensitive Music via Social Media”, ACM International Workshop on Music Information Retrieval with User-Centered and Multimodal Strategies (MIRUM), in conjunction with ACM Multimedia, 2011.
- Vivek Singh, Jiebo Luo, Dhiraj Joshi, Phoury Lei, Madirakshi Das, Peter Stubler, “Dynamic Media Show Drivable by Semantics”, ACM International Conference on Multimedia (demo session), 2011.
- Vivek Singh, Jiebo Luo, Dhiraj Joshi, Phoury Lei, Madirakshi Das, Peter Stubler, “Reliving On Demand: A Total Viewer Experience”, ACM International Conference on Multimedia, 2011.
- Dhiraj Joshi, Jiebo Luo, Jie Yu, Phoury Lei, Andrew Gallagher, “Rich Location-driven Tag Cloud Suggestions based on Public, Community and Personal Sources”, ACM International Workshop on Connected Media, in conjunction with ACM Multimedia, 2010.
- Dhiraj Joshi, Mark. D. Wood, and Jiebo Luo, “Suggesting Songs for Media Creation using Semantics”, IAPR International Conference on Pattern Recognition, 2010.
- Dhiraj Joshi, Andrew Gallagher, Jie Yu, and Jiebo Luo, “Exploring User Image Tags for Geo-location Inference”, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2010.
- Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Joshi, and Jiawei Han, “iRIN: Image Retrieval in Image-Rich Information Networks”, demo, International Conference on World Wide Web, 2010.
- Andrew Gallagher, Dhiraj Joshi, Jie Yu, and Jiebo Luo, “Geo-location Inference from Image Content and User Tags”, IEEE Workshop on Internet Vision, in conjunction with IEEE International Conference on Computer Vision and Pattern Recognition, 2009.
- Jie Yu, Dhiraj Joshi, and Jiebo Luo, “Connecting people in photo-sharing sites by photo content and User Annotation”, IEEE International Conference on Multimedia and Expo, 2009.
- Jiebo Luo, Jie Yu, Dhiraj Joshi, and Wei Hao, “Event Recognition – Viewing the World with a Third Eye”, ACM International Conference on Multimedia, 2008.
- Jiebo Luo, Wei Hao, Dale McIntyre, Dhiraj Joshi, and Jie Yu, “Recognizing Picture Taking Environment from Satellite Images: A Feasibility Study”, IAPR International Conference on Pattern Recognition, 2008.
- Dhiraj Joshi and Jiebo Luo, “Inferring Generic Activities and Events from Image Content and Bags of Geo-tags”, ACM International Conference on Image and Video Retrieval, 2008.
- Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z. Wang, “Tagging over Time: Real-world Image Annotation by Lightweight Meta-learning”, ACM International Conference on Multimedia, 2007.
- Dhiraj Joshi, Milind Naphade, and Apostol Natsev, “A Greedy Performance Driven Algorithm for Decision Fusion Learning”, IEEE International Conference on Image Processing, 2007.
- Dhiraj Joshi, Milind Naphade, and Apostol Natsev, “Semantics Reinforcement and Fusion Learning for Multimedia Streams”, ACM International Conference on Image and Video Retrieval, 2007.
- Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z. Wang, “Studying Aesthetics in Photographic Images Using a Computational Approach”, European Conference on Computer Vision (ECCV),2006.
- Murray Campbell, Shahram Ebadollahi, Alexander Haubold, Dhiraj Joshi, Milind Naphade, Apostol Natsev, Joachim Seidl, John R. Smith, Katya Scheinberg, Jelena Tesic and Lexing Xie, “IBM Research TRECVID-2006 Video Retrieval System”, TREC Video Retrieval Workshop, 2006.
- Dhiraj Joshi, Ritendra Datta, Ziming Zhuang, WP Weiss, Marc Friedenberg, Jia Li, and James Z. Wang, “PARAgrab: A Comprehensive Architecture for Web Image Management and Multimodal Querying”, demo, Very Large Databases (VLDB), 2006.
- Dhiraj Joshi and Daniel Gatica Perez, “Discovering Groups of People in Google News”, ACM International Workshop on Human Centered Multimedia, in conjunction with ACM Multimedia, 2006.
- Dhiraj Joshi, Jia Li, and James Z. Wang, “A Stochastic Approach to 3-D Image modeling”, IEEE/NLM Life Science Systems & Applications Workshop, 2006.
- Dhiraj Joshi, Jia Li, and James Z. Wang, “Parameter Estimation of Multi-dimensional Hidden Markov models – A Scalable Approach”, IEEE International Conference on Image Processing, 2005.
- Jia Li, Dhiraj Joshi, and James Z. Wang, “Stochastic Modeling of Volume Images with a 3-D Hidden Markov model”, IEEE International Conference on Image Processing, 2004.
- Dhiraj Joshi, James Z. Wang, and Jia Li, “The Story Picturing Engine: Finding Elite Images to Illustrate a Story Using Mutual Reinforcement”, ACM International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, 2004.
- Kalyanmoy Deb, Dhiraj Joshi, and Ashish Anand, “Real Coded Evolutionary Algorithms with Parent Centric Recombination”, IEEE International Congress on Evolutionary Computation, 2002.