Actions, (LLM) Agents and World Models
[LLM Agents]
[Reasoning LLMs]
[Logic-Actions]
[Bias-Fairness]
[Instructions;Prompts]
[Data-Smart]
[Robustness;Self-aware]
[Task-focussed]
[Symbolic-Semantic-Parsing]
[Knowledge-Reasoning-NLI-NeuroSymbolic]
[IR+QA;Knowledge Hunting]
[Information Extraction; NER]
[Datasets;Systems]
[Application-Discovery]
[Application-Cybersecurity]
[Application-BioMed]
[Application-Robotics]
[Other]
-
Venkatesh Mishra, Amir Saeidi, Satyam Raj, Mutsumi Nakamura, Jayanth Srinivasa, Gaowen Liu, Ali Payani, Chitta Baral. How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on Tau -bench. EMNLP Findings 2025.
-
Mihir Parmar, Xin Liu, Palash Goyal, Yanfei Chen, Long Le, Swaroop Mishra, Hossein Mobahi, Jindong Gu, Zifeng Wang, Hootan Nakhost, Chitta Baral, Chen-Yu Lee, Tomas Pfister, Hamid Palangi. PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving. EMNLP 2025.
-
Mihir Parmar, Palash Goyal, Xin Liu, Yiwen Song, Mingyang Ling, Chitta Baral, Hamid Palangi, Tomas Pfister. PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving. EMNLP 2025.
-
Shikhhar Siingh, Abhinav Rawat, Chitta Baral, Vivek Gupta. GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning. ACL 2025.
- Divij Handa, Pavel Dolin, Shrinidhi Kumbhar, Tran Cao Son, Chitta Baral. ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints. ICLR 2025.
- Naman Ahuja, Fenil Bardoliya, Chitta Baral, Vivek Gupta. Map & Make: Schema Guided Text to Table Generation. ACL 2025.
Reasoning LLMs (Alignment using Optimization and Reinforcement Learning)
[LLM Agents]
[Reasoning LLMs]
[Logic-Actions]
[Bias-Fairness]
[Instructions;Prompts]
[Data-Smart]
[Robustness;Self-aware]
[Task-focussed]
[Symbolic-Semantic-Parsing]
[Knowledge-Reasoning-NLI-NeuroSymbolic]
[IR+QA;Knowledge Hunting]
[Information Extraction; NER]
[Datasets;Systems]
[Application-Discovery]
[Application-Cybersecurity]
[Application-BioMed]
[Application-Robotics]
[Other]
- Amir Saeidi, Shivanshu Verma, Aswin RRV, Kashif Rasul, and Chitta Baral.
Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization. TMLR 2025.
-
Aswin RRV, Jacob Dineen, Divij Handa, Md Nayem Uddin, Mihir Parmar, Chitta Baral, Ben Zhou. ThinkTuning: Instilling Cognitive Reflections without Distillation. EMNLP 2025.
-
Jacob Dineen, Aswin RRV, Qin Liu, Zhikun Xu, Xiao Ye, Ming Shen, Zhaonan Li, Shijie Lu, Chitta Baral, Muhao Chen, Ben Zhou. QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA.
EMNLP Findings 2025.
Logic and Actions
[LLM Agents]
[Reasoning LLMs]
[Logic-Actions]
[Bias-Fairness]
[Instructions;Prompts]
[Data-Smart]
[Robustness;Self-aware]
[Task-focussed]
[Symbolic-Semantic-Parsing]
[Knowledge-Reasoning-NLI-NeuroSymbolic]
[IR+QA;Knowledge Hunting]
[Information Extraction; NER]
[Datasets;Systems]
[Application-Discovery]
[Application-Cybersecurity]
[Application-BioMed]
[Application-Robotics]
[Other]
-
Md Nayem Uddin, Amir Saeidi, Divij Handa, Agastya Seth, Tran Cao Son, Eduardo Blanco, Steven Corman, Chitta Baral. UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization. ACL 2025.
- Divij Handa, Pavel Dolin, Shrinidhi Kumbhar, Tran Cao Son, Chitta Baral. ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints. ICLR 2025.
- Zhikun Xu, Ming Shen, Jacob Dineen, Zhaonan Li, Xiao Ye, Shijie Lu, Aswin RRV, Chitta Baral, Ben Zhou. ToW: Thoughts of Words Improve Reasoning in Large Language Models.
NAACL 2025.
-
Venkatesh Mishra, Bimsara Pathiraja, Mihir Parmar, Sat Chidananda, Jayanth Srinivasa, Gaowen Liu, Ali Payani, Chitta Baral. Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning. NAACL 2025 Findings.
- Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral. Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models. EMNLP 2024.
- Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral. Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
EMNLP 2024.
- Mutsumi Nakamura, Santosh Mashetty, Mihir Parmar, Neeraj Varshney, and Chitta Baral. LogicAttack: Adversarial Attacks for Evaluating Logical Consistency of Natural Language Inference.
Findings of EMNLP 2023.
-
Mihir Parmar, Nisarg Patel, Neeraj Varshney, Mutsumi Nakamura, Man Luo, Santosh Mashetty, Arindam Mitra, Chitta Baral. Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models. ACL 2024.
Bias and Fairness
[LLM Agents]
[Reasoning LLMs]
[Logic-Actions]
[Bias-Fairness]
[Instructions;Prompts]
[Data-Smart]
[Robustness;Self-aware]
[Task-focussed]
[Symbolic-Semantic-Parsing]
[Knowledge-Reasoning-NLI-NeuroSymbolic]
[IR+QA;Knowledge Hunting]
[Information Extraction; NER]
[Datasets;Systems]
[Application-Discovery]
[Application-Cybersecurity]
[Application-BioMed]
[Application-Robotics]
[Other]
- Mihir Parmar, Swaroop Mishra, Mor Geva and Chitta Baral.
Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions. EACL 2023.
(Outstanding paper award)
Data Generation; Instructibility; Hallucination Mitigation; Knowledge Guidance; Instruction Engineering; Limitations of Transformer models
[LLM Agents]
[Reasoning LLMs]
[Logic-Actions]
[Bias-Fairness]
[Instructions;Prompts]
[Data-Smart]
[Robustness;Self-aware]
[Task-focussed]
[Symbolic-Semantic-Parsing]
[Knowledge-Reasoning-NLI-NeuroSymbolic]
[IR+QA;Knowledge Hunting]
[Information Extraction; NER]
[Datasets;Systems]
[Application-Discovery]
[Application-Cybersecurity]
[Application-BioMed]
[Application-Robotics]
[Other]
- Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra.
TarGEN: Targeted Data Generation with Large Language Models. COLM 2024.
- Aswin RRV, Nemika Tyagi, Md Nayem Uddin, Neeraj Varshney, Chitta Baral.
Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies. ACL 2024 (Findings).
- Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral. The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness. ACL 2024 (Findings).
-
Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral. InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis. NAACL 2024.
-
Neeraj Varshney, Agneet Chatterjee, Mihir Parmar, Chitta Baral.
Investigating Acceleration of LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with 'LITE'.Findings of NAACL 2024.
(paper with code)
-
... Chitta Baral ... (hundreds of co-authors)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
(BIG BENCH)
arXiv:2206.04615.
Dataset,
TMLR (Transactions on Machine Learning Research), 2023.
-
Y. Wang, S. Mishra, P. Alipoormolabashi, Y. Kordi, A. Mirzaei, A. Naik, A. Ashok, A. S.
Dhanasekaran, A. Arunkumar, D. Stap, E. Pathak, G. Karamanolakis, H. Lai, I. Purohit,
Ishan; Mondal, J. Anderson, K. Kuznia, K. Doshi, K. K. Pal, M. Patel, M. Moradshahi,
M. Parmar, M. Purohit, N. Varshney, P. R. Kaza, P. Verma, R. S. Puri, R. Karia, S. Doshi,
S. K. Sampat, S. Mishra, S. Reddy, S. Patro, T. Dixit, X. Shen, C. Baral, Y. Choi, N. A. Smith,
H. Hajishirzi, and D. Khashabi.
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ Tasks. EMNLP 2022.
-
Pruthvi Jayeshkumar Patel, Swaroop Mishra, Mihir Parmar and Chitta Baral.
Is a Question Decomposition Unit All We Need? EMNLP 2022.
- Kirby Kuznia, Swaroop Mishra, Mihir Parmar and Chitta Baral. Less is More: Summary of Long Instructions is Better for Program Synthesis. EMNLP 2022.
- Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral. In-BoXBART: Get Instructions into Biomedical Multi-Task Learning. Findings of NAACL 2022.
- Swaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi.
Cross-Task Generalization via Natural Language Crowdsourcing Instructions. ACL 2022.
- Swaroop Mishra, Daniel Khashabi, Chitta Baral, Yejin Choi, Hannaneh Hajishirzi. Reframing Instructional Prompts to GPTk's Language. Findings of ACL 2022.
-
Kuntal Pal, Chitta Baral.
Investigating Numeracy Learning Ability of a Text-to-Text Transfer Model. Findings of EMNLP 2021.