Integrate RL with LLM Agents

Overview and Motivation Presentation on integrating reinforcement learning concepts with LLM engines to improve performance Multiple approaches exist to enhance LLM performance: in-context learning, post-training (reinforcement learning), and fine-tuning Focus on efficiently integrating RL concepts with LLMs, particularly for multi-agent systems Covers differences between reinforcement learning on LLMs versus reinforcement learning on LLM agents Addresses RL implementation in HPC systems due to memory-intensive requirements Multi-Agent System Fundamentals Multi-agent systems consist of multiple LLMs, each with specific roles and functions State acts as history, compiling all previous agent turns, context, and evidence in multi-turn systems Communication Topologies ...

February 20, 2026 · 11 min · Ibrahim Al Azher

LLMs as a Judges

Overview On March 28, 2025, Miftahul Jannat Mokarrama, a PhD student in the Computer Science department, presented a research topic on “LLMs as Judges,” a trending subject in the field of large language models. Her presentation provided a comprehensive survey covering functionality, methodology, applications, meta-evaluation, and limitations. Key conclusions included: LLMs as evaluators are versatile, evaluation is context-specific, challenges persist, human-AI collaboration is essential, and evaluation should extend beyond traditional research papers.

March 28, 2025 · 1 min · Miftahul Jannat Mokarrama

Large Language Models: What, Why, and How

Overview On 02/21/2025, the first NIU AI/ML Research Seminar was held in PM103. Ibrahim Al Azher, a PhD student from the CS department, delivered a general introduction to current studies on Large Language Models, their limitations, and recent advancements in Retrieval-Augmented Generation (RAG). The presentation, which lasted about an hour, was well received by the participants and sparked engaging discussions afterward. This event marked a successful start to the AI/ML research seminar series.

February 21, 2025 · 1 min · Ibrahim Al Azher