About
My primary interest lies in comprehensively exploring the strengths and limitations of modern AI systems, particularly Large Language Models (LLMs), across multiple dimensions. My goal is to harness this knowledge to craft robust and interpretable AI solutions that tackle a wide range of challenges. My research interests aligns with two key areas: 1.Architectural Innovations – I am passionate about uncovering how these models function through approximate mathematical modeling and developing tools that help people to understand these models. I aim to enhance the performance and efficiency of these models by modifying their architecture. 2.AI4CODE – I also aim at improving these AI models for code generation. I wanted to incorporate human-like reasoning abilities into LLMs, enabling them to engage in more robust and structured problem-solving before arriving at target code.