[AI Dev Tools] AI-Assisted Debugging, Patch Generation, Code Summarization...
![[AI Dev Tools] AI-Assisted Debugging, Patch Generation, Code Summarization...](/content/images/size/w960/2024/08/cl-dashboard.png)
Cursor Lens: An open-source dashboard for Cursor.sh IDE
Cursor Lens is an open-source tool that provides insights into AI-assisted coding sessions using Cursor AI, acting as a proxy between Cursor and various AI providers.
Key Features:
- Integrates with multiple AI providers, including OpenAI and Anthropic, capturing and logging all requests between Cursor and AI providers.
- Offers a visual analytics dashboard displaying AI usage, token consumption, and request patterns, along with real-time monitoring of ongoing AI interactions.
- Allows users to configure and switch between different AI models, tracking token usage and providing cost estimates based on model pricing.
- Built using Next.js with React for the frontend and backend, PostgreSQL with Prisma ORM for the database, and Tailwind CSS with shadcn/ui components for styling.
- Supports prompt caching with Anthropic, allowing system and context messages in specific chats to be cached for improved efficiency.
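To make the proxy idea concrete, here is a minimal logging proxy sketched in Python (Cursor Lens itself is built on Next.js/TypeScript, so this is illustrative only). It assumes an OpenAI-compatible upstream endpoint, ignores streaming responses, and invents a simple JSONL log format:

```python
import json, time, urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

UPSTREAM = "https://api.openai.com/v1/chat/completions"  # illustrative endpoint
LOG_FILE = "requests.jsonl"

class LoggingProxy(BaseHTTPRequestHandler):
    """Forward each request to the provider, logging it first."""

    def do_POST(self):
        body = self.rfile.read(int(self.headers["Content-Length"]))
        with open(LOG_FILE, "a") as log:  # capture the request for a dashboard
            log.write(json.dumps({"ts": time.time(),
                                  "request": json.loads(body)}) + "\n")
        upstream = urllib.request.Request(
            UPSTREAM, data=body,
            headers={"Content-Type": "application/json",
                     "Authorization": self.headers.get("Authorization", "")})
        with urllib.request.urlopen(upstream) as resp:
            payload, status = resp.read(), resp.status
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(payload)

if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8000), LoggingProxy).serve_forever()
```

Pointing an editor at http://127.0.0.1:8000 instead of the provider would let the log file feed a usage dashboard like the one Cursor Lens renders.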
PatUntrack: Automated Patch Example Generation for Issue Reports
PatUntrack is a system that automatically generates patch examples from vulnerability issue reports (IRs) without tracked insecure code, using large language models (LLMs) to analyze the vulnerabilities.
- The system generates a complete description of the Vulnerability-Triggering Path (VTP) from vulnerable IRs.
- PatUntrack corrects hallucinations in the VTP description using external golden knowledge.
- It then produces Top-K pairs of Insecure Code and Patch Examples based on the corrected VTP description.
- Experiments on 5,465 vulnerable IRs showed PatUntrack outperformed traditional LLM baselines by 14.6% (Fix@10) on average in patch example generation.
- In a real-world application, 27 out of 37 IR authors confirmed the usefulness of PatUntrack-generated patch examples for 76 newly disclosed vulnerable IRs.
Source: PatUntrack: Automated Generating Patch Examples for Issue Reports without Tracked Insecure Code
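The three stages in the list above chain naturally as successive LLM calls. A minimal sketch, where `call_llm`, the prompt wording, and the knowledge-base format are placeholders rather than the authors' implementation:

```python
def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def generate_patch_examples(issue_report: str, golden_knowledge: str,
                            k: int = 10) -> list:
    # Stage 1: describe the Vulnerability-Triggering Path (VTP) from the IR.
    vtp = call_llm("Describe the vulnerability-triggering path for this "
                   "issue report:\n" + issue_report)
    # Stage 2: correct hallucinations against external golden knowledge.
    vtp = call_llm(f"Correct inaccuracies in this VTP description using the "
                   f"reference notes.\nVTP: {vtp}\nReference: {golden_knowledge}")
    # Stage 3: emit Top-K (insecure code, patch) example pairs.
    return [call_llm(f"Give insecure-code/patch example pair #{i + 1} "
                     f"for this VTP:\n{vtp}") for i in range(k)]
```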
CrashTracker: Explainable Fault Localization for Framework-Specific Crashes
CrashTracker combines static analysis and LLMs to locate and explain crashing faults in applications built on complex frameworks, with a focus on Android framework-specific crashes.
- The approach uses exception-thrown summaries (ETS) to describe key elements related to framework-specific exceptions, extracted through static analysis.
- Data-tracking of ETS elements helps identify and prioritize potential buggy methods for a given crash.
- LLMs enhance result explainability using candidate information summaries (CIS), which provide multiple types of explanation-related contexts.
- CrashTracker achieved a 0.91 MRR value in fault localization precision and improved user satisfaction scores for fault explanations by 67.04% compared to static analysis alone.
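CrashTracker's division of labor (static analysis ranks candidates, the LLM explains the top one) can be sketched as below; the overlap score and prompt are our simplification of the ETS/CIS machinery, not the tool's actual logic:

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

@dataclass
class Candidate:
    method: str         # a potentially buggy method
    ets_overlap: float  # share of the exception-thrown summary (ETS) it touches
    cis: str            # candidate information summary (CIS) for explanations

def rank_and_explain(candidates: List[Candidate],
                     ask_llm: Callable[[str], str]) -> Tuple[List[Candidate], str]:
    # Static analysis output drives the ranking; no LLM is needed for this step.
    ranked = sorted(candidates, key=lambda c: c.ets_overlap, reverse=True)
    top = ranked[0]
    # The LLM turns the CIS contexts into a human-readable explanation.
    explanation = ask_llm(f"Explain why method {top.method} likely caused "
                          f"the crash, given this context:\n{top.cis}")
    return ranked, explanation
```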
UTGen: Enhancing Automated Unit Test Understandability with LLMs
UTGen combines search-based software testing and LLMs to improve the comprehensibility of automatically generated unit tests, which software engineers often find hard to read and work with.
- The tool enhances test understandability by contextualizing test data, improving identifier naming, and adding descriptive comments.
- A controlled experiment with 32 participants from academia and industry evaluated UTGen's impact on bug-fixing tasks.
- Results showed participants using UTGen test cases fixed up to 33% more bugs and required up to 20% less time compared to baseline test cases.
- Feedback from participants indicated that enhanced test names, test data, and variable names contributed to an improved bug-fixing process.
Source: Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests
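The kind of rewrite UTGen performs is easiest to see side by side. A toy illustration (in Python for brevity, with an invented class under test; UTGen itself works on search-based generated tests):

```python
class Account:
    """Toy production class under test."""
    def __init__(self, balance: int):
        self.balance = balance

    def withdraw(self, amount: int) -> bool:
        if amount <= 0 or amount > self.balance:
            return False
        self.balance -= amount
        return True

# Before: a typical search-based generated test, hard to read.
def test0():
    var0 = Account(-1)
    var1 = var0.withdraw(42)
    assert var1 == False

# After: the same test with contextual data, meaningful identifiers, and a
# descriptive comment, the three improvements UTGen targets.
def test_withdraw_rejected_when_account_is_overdrawn():
    # An overdrawn account must not allow further withdrawals.
    overdrawn_account = Account(balance=-1)
    withdrawal_accepted = overdrawn_account.withdraw(amount=42)
    assert withdrawal_accepted is False
```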
Kubernetes Manifest Generation: LLM-Based Approach Evaluation
A study proposing a benchmarking method to evaluate the effectiveness of LLMs in synthesizing Kubernetes manifests from Compose specifications.
- The benchmark uses Compose specifications as input, a standard widely adopted by application developers.
- Results show LLMs generally produce accurate manifests and compensate for simple specification gaps.
- Inline comments for readability were often omitted in the generated manifests.
- LLMs demonstrated low completion accuracy for atypical inputs with unclear intentions.
- The study aims to address the complexity barrier of Kubernetes for developers unfamiliar with the system.
Source: Migrating Existing Container Workload to Kubernetes -- LLM Based Approach and Evaluation
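A benchmark of this shape boils down to a translate-and-validate loop. A minimal sketch, assuming a placeholder `call_llm` and using a bare well-formedness check in place of the paper's actual scoring:

```python
import yaml  # PyYAML, used only to check that the output parses

def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def compose_to_manifests(compose_text: str) -> list:
    answer = call_llm(
        "Translate this Docker Compose file into Kubernetes manifests. "
        "Output YAML only, one document per resource:\n" + compose_text)
    docs = [d for d in yaml.safe_load_all(answer) if d]
    for doc in docs:
        # Every document should at least look like a Kubernetes resource.
        assert "kind" in doc and "apiVersion" in doc, "not a Kubernetes resource"
    return docs
```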
CodeJudge-Eval: A Benchmark for LLMs' Code Understanding
CodeJudge-Eval (CJ-Eval) is a new benchmark designed to assess LLMs' code understanding abilities through code judging rather than code generation.
- The benchmark challenges models to determine the correctness of provided code solutions, including various error types and compilation issues.
- CJ-Eval addresses limitations of traditional benchmarks, such as potential memorization of solutions, by using a diverse set of problems and a fine-grained judging system.
- Evaluation of 12 well-known LLMs on CJ-Eval reveals that even state-of-the-art models struggle with code understanding tasks.
- The benchmark will be available on GitHub, providing a new tool for researchers to assess and improve LLMs' code comprehension capabilities.
Source: CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
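The judging task itself is simple to state: given a problem and a candidate solution, predict the verdict instead of writing code. A sketch, with an assumed label set and prompt (the benchmark's fine-grained verdicts may differ):

```python
VERDICTS = ["Accepted", "Wrong Answer", "Compilation Error",
            "Runtime Error", "Time Limit Exceeded"]  # assumed label set

def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def judge(problem: str, solution: str) -> str:
    verdict = call_llm(
        f"Problem:\n{problem}\n\nCandidate solution:\n{solution}\n\n"
        f"Reply with exactly one verdict from {VERDICTS}.").strip()
    # Fall back conservatively if the model replies off-format.
    return verdict if verdict in VERDICTS else "Wrong Answer"
```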
SWE-bench-java: Java-focused GitHub Issue Resolution Benchmark
SWE-bench-java is a benchmark for evaluating LLMs' capabilities in resolving GitHub issues for Java projects, expanding on the original Python-focused SWE-bench.
- The benchmark includes a publicly available dataset, Docker-based evaluation environment, and leaderboard.
- Reliability of SWE-bench-java was verified by implementing SWE-agent and testing several powerful LLMs.
- Continuous maintenance and updates are planned for the coming months to improve the benchmark.
- The project aims to support multilingual issue resolution, addressing industry demand for expanded language coverage.
- Contributions and collaborations are welcomed to accelerate the benchmark's development and refinement.
Source: SWE-bench-java: A GitHub Issue Resolving Benchmark for Java
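To illustrate what Docker-based evaluation involves, here is a hedged sketch; the instance fields, the `/repo` image layout, and the commands are invented for illustration and are not SWE-bench-java's real schema or harness:

```python
import subprocess
import tempfile
from dataclasses import dataclass

@dataclass
class Instance:
    repo: str          # e.g. "apache/commons-lang" (hypothetical example)
    base_commit: str   # commit the model's patch is applied on top of
    test_command: str  # command that must pass once the issue is resolved

def evaluate(instance: Instance, model_patch: str, image: str) -> bool:
    with tempfile.NamedTemporaryFile("w", suffix=".diff", delete=False) as f:
        f.write(model_patch)
        patch_path = f.name
    # Apply the patch and run the tests inside the benchmark's container.
    script = (f"cd /repo && git checkout {instance.base_commit} && "
              f"git apply /tmp/patch.diff && {instance.test_command}")
    cmd = ["docker", "run", "--rm",
           "-v", f"{patch_path}:/tmp/patch.diff", image, "bash", "-lc", script]
    return subprocess.run(cmd).returncode == 0
```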
Vulnerability Handling in AI-Generated Code: Solutions and Challenges
A study examining the current state of LLM-based approaches for handling vulnerabilities in AI-generated code, focusing on detection, localization, and repair methods.
- The increasing use of LLMs for code generation in software development has led to improved productivity but also introduced security vulnerabilities.
- Traditional vulnerability handling processes, which often rely on manual review, are hard to apply to AI-generated code, which can contain many slightly varied instances of the same vulnerability.
- The paper explores recent progress in LLM-based approaches for vulnerability handling in AI-generated code.
- Open challenges in establishing reliable and scalable vulnerability handling processes for AI-generated code are highlighted.
Source: Vulnerability Handling of AI-Generated Code -- Existing Solutions and Open Challenges
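The detection, localization, and repair steps the paper surveys compose into a loop. A sketch of that process in the abstract, with `call_llm` and all prompts as placeholders rather than any specific tool:

```python
def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def handle_vulnerabilities(code: str, max_rounds: int = 3) -> str:
    for _ in range(max_rounds):
        # Detection: is anything wrong, and what class of weakness is it?
        finding = call_llm("Does this code contain a vulnerability? "
                           "Answer NONE or name the CWE:\n" + code)
        if finding.strip() == "NONE":
            break
        # Localization: narrow the finding down to specific lines.
        location = call_llm(f"Which lines exhibit {finding}?\n{code}")
        # Repair: rewrite only what is needed, preserving behavior.
        code = call_llm(f"Rewrite the code to fix {finding} at {location}, "
                        f"preserving behavior:\n{code}")
    return code
```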
AgoneTest: Automated Unit Test Generation and Evaluation System for Java Projects
AgoneTest is a system that automates the generation and evaluation of unit test suites for Java projects using LLMs, focusing on class-level test code generation.
- The system addresses limitations in previous LLM-based unit test generation studies, which often focused on simple, small-scale scenarios.
- AgoneTest generates more complex, real-world test suites and automates the entire process from test generation to assessment.
- A new dataset, built upon the Methods2Test dataset, allows comparison between human-written and LLM-generated tests.
- The system includes a comprehensive methodology for evaluating test quality, enabling scalable assessment of generated test suites.
- AgoneTest aims to reduce the cost and labor-intensive nature of unit test creation in software development.
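End to end, the loop AgoneTest automates looks roughly like this; the prompt, file path, and Maven invocation are illustrative stand-ins for the system's real pipeline and quality metrics:

```python
import subprocess

def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def generate_and_assess(java_class_source: str, project_dir: str) -> dict:
    # Generation: ask for a class-level suite, not just per-method snippets.
    test_code = call_llm("Write a JUnit 5 test class covering this Java "
                         "class:\n" + java_class_source)
    test_path = f"{project_dir}/src/test/java/GeneratedTest.java"  # assumed layout
    with open(test_path, "w") as f:
        f.write(test_code)
    # Assessment: compile and run; a coverage plugin (e.g. JaCoCo) would add
    # the quality metrics a fuller evaluation needs.
    result = subprocess.run(["mvn", "-q", "test"], cwd=project_dir)
    return {"compiled_and_passed": result.returncode == 0}
```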
LLM-Based Quality Assessment of Software Requirements
A study exploring the use of LLMs to evaluate and improve software requirements against the ISO 29148 standard.
- The research introduces an LLM-based approach for assessing quality characteristics of software requirements, aiming to support stakeholders in requirements engineering.
- The LLM demonstrates capabilities in evaluating requirements, explaining its decision-making process, and proposing improved versions of requirements.
- A validation study conducted with software engineers emphasizes the potential of LLMs in enhancing the quality of software requirements.
- This approach could significantly reduce development costs and improve overall software quality by ensuring high-quality requirements from the outset.
Source: Leveraging LLMs for the Quality Assurance of Software Requirements
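Concretely, such an assessment can be framed as one structured LLM call per requirement. A sketch, where the characteristic list is a subset drawn from ISO 29148 and the prompt and JSON schema are assumptions, not the paper's protocol:

```python
import json

# A subset of the quality characteristics ISO 29148 defines for requirements.
CHARACTERISTICS = ["unambiguous", "complete", "singular", "feasible", "verifiable"]

def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def assess_requirement(requirement: str) -> dict:
    raw = call_llm(
        f"Rate this requirement from 1 to 5 on each of {CHARACTERISTICS}, "
        f"justify each score, and propose an improved version. Reply as JSON "
        f"with keys 'scores', 'justifications', 'improved'.\n"
        f"Requirement: {requirement}")
    return json.loads(raw)
```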
Java Method Summarization: Comparing Lightweight Approaches to ASAP
A study comparing simple, lightweight approaches for automatically generating Java method summaries to the more complex Automatic Semantic Augmentation of Prompts (ASAP) method.
- Four lightweight approaches were evaluated against ASAP, using only the method body as input without requiring static program analysis or exemplars.
- Experiments were conducted on an Ericsson software project and replicated with open-source projects Guava and Elasticsearch.
- Performance was measured across eight similarity metrics, with one lightweight approach performing as well as or better than ASAP in both Ericsson and open-source projects.
- An ablation study revealed that the proposed approaches were less influenced by method names compared to ASAP, suggesting more comprehensive derivation from the method body.
- The findings indicate potential for rapid deployment of lightweight summarization techniques in commercial software development environments.
Source: Icing on the Cake: Automatic Code Summarization at Ericsson
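Stripped to its essence, a lightweight approach is a single prompt over the method body, with none of the static-analysis facts or retrieved exemplars ASAP adds. A sketch with placeholder prompt wording:

```python
def call_llm(prompt: str) -> str:
    """Placeholder for any LLM completion API."""
    raise NotImplementedError

def summarize_method(method_body: str) -> str:
    # Only the body is sent: no static analysis, no exemplars, no repo metadata.
    return call_llm("Write a one-sentence summary of this Java method:\n"
                    + method_body)
```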
Enhancing Code Maintainability in LLM-Generated Python
A study focusing on improving the maintainability of Python code generated by LLMs through fine-tuning and specialized datasets.
- The research addresses the growing concern of code maintainability in LLM-generated output, an aspect often overlooked in favor of functional accuracy and testing success.
- A specially designed dataset was created for training and evaluating the model, ensuring a comprehensive assessment of code maintainability.
- The core of the study involves fine-tuning an LLM for code refactoring, aiming to enhance readability, reduce complexity, and improve overall maintainability.
- Evaluation results indicate significant improvements in code maintainability standards, suggesting a promising direction for AI-assisted software development.
Source: Better Python Programming for all: With the focus on Maintainability
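What improved maintainability means in code terms is best shown by example. A toy before/after (our illustration of the kind of refactoring the study fine-tunes for, not an output from its model):

```python
# Before: dense, deeply nested code of the kind an unconstrained model may emit.
def f(d):
    r = []
    for k in d:
        if d[k] is not None:
            if isinstance(d[k], str):
                if len(d[k]) > 0:
                    r.append(k)
    return r

# After: same behavior with a descriptive name, a docstring, type hints,
# and the nesting flattened into one comprehension.
def keys_with_nonempty_strings(record: dict) -> list:
    """Return the keys whose values are non-empty strings."""
    return [key for key, value in record.items()
            if isinstance(value, str) and value]
```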
Generative LLMs in Requirements Engineering: Potential and Challenges
A discussion on how generative LLMs like GPT could transform Requirements Engineering (RE) by automating various tasks, emphasizing the importance of precise prompts for effective interactions.
- LLMs have the potential to revolutionize RE processes through automation of tasks.
- Precise prompts are crucial for effective interactions with LLMs in RE contexts.
- Human evaluation remains essential in leveraging LLM capabilities for RE.
- Prompt engineering is a key skill for maximizing the benefits of LLMs in RE workflows.
Source: From Specifications to Prompts: On the Future of Generative LLMs in Requirements Engineering
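The gap between a vague and a precise prompt is concrete enough to show directly; both examples below are invented, not taken from the paper:

```python
# A vague prompt leaves scope, format, and criteria for the model to guess.
vague_prompt = "Improve these requirements."

# A precise prompt pins down the role, task, rewrite template, and output
# format, reading almost like a small specification itself.
precise_prompt = (
    "You are a requirements engineer. For each requirement below: "
    "(1) flag any ambiguity, (2) rewrite it in the form "
    "'The system shall <capability> <condition> <criterion>', and "
    "(3) propose one acceptance test. Output a numbered list.\n\n"
    "REQ-1: The app should be fast."
)
```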
ChatGPT App Ecosystem: Distribution, Deployment, and Security Analysis
A comprehensive study of the ChatGPT app ecosystem, examining distribution, deployment models, and security implications of third-party plugins.
- The study analyzes the integration of LLMs with third-party apps, focusing on ChatGPT plugins distributed through OpenAI's plugin store.
- Findings reveal an uneven distribution of functionality among ChatGPT plugins, with certain topics being more prevalent than others.
- Severe flaws in authentication and user data protection were identified in third-party app APIs integrated with LLMs, raising concerns about security and privacy in the ecosystem.
- The research aims to provide insights for secure and sustainable development of this rapidly evolving ecosystem, addressing potential barriers to broader adoption by developers and users.
Source: Exploring ChatGPT App Ecosystem: Distribution, Deployment and Security
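The authentication findings can be illustrated with a simple manifest check. ChatGPT plugins declared their auth scheme in an ai-plugin.json file; the audit below and its severity label are our illustration, not the paper's methodology:

```python
import json

def audit_auth(manifest_text: str) -> str:
    """Flag plugin manifests whose backing API takes unauthenticated traffic."""
    manifest = json.loads(manifest_text)
    auth_type = manifest.get("auth", {}).get("type", "missing")
    if auth_type in ("none", "missing"):
        return "HIGH: API is callable without any credentials"
    return f"auth type '{auth_type}' declared; still verify token handling"

# Example: a manifest that declares no authentication at all.
print(audit_auth('{"auth": {"type": "none"}}'))
```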
LLM-Generated Code Documentation: A Quantitative and Qualitative Study
A study evaluating the use of OpenAI GPT-3.5 for generating Javadoc documentation, comparing AI-generated comments with original human-written ones through both quantitative and qualitative assessments.
- The research utilized GPT-3.5 to regenerate Javadoc for 23,850 code snippets, including methods and classes.
- Qualitative analysis showed 69.7% of AI-generated comments were equivalent (45.7%) or required minor changes to be equivalent (24.0%) to the original documentation.
- 22.4% of GPT-generated comments were rated as superior in quality compared to the original human-written documentation.
- The study revealed inconsistencies in using quantitative metrics like BLEU for assessing comment quality. Some AI-generated comments perceived as higher quality were unfairly penalized by BLEU scores.
- Findings suggest LLMs could potentially automate and improve code documentation, easing the burden on developers while maintaining or enhancing quality.
Source: Using Large Language Models to Document Code: A First Quantitative and Qualitative Assessment
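The study's core step, regenerating documentation for a bare method, reduces to one call per snippet. A sketch, with `call_llm` and the prompt as assumptions rather than the paper's exact setup:

```python
def call_llm(prompt: str) -> str:
    """Placeholder for a GPT-3.5-style chat completion API."""
    raise NotImplementedError

def regenerate_javadoc(java_method: str) -> str:
    # The original comment is assumed stripped, so the model sees only code.
    return call_llm(
        "Write a Javadoc comment for this Java method. Include @param and "
        "@return tags where applicable. Return only the comment:\n" + java_method)
```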