Safeguarding Your Software Supply Chain: The Power of Multi-Agent AI in Detecting Malicious Code
Discover how multi-agent AI systems revolutionize software supply chain security by detecting malicious PyPI packages with high accuracy and efficiency, protecting businesses from evolving threats.
The Escalating Threat of Malicious Code in Open-Source Software
Open-source software forms the bedrock of modern digital infrastructure, offering unparalleled flexibility and accelerating development across industries. Platforms like the Python Package Index (PyPI) serve as vast repositories where developers globally share and reuse code, significantly boosting productivity. However, this accessibility also creates a critical vulnerability: the potential for malicious actors to inject harmful code into widely used libraries, posing a growing and insidious threat to the entire software supply chain.
These malicious packages are often cleverly disguised as legitimate components, making their detection particularly challenging. Attackers employ various sophisticated techniques, including typosquatting (creating packages with names similar to popular ones), embedding obfuscated strings, and executing remote payloads during installation or at first import. The proliferation of AI-based systems trained on publicly available repositories further compounds this risk, as these systems can inadvertently ingest and perpetuate malicious code. Businesses must prioritize robust defenses to protect their operations from these evolving threats.
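To make these techniques concrete, the hypothetical setup.py fragment below shows the kind of pattern attackers hide in install-time hooks: a base64-obfuscated string that is decoded and executed when the package is installed under a typosquatted name. The package name and payload are invented purely for illustration.

```python
# Hypothetical, simplified illustration of a malicious setup.py pattern
# (package name and payload are invented; shown only to explain the technique).
import base64
from setuptools import setup
from setuptools.command.install import install

# Obfuscated string: decodes to code the attacker wants to run at install time.
_PAYLOAD = base64.b64decode(b"cHJpbnQoJ3B3bmVkJyk=")  # decodes to: print('pwned')


class _PostInstall(install):
    """Custom install command that executes the hidden payload during `pip install`."""

    def run(self):
        install.run(self)
        exec(_PAYLOAD)  # a real attack would typically fetch and run a remote payload here


setup(
    name="reqeusts",  # typosquatted name mimicking the popular "requests" package
    version="1.0.0",
    cmdclass={"install": _PostInstall},
)
```

Because nothing suspicious appears as a readable string in the source, a reviewer skimming the file, or a scanner matching literal keywords, can easily miss it.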
The Limitations of Traditional Security Approaches
Historically, organizations have relied on traditional security methods such as static rule-based scanners or signature-driven classifiers to identify potential threats. While these tools have their place, they often prove insufficient against advanced malicious code: they struggle to detect subtle semantic patterns, complex obfuscation, indirect API usage, and the logic-level concealment that sophisticated attackers frequently employ. This leaves critical gaps in an organization's defense, especially in the dynamic, ever-changing landscape of open-source software.
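A minimal sketch of what such a signature-driven check amounts to appears below; the rules are invented for illustration. Note that the obfuscated install hook shown earlier contains none of the flagged substrings, which is exactly why this class of scanner misses it.

```python
# Minimal sketch of a signature-style scanner (rules invented for illustration).
# It flags files containing known-bad substrings, but misses anything obfuscated.
import re

SIGNATURES = [
    re.compile(r"os\.system\("),
    re.compile(r"subprocess\.Popen\("),
    re.compile(r"eval\(requests\.get\("),
]

def looks_malicious(source: str) -> bool:
    """Return True if any hard-coded signature matches the source text."""
    return any(sig.search(source) for sig in SIGNATURES)

# A base64-decoded exec() payload, as in the earlier example, contains none of
# these literal strings, so a purely signature-driven check returns False.
print(looks_malicious("exec(base64.b64decode(b'...'))"))  # -> False
```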
The emergence of Large Language Models (LLMs) has opened new possibilities for software analysis. Their ability to interpret code semantics and reason about context through natural language prompts makes them attractive for tasks like code summarization, defect detection, and vulnerability identification. However, applying LLMs effectively in security-critical domains, such as detecting malicious code in open-source repositories, brings its own challenges. Many existing attempts treat LLMs as monolithic "black boxes," where a single model handles multiple stages of analysis without adequate modularity or transparency, leading to less reliable and less interpretable results.
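The sketch below illustrates that monolithic pattern: one prompt asks a single general-purpose model to read a package and return a verdict in one step, leaving no intermediate outputs to audit. The `complete()` helper is a hypothetical stand-in for whatever LLM client a deployment actually uses.

```python
# Sketch of the monolithic "black box" pattern: one model, one prompt,
# no intermediate, auditable stages. `complete()` is a hypothetical stand-in.
def complete(prompt: str) -> str:
    raise NotImplementedError("call your LLM provider here")

def classify_package_monolithic(package_source: str) -> str:
    prompt = (
        "You are a security analyst. Read the following Python package source, "
        "decide whether it is malicious or benign, and answer with one word.\n\n"
        f"{package_source}"
    )
    return complete(prompt)  # single opaque verdict; no per-stage reasoning to inspect
```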
Embracing Multi-Agent AI Systems for Next-Generation Security
To overcome the limitations of monolithic LLM applications, the software engineering community is increasingly adopting multi-agent systems. These systems orchestrate networks of specialized LLMs, decomposing a complex task into smaller subtasks handled by dedicated roles. Each "agent" focuses on one aspect of the problem and collaborates with the others toward a comprehensive solution, often surpassing the capabilities of single-agent pipelines.
This modular approach significantly enhances transparency and interpretability in security pipelines. By assigning distinct roles and responsibilities to each AI agent, the decision-making process becomes clearer and more auditable. This paradigm shift from singular model invocation to structured, distributed reasoning is proving to be a highly effective strategy for tackling intricate security challenges like detecting malicious software, and is a principle ARSA Technology applies across various industries.
Introducing LAMPS: A Collaborative AI System for Package Security
One such pioneering innovation is LAMPS (LLM-based Multi-Agent system for detecting Malicious PyPI PackageS), which exemplifies the power of collaborative AI for cybersecurity. LAMPS features a modular design with four distinct, role-specific agents that work in concert: a package retrieval agent, a file extraction agent, a classification agent, and a verdict aggregation agent. This specialized division of labor ensures that each stage of the detection process is handled with precision and depth.
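At a high level, this division of labor can be pictured as a pipeline in which each stage's output feeds the next. The function names, signatures, and the simple "any flagged file flags the package" aggregation rule below are illustrative stand-ins, not the LAMPS implementation.

```python
# Illustrative pipeline mirroring the four agent roles described above.
# Names, signatures, and the aggregation rule are invented stand-ins.
from typing import Dict, List

def retrieve_package(package_name: str) -> bytes:
    """Package retrieval agent: download the release artifact from PyPI."""
    ...

def extract_files(archive: bytes) -> List[str]:
    """File extraction agent: unpack the archive and select Python sources to inspect."""
    ...

def classify_file(source: str) -> Dict[str, float]:
    """Classification agent: score one file as malicious vs. benign."""
    ...

def aggregate_verdict(file_scores: List[Dict[str, float]]) -> str:
    """Verdict aggregation agent: combine per-file scores into a package-level decision."""
    return "malicious" if any(s.get("malicious", 0.0) > 0.5 for s in file_scores) else "benign"

def analyze_package(package_name: str) -> str:
    archive = retrieve_package(package_name)
    sources = extract_files(archive)
    return aggregate_verdict([classify_file(src) for src in sources])
```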
The system is coordinated using the CrewAI framework, an orchestration layer that facilitates communication and task distribution among the agents. The classification agent employs a fine-tuned CodeBERT model, specifically trained to identify subtle patterns in Python code. Meanwhile, the other agents leverage LLaMA 3 models for robust contextual reasoning, allowing them to interpret complex scenarios and make informed decisions. This design represents a significant leap forward, transforming existing security frameworks into intelligent, real-time threat detection systems, similar to how ARSA's AI BOX - Basic Safety Guard enhances real-time security compliance in industrial settings.
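For the classification step itself, a fine-tuned CodeBERT checkpoint can be queried through the Hugging Face transformers sequence-classification interface, roughly as sketched below. The checkpoint path and label ordering are assumptions for illustration; in LAMPS this model sits behind the classification agent rather than being called directly.

```python
# Sketch of CodeBERT-based file classification via Hugging Face transformers.
# "path/to/finetuned-codebert" and the label order are assumptions for illustration.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModelForSequenceClassification.from_pretrained("path/to/finetuned-codebert")

def classify_source(source_code: str) -> dict:
    """Return benign/malicious probabilities for one Python source file."""
    inputs = tokenizer(source_code, truncation=True, max_length=512, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    probs = torch.softmax(logits, dim=-1).squeeze(0)
    return {"benign": probs[0].item(), "malicious": probs[1].item()}
```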
Proven Accuracy and Real-World Impact
The effectiveness of LAMPS has been rigorously evaluated on two complementary datasets, demonstrating exceptional performance in identifying malicious PyPI packages. On D1, a balanced dataset comprising 6,000 setup.py files, LAMPS achieved an impressive 97.7% accuracy, outperforming established state-of-the-art approaches such as MPHunter and TF-IDF stacking models. This high level of accuracy ensures that critical threats are identified swiftly and reliably, minimizing exposure for businesses.
Furthermore, on D2, a more realistic multi-file dataset with 1,296 files and natural class imbalance, LAMPS reached an outstanding 99.5% accuracy and 99.5% balanced accuracy. These results significantly surpassed RAG-based approaches and fine-tuned single-agent baselines, with McNemar's test confirming the improvements as highly statistically significant. Such accuracy on realistic, imbalanced data highlights LAMPS's robustness and its practical applicability to real-world software supply chain security, offering a reliable layer of defense that complements other AI-driven solutions like ARSA AI Video Analytics for general security monitoring.
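For readers who want to reproduce this style of evaluation, both metrics and the significance test are available in standard Python libraries; a rough sketch follows, using placeholder arrays rather than the study's data.

```python
# Sketch of the evaluation metrics mentioned above, using standard libraries.
# y_true and the model predictions are placeholders, not the study's datasets.
import numpy as np
from sklearn.metrics import accuracy_score, balanced_accuracy_score
from statsmodels.stats.contingency_tables import mcnemar

y_true  = np.array([1, 0, 1, 1, 0, 0, 1, 0])   # 1 = malicious, 0 = benign
model_a = np.array([1, 0, 1, 1, 0, 0, 1, 1])   # e.g. the multi-agent system
model_b = np.array([1, 0, 0, 1, 0, 1, 1, 1])   # e.g. a single-agent baseline

print("accuracy:", accuracy_score(y_true, model_a))
print("balanced accuracy:", balanced_accuracy_score(y_true, model_a))

# McNemar's test compares the two classifiers on the same samples via a 2x2
# table of agreement/disagreement about whether each prediction was correct.
a_correct = model_a == y_true
b_correct = model_b == y_true
table = [
    [np.sum(a_correct & b_correct),  np.sum(a_correct & ~b_correct)],
    [np.sum(~a_correct & b_correct), np.sum(~a_correct & ~b_correct)],
]
print(mcnemar(table, exact=True))  # exact test for small samples; reports the p-value
```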
The Business Imperative: Strengthening Your Software Supply Chain
For businesses, the implications of advanced malicious code detection are profound. The ability to identify and mitigate threats within the software supply chain protects not only sensitive data and intellectual property but also ensures operational continuity and maintains customer trust. By deploying sophisticated AI-powered systems, organizations can move beyond reactive security measures to a proactive, intelligent defense strategy.
Solutions like LAMPS showcase the feasibility and tangible benefits of distributed LLM reasoning and modular multi-agent designs in enterprise security. This technology can be adapted to various enterprise needs, from verifying the integrity of internal software components to enhancing overall cybersecurity posture. For organizations looking to integrate advanced AI capabilities into their existing security infrastructure, platforms like ARSA AI API offer robust tools to build custom solutions, while our expertise, honed since 2018, ensures seamless integration and impactful results. Investing in such cutting-edge AI security measures is no longer optional but a strategic imperative for navigating the complexities of the digital landscape.
Ready to explore how advanced AI can fortify your enterprise security and protect your software supply chain? Discover ARSA Technology's innovative solutions and enhance your operational resilience. For a free consultation, contact ARSA today.