# iris

**Repository Path**: bingosxs/iris

## Basic Information

- **Project Name**: iris
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-07-06
- **Last Updated**: 2025-07-06

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# IRIS 
IRIS is a neurosymbolic framework that combines LLMs with static analysis for security vulnerability detection. IRIS uses LLMs to generate source and sink specifications and to filter false positive vulnerable paths. 

- [Workflow](#workflow)
- [Dataset](#dataset)
- [Results](#results)
- [Environment Setup](#environment-setup)
- [Quickstart](#quickstart)
- [Supported CWEs](#supported-cwes)
- [Supported Models](#supported-models)
- [Adding CWEs](docs/adding_cwes.md)
- [Contributing](#contributing-and-feedback)
- [Citation](#citation)
- [Team](#team)

## Workflow

At a high level, IRIS takes a project and a CWE (vulnerability class, such as path traversal vulnerability or CWE-22) as input, statically analyzes the project, and outputs a set of potential vulnerabilities (of type CWE) in the project. To achieve this, IRIS takes the following steps:

![iris workflow](iris_arch.png)

1. First we create CodeQL queries to collect external APIs in the project and all internal function parameters. 
2. We use an LLM to classify the external APIs as potential sources, sinks, or taint propagators. In another query, we use an LLM to classify the internal function parameters as potential sources. We call these taint specifications.
3. Using the taint specifications from step 2, we build a project-specific and cwe-specific (e.g., for CWE 22) CodeQL query. 
4. Then we run the query to find vulnerabilities in the given project and post-process the results. 
5. We provide the LLM the post-processed results to filter out false positives and determine whether a CWE is detected.  

## Dataset 
We have curated a dataset of Java projects, containing 120 real-world previously known vulnerabilities across 4 popular vulnerability classes. The dataset is also available to use with the Hugging Face datasets library. 

[CWE-Bench-Java](https://github.com/iris-sast/cwe-bench-java)

[CWE-Bench-Java on Hugging Face](https://huggingface.co/datasets/iris-sast/CWE-Bench-Java)

## Results 
Results on the effectiveness of IRIS across 121 projects and 9 LLMs can be found at `/results`. Each model has a unique CSV file, with the following structure as an example.

| CWE ID | CVE | Author | Package | Tag | Recall | Alerts | Paths | TP Alerts | TP Paths | Precision | F1 |
| -- | ------------ | ------ | -------|----------|-----------------|------------------------|------------|------------|-------------|-----------------|----------------|
| CWE-022 | CVE-2016-10726 | DSpace | DSpace | 4.4 | 0 | 31 | 63 | 0 | 0 | 0 | 0 |

`None` refers to data that was not collected, while `N/A` refers to a measure that cannot be calculated, either because of missing data or a division by zero.

## Environment Setup

You need a set of JDKs, Gradle, and Maven versions to run IRIS with CWE-Bench-Java, a benchmark of 120 vulnerable Java projects. These versions are indicated in ```.json``` files.

We support multiple ways to run IRIS:
- [Environment Setup for Linux](#environment-setup-for-linux)
- [Environment Setup for Docker](#environment-setup-for-docker)
- [Environment Setup for Other Systems](#environment-setup-for-other-systems)

### Environment Setup for Linux
<details>
<summary>Installation Steps</summary>

#### Step 1. Clone IRIS 
First, clone the repository. We have included `cwe-bench-java` as a submodule, so use the following command to clone correctly:
```bash
$ git clone https://github.com/iris-sast/iris --recursive
```

#### Step 2. Conda environment  
Run `scripts/setup_environment.sh`. 
```bash
$ chmod +x scripts/setup_environment.sh
$ bash ./scripts/setup_environment.sh
$ source ~/.bashrc # make sure to source .bashrc for PATH update
```
If you are building for Mac, run the following bash command instead.
```bash
$ bash ./scripts/setup_environment.sh --mac
```
This will download CodeQL directly, instead of installing our patched version. Please note that this does not ensure reproducibility of results.

This will do the following:
- creates a conda environment specified by environment.yml
- installs our [patched version of CodeQL 2.15.3](https://github.com/iris-sast/iris/releases/tag/codeql-0.8.3-patched). This version of CodeQL **is necessary** for IRIS. To prevent confusion in case users already have an existing CodeQL version, we unzip this within the root of the iris directory. Then we add a PATH entry to the path of the patched CodeQL's binary.
- creates a directory to store CodeQL databases.

If you have a CUDA-capable GPU and want to enable hardware acceleration, install the appropriate CUDA toolkit, for example:
```bash
$ conda install pytorch-cuda=12.1 -c nvidia -c pytorch
```
Replace 12.1 with the CUDA version compatible with your GPU and drivers, if needed.

#### Step 3: Get the JDKs needed 
We have included CWE-Bench-Java as a submodule in IRIS in the data folder. We have also provided scripts to fetch and build Java projects to be used with IRIS. 

For building, we need Java distributions as well as Maven and Gradle for package management. In case you have a different system than Linux x64, please modify `data/cwe-bench-java/scripts/jdk_version.json`, `data/cwe-bench-java/scripts/mvn_version.json`, and `data/cwe-bench-java/scripts/gradle_version.json` to specify the corresponding JDK/MVN/Gradle files. For Mac in particular, please install the JDKs using the ```.dmg``` files and adjust the ```.json``` accordingly. In addition, please prepare 3 versions of JDK and put them under the ```java-env``` folder. Oracle requires an account to download the JDKs, and we are unable to provide an automated script. Download from the following URLs:

JDK 7u80: https://www.oracle.com/java/technologies/javase/javase7-archive-downloads.html

JDK 8u202: https://www.oracle.com/java/technologies/javase/javase8-archive-downloads.html

JDK 17: https://www.oracle.com/java/technologies/javase/jdk17-archive-downloads.html

At this point, your `java-env` directory should look like 
```
- data/cwe-bench-java/java-env/
  - jdk-7u80-linux-x64.tar.gz
  - jdk-8u202-linux-x64.tar.gz
  - jdk-17_linux-x64_bin.tar.gz
```

After this proceed to step 4 on fetching and building Java projects.

#### Step 4. Fetch and build Java projects
Now run the fetch and build script. You can also choose to fetch and not build, or specify a set of projects. You can find project names in the project_slug column in `cwe-bench-java/data/build_info.csv`.
```bash
# fetch projects and build them
$ python3 data/cwe-bench-java/scripts/setup.py

# fetch projects and don't build them
$ python3 data/cwe-bench-java/scripts/setup.py --no-build

# example - build the perwendel__spark_CVE-2018-9159_2.7.1 project 
$ python3 data/cwe-bench-java/scripts/setup.py --filter perwendel__spark_CVE-2018-9159_2.7.1

# example - only build projects under CWE-022 and CWE-078
$ python3 data/cwe-bench-java/scripts/setup.py --cwe CWE-022 CWE-078 

# example - only build keycloak projects 
$ python3 data/cwe-bench-java/scripts/setup.py --filter keycloak 

# example - do not build any apache related projects
$ python3 data/cwe-bench-java/scripts/setup.py --exclude apache       
```
This will create the `build-info` and `project-sources` directories. It will also install JDK, Maven, and Gradle versions used to build the projects in `cwe-bench-java`. `build-info` is used to store build information and `project-sources` is where the fetched projects are stored.

#### Step 5. Generate CodeQL databases
To use CodeQL, you will need to generate a CodeQL database for each project. We have provided a script to automate this. The script will generate databases for all projects found in `data/cwe-bench-java/project-sources`. To generate a database for a specific project, use the `--project` argument. 
```bash
# build CodeQL databases for all projects in project-sources
$ python3 scripts/build_codeql_dbs.py 

# build a specific CodeQL database given the project slug
$ python3 scripts/build_codeql_dbs.py --project perwendel__spark_CVE-2018-9159_2.7.1 
```
Note - if the script fails due to trying to locate CodeQL, run the following to find the path:
```bash
$ which codeql
```
Then update `CODEQL_DIR` in `src/config.py`. 

#### Step 6. Check IRIS directory configuration in `src/config.py`
By running the provided scripts, you won't have to modify `src/config.py`. Double check that the paths in the configuration are correct. Each path variable has a comment explaining its purpose.

</details>

### Environment Setup for Docker
<details>
<summary>Installation Steps</summary>

#### Step 1. Docker Setup
Clone IRIS. The Dockerfile has scripts that will create the conda environment, clone `cwe-bench-java`, and install the patched CodeQL version. Before building the Dockerfile you will need download the JDK versions needed. Then the Dockerfile copies them to the container. 

#### Step 2. Get the JDKs needed 
For building, we need Java distributions as well as Maven and Gradle for package management. In addition, please prepare 3 versions of JDK and **put them in the IRIS root directory**. Oracle requires an account to download the JDKs, and we are unable to provide an automated script. Download from the following URLs:

JDK 7u80: https://www.oracle.com/java/technologies/javase/javase7-archive-downloads.html

JDK 8u202: https://www.oracle.com/java/technologies/javase/javase8-archive-downloads.html

JDK 17: https://www.oracle.com/java/technologies/javase/jdk17-archive-downloads.html (Make sure to download specifically JDK 17, not 17.X.X)

At this point, your `iris` directory should look like 
```
- /iris
  - jdk-7u80-linux-x64.tar.gz
  - jdk-8u202-linux-x64.tar.gz
  - jdk-17_linux-x64_bin.tar.gz
```

#### Step 3. Build and run the Docker container
# build for Windows/ Mac with Intel
$ docker build -t iris .
# build for ARM architecture/ Apple Silicon
$ docker build --platform=linux/amd64 -t iris .
# run
$ docker run -it iris
# If run with all GPUs 
$ docker run --gpus all -it iris
# run with specific GPUs
$ docker run --gpus '"device=0,1"' -it iris
```

If you choose to run with GPUs and have a CUDA-capable GPU and want to enable hardware acceleration, install the appropriate CUDA toolkit:
```bash
$ conda activate iris
$ conda install pytorch-cuda=12.1 -c nvidia -c pytorch
```
Replace 12.1 with the CUDA version compatible with your GPU and drivers, if needed.

Confirm that the patched CodeQL is in your PATH.

After this, proceed to step 4 on fetching and building Java projects.

#### Step 4. Fetch and build Java projects
Now run the fetch and build script. You can also choose to fetch and not build, or specify a set of projects. You can find project names in the project_slug column in `cwe-bench-java/data/build_info.csv`.
```bash
# fetch projects and build them
$ python3 data/cwe-bench-java/scripts/setup.py

# fetch projects and don't build them
$ python3 data/cwe-bench-java/scripts/setup.py --no-build

# example - build the perwendel__spark_CVE-2018-9159_2.7.1 project 
$ python3 data/cwe-bench-java/scripts/setup.py --filter perwendel__spark_CVE-2018-9159_2.7.1

# example - only build projects under CWE-022 and CWE-078
$ python3 data/cwe-bench-java/scripts/setup.py --cwe CWE-022 CWE-078 

# example - only build keycloak projects 
$ python3 data/cwe-bench-java/scripts/setup.py --filter keycloak 

# example - do not build any apache related projects
$ python3 data/cwe-bench-java/scripts/setup.py --exclude apache       
```
This will create the `build-info` and `project-sources` directories. It will also install JDK, Maven, and Gradle versions used to build the projects in `cwe-bench-java`. `build-info` is used to store build information and `project-sources` is where the fetched projects are stored.

#### Step 5. Generate CodeQL databases
To use CodeQL, you will need to generate a CodeQL database for each project. We have provided a script to automate this. The script will generate databases for all projects found in `data/cwe-bench-java/project-sources`. To generate a database for a specific project, use the `--project` argument. 
```bash
# build CodeQL databases for all projects in project-sources
$ python3 scripts/build_codeql_dbs.py 

# build a specific CodeQL database given the project slug
$ python3 scripts/build_codeql_dbs.py --project perwendel__spark_CVE-2018-9159_2.7.1 
```
Note - if the script fails due to trying to locate CodeQL, run the following to find the path:
```bash
$ which codeql
```
Then update `CODEQL_DIR` in `src/config.py`. 

#### Step 6. Check IRIS directory configuration in `src/config.py`
By running the provided scripts, you won't have to modify `src/config.py`. Double check that the paths in the configuration are correct. Each path variable has a comment explaining its purpose.

</details>

### Environment Setup for Other Systems

**Mac**: If you are using a Mac device, you can build IRIS one of two ways. You can build following the [Docker instructions](#environment-setup-for-docker) above, or you can build natively on Mac using the [Linux instructions](#environment-setup-for-linux). However, it is far more difficult to install the dependencies for IRIS natively, and we cannot insure reproducibility of results. Therefore, it is strongly advised to build using Docker.

**Windows**: We have not evaluated IRIS on Windows machines. If you are interested in extending IRIS's support to Windows machines, please feel free to raise a PR.

## Quickstart
Make sure you have followed all of the environment setup instructions before proceeding! 

`src/neusym_vul.py` is used to analyze one specific project. `src/neusym_vul_for_query.py` is used to analyze multiple projects. Results are written to the `output` directory.

See the [Supported CWEs](#supported-cwes) section for `--query` arguments and the [Supported Models](#supported-models) section for `--llm` arguments.

The following is an example of using IRIS to analyze perwendel__spark_CVE-2018-9159_2.7.1 for vulnerabilities that fall under CWE-022, using qwen2.5-coder-7b. Query `cwe-022wLLM` refers to [cwe-22 path traversal](https://cwe.mitre.org/data/definitions/22.html). You should be able to immediately execute this command to see an example of an evaluation.  

```bash
$ python3 src/neusym_vul.py --query cwe-022wLLM --run-id <SOME_ID> --llm qwen2.5-coder-7b perwendel__spark_CVE-2018-9159_2.7.1
```

The following is an example of using IRIS to analyze zerotunaround for vulnerabilities that fall under CWE-022, using GPT-4. Query `cwe-022wLLM` refers to [cwe-22 path traversal](https://cwe.mitre.org/data/definitions/22.html). 
```bash
$ python3 src/neusym_vul.py --query cwe-022wLLM --run-id <SOME_ID> --llm gpt-4 zeroturnaround__zt-zip_CVE-2018-1002201_1.12
```

### Outputs

After the steps above, IRIS will generate `results.sarif` and `results_pp.sarif` files in `output/[project-name]/cwe-XXwLLM` containing the vulnerabilities found in the project before and after posthoc filtering. You can download [Sarif Viewer](https://marketplace.visualstudio.com/items?itemName=MS-SarifVSCode.sarif-viewer) to view the sarif files. Additionally, `results.csv` contains the vulnerabilities in a simplified form.

<details>
  <summary>Example Output directory structure (using run-id test1): </summary>
  
```
output
├── common
│   └── test1
│       └── cwe-022 (cache of common specs, can be reused across projects)
└── perwendel__spark_CVE-2018-9159_2.7.1
    └── test1
        ├── common
        │   ├── func_params.csv (list of all function parameters in the project)
        │   ├── llm_labelled_source_func_params.json (function parameters labelled as sources by LLM)
        │   ├── logs (Log files and raw LLM outputs)
        │   │   └── label_func_params
        │   │       ├── raw_llm_response_0.txt
        │   │       ├── raw_llm_response_20.txt
        │   │       ...
        │   └── source_func_param_candidates.csv
        ├── cwe-022 
        │   ├── MySinks.qll (Codeql file listing all sink specifications returned by LLM)
        │   ├── MySources.qll (Codeql file listing all source specifications returned by LLM)
        │   ├── MySummaries.qll (Codeql file listing all summary specifications returned by LLM)
        │   ├── Spec.yml (Alternate yml file listing all the specs)
        │   ├── candidate_apis.csv (candidate specs)
        │   ├── external_apis.csv 
        │   ├── llm_labelled_sink_apis.json (sinks labelled by LLM)
        │   ├── llm_labelled_source_apis.json (sources labelled by LLM)
        │   ├── llm_labelled_taint_prop_apis.json (taint propagators labelled by LLM)
        │   └── logs (intermediate logs)
        │       └── label_apis
        │           ├── raw_llm_response_0.txt
        │           ├── ...
        │           ├── raw_user_prompt_0.txt
        │           ├── ...
        ├── cwe-022wLLM (final results with all the vulnerabilities)
        │   ├── results.csv
        │   ├── results.sarif (before contextual filtering)
        │   └── results_pp.sarif (after contextual filtering)
        ├── cwe-022wLLM-final
        │   └── results.json (results statistics)
        ├── cwe-022wLLM-posthoc-filter (results of contextual filtering)
        │   ├── logs
        │   │   ├── raw_llm_response_0_0.txt
        │   │   ├── raw_llm_response_0_1.txt
        │   │   ├── ...
        │   ├── results.json
        │   ├── results.sarif
        │   └── stats.json
        ├── fetch_* (intermediate analysis results)
        │   ├── ...
        └── log (Main log files)
            ├── ...       
```

</details>

### Running IRIS on a new Java project

The easiest way to run IRIS on a Java project not in CWE-Bench-Java is to make the following changes:

1. Add the project info to `data/cwe-bench-java/data/project_info.csv`. For instance, to run the latest perwendel/spark version:

```
ID,perwendel__spark_latest,,,,,perwendel,spark,latest,,,,
```

The only required fields are: project slug (use a unique name), github username, and github tag if any.

2. Clone the project to `data/cwe-bench-java/project-sources`. Use the same folder name as the slug used above.
3. Add build info to `data/cwe-bench-java/data/build_info.csv`. For instance, you can add: `perwendel__spark_latest,success,8u202,3.5.0,n/a,n/a` to use java 8 and mvn 3.5.0. Please use appropriate java/mvn/gradle versions as needed.
4. Build the Java project.
```bash
$ python3 data/cwe-bench-java/scripts/setup.py --filter [project slug]
```
5. Generate the CodeQL database.
```bash
$ python3 scripts/build_codeql_dbs.py --project [project slug]
```
6. Provide a list of internal packages in `data/cwe-bench-java/package-names/[project slug].txt`. This should contain the package names of all internal packages of the project. E.g., `spark` for `perwendel/spark`. The following command uses CodeQL to extract the internal packages and writes them to the required txt file in the `package-names` directory. We provided a script to automate this. 
```bash
$ python scripts/get_packages_codeql.py [project slug]
```

## Supported CWEs
Here are the following CWEs supported, that you can specify as an argument to `--query` when using `src/neusym_vul.py` and `src/neusym_vul_for_query.py`. 

- `cwe-022wLLM` - [CWE-022](https://cwe.mitre.org/data/definitions/22.html) (Path Traversal)
- `cwe-078wLLM` - [CWE-078](https://cwe.mitre.org/data/definitions/78.html) (OS Command Injection)
- `cwe-079wLLM` - [CWE-079](https://cwe.mitre.org/data/definitions/79.html) (Cross-Site Scripting)
- `cwe-094wLLM` - [CWE-094](https://cwe.mitre.org/data/definitions/94.html) (Code Injection)

## Supported Models
We support the following models with our models API wrapper (found in `src/models`) in the project. Listed below are the arguments you can use for `--llm` when using `src/neusym_vul.py` and `src/neusym_vul_for_query.py`. You're free to use your own way of instantiating models or adding on to the existing library. Some of them require your own API key or license agreement on HuggingFace. 

<details>
  <summary>List of Models</summary>
  
### Codegen
- `codegen-16b-multi`
- `codegen25-7b-instruct`
- `codegen25-7b-multi`

### Codellama
#### Standard Models
- `codellama-70b-instruct`
- `codellama-34b`
- `codellama-34b-python`
- `codellama-34b-instruct`
- `codellama-13b-instruct`
- `codellama-7b-instruct`

### CodeT5p
- `codet5p-16b-instruct`
- `codet5p-16b`
- `codet5p-6b`
- `codet5p-2b`

### DeepSeek
- `deepseekcoder-33b`
- `deepseekcoder-7b`
- `deepseekcoder-v2-15b`

### Gemini
- `gemini-1.5-pro`
- `gemini-1.5-flash`
- `gemini-pro`
- `gemini-pro-vision`
- `gemini-1.0-pro-vision`

### Gemma
- `gemma-7b`
- `gemma-7b-it`
- `gemma-2b`
- `gemma-2b-it`
- `codegemma-7b-it`
- `gemma-2-27b`
- `gemma-2-9b`

### GPT
- `gpt-4`
- `gpt-3.5`
- `gpt-4-1106`
- `gpt-4-0613`

### LLaMA
#### LLaMA-2
- `llama-2-7b-chat`
- `llama-2-13b-chat`
- `llama-2-70b-chat`
- `llama-2-7b`
- `llama-2-13b`
- `llama-2-70b`

#### LLaMA-3
- `llama-3-8b`
- `llama-3.1-8b`
- `llama-3-70b`
- `llama-3.1-70b`
- `llama-3-70b-tai`

### Mistral
- `mistral-7b-instruct`
- `mixtral-8x7b-instruct`
- `mixtral-8x7b`
- `mixtral-8x22b`
- `mistral-codestral-22b`

### Qwen
- `qwen2.5-coder-7b`
- `qwen2.5-coder-1.5b`
- `qwen2.5-14b`
- `qwen2.5-32b`
- `qwen2.5-72b`

### StarCoder
- `starcoder`
- `starcoder2-15b`

### WizardLM
#### WizardCoder
- `wizardcoder-15b`
- `wizardcoder-34b-python`
- `wizardcoder-13b-python`

#### WizardLM Base
- `wizardlm-70b`
- `wizardlm-13b`
- `wizardlm-30b`

### Ollama

You need to install the `ollama` package manually.

- `qwen2.5-coder:latest`
- `qwen2.5:32b`
- `llama3.2:latest`
- `deepseek-r1:32b`
- `deepseek-r1:latest`

</details>

## [Adding CWEs](docs/adding_cwes.md)
We're always open to contributions of CWE we don't currently support. Refer to our [list of currently supported CWEs](#supported-cwes) to figure out what CWE can be added.  

## Contributing and Feedback
Feel free to address any open issues or add your own issue and fix. We love feedback! Please adhere to the following guidelines. 

1. Create a Github issue outlining the piece of work. Solicit feedback from anyone who has recently contributed to the component of the repository you plan to contribute to. 
2. Checkout a branch from `main` - preferably name your branch `[github username]/[brief description of contribution]`
3. Create a pull request that refers to the created github issue in the commit message.
4. To link to the github issue, in your commit for example you would simply add in the commit message:
[what the PR does briefly] #[commit issue]
5. Then when you push your commit and create your pull request, Github will automatically link the commit back to the issue. Add more details in the pull request, and request reviewers from anyone who has recently modified related code.
6. After 1 approval, merge your pull request.

## Citation 
Consider citing our paper:
```
@inproceedings{li2025iris,
title={LLM-Assisted Static Analysis for Detecting Security Vulnerabilities},
author={Ziyang Li and Saikat Dutta and Mayur Naik},
booktitle={International Conference on Learning Representations},
year={2025},
url={https://arxiv.org/abs/2405.17238}
}
```
[Arxiv Link](https://arxiv.org/abs/2405.17238)

## Team

IRIS is a collaborative effort between researchers at the University of Pennsylvania and Cornell University. Please reach out to us if you have questions about IRIS.

[Ziyang Li](https://liby99.github.io)

[Claire Wang](https://clairewang.net)

[Saikat Dutta](https://www.cs.cornell.edu/~saikatd)

[Mayur Naik](https://www.cis.upenn.edu/~mhnaik)

<img src="https://github.com/user-attachments/assets/37969a67-a3fd-4b4f-9be4-dfeed28d2b48" width="175" height="175" alt="Cornell University" />

<img src="https://github.com/user-attachments/assets/362abdfb-4ca4-46b2-b003-b185ce4d20af" width="300" height="200" alt="University of Pennsylvania"/>