[NEW DATA EXTENSION] Kotlin HTTP4k (core, format-jackson, format-moshi, format-gson, multipart) by Copilot · Pull Request #26 · testing-felickz/codeql-development-template

Copilot · 2026-03-26T21:11:55Z

📝 Data Extension Information

Language: java
Extension Name(s): http4k-core.model.yml, http4k-format-jackson.model.yml, http4k-format-moshi.model.yml, http4k-format-gson.model.yml, http4k-multipart.model.yml
Extension Types: sourceModel, sinkModel, summaryModel, neutralModel
Target Library/Framework: http4k
Library Modules Covered: org.http4k.core (http4k-core), org.http4k.format (http4k-format-jackson, http4k-format-moshi, http4k-format-gson), org.http4k.lens (http4k-multipart)

🎯 Description

What This Data Extension Models

Comprehensive models for the http4k Kotlin HTTP toolkit, with one model file per external API artifact:

http4k-core (http4k-core.model.yml):

Sources: Request.query(String), Request.queries(String), Request.header(String), Request.headerValues(String), Request.bodyString(), Request.getUri() — all as remote input sources
Sinks: Response.body(String) for html/js-injection, Response.header(String,String) for response-splitting (Argument[0..1]) and request-forgery (Argument[1])
Summaries: Immutable builder pattern taint propagation (Argument[this] → ReturnValue as value, argument values → ReturnValue as taint) for Request/Response method chains

http4k-format-jackson (http4k-format-jackson.model.yml):

Sinks: ConfigurableJackson.asA() for unsafe-deserialization (String and InputStream overloads)
Summaries: Taint propagation through deserialization (Argument[0] → ReturnValue) and JSON string parsing via asJsonObject()

http4k-format-moshi (http4k-format-moshi.model.yml):

Sinks: ConfigurableMoshi.asA() for unsafe-deserialization (String and InputStream overloads)
Summaries: Taint propagation through deserialization (Argument[0] → ReturnValue) and JSON string parsing via asJsonObject()

http4k-format-gson (http4k-format-gson.model.yml):

Sinks: ConfigurableGson.asA() for unsafe-deserialization (String and InputStream overloads)
Summaries: Taint propagation through deserialization (Argument[0] → ReturnValue) and JSON string parsing via asJsonObject()

http4k-multipart (http4k-multipart.model.yml):

Sources: MultipartFormBody.field/fields/fieldValue/fieldValues/file/files for form data, MultipartFormFile.getFilename()/getContent() for uploaded file metadata and content, MultipartFormField.getValue() for field values — all as remote sources
Summaries: MultipartFormBody.from() taint propagation from HttpMessage to parsed body

Threat Model

remote

Example Vulnerable Code

// XSS: unsanitized query param reflected in response body
fun handleGreet(request: Request): Response {
    val name = request.query("name") ?: "Guest"
    return Response(Status.OK).body("<h1>Hello, $name!</h1>")
}

// SQL Injection: query param flows into raw SQL
fun handleSearch(request: Request): Response {
    val query = request.query("q") ?: ""
    val sql = "SELECT * FROM products WHERE name LIKE '%$query%'"
    statement.executeQuery(sql)
    // ...
}

// Unsafe deserialization: untrusted JSON parsed via Jackson
fun handleJson(request: Request): Response {
    val obj = Jackson.asA<UserData>(request.bodyString())
    // ...
}

// Path traversal: multipart file upload with attacker-controlled filename
fun handleUpload(request: Request): Response {
    val form = MultipartFormBody.from(request)
    val file = form.file("upload")!!
    File("/uploads/${file.filename}").writeBytes(file.content.readBytes())
    // ...
}

Example Safe Code

// Parameterized query prevents SQL injection
fun handleSearch(request: Request): Response {
    val query = request.query("q") ?: ""
    val stmt = conn.prepareStatement("SELECT * FROM products WHERE name LIKE ?")
    stmt.setString(1, "%$query%")
    // ...
}

// Sanitized filename prevents path traversal
fun handleUpload(request: Request): Response {
    val form = MultipartFormBody.from(request)
    val file = form.file("upload")!!
    val safeName = File(file.filename).name // strip path components
    File("/uploads/$safeName").writeBytes(file.content.readBytes())
    // ...
}

📦 Extension Details

Extension YAML

http4k-core.model.yml:

extensions:
  - addsTo:
      pack: codeql/java-all
      extensible: sourceModel
    data:
      - ["org.http4k.core", "Request", True, "query", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "Request", True, "queries", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "Request", True, "header", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "Request", True, "headerValues", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "Request", True, "bodyString", "()", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "Request", True, "getUri", "()", "", "ReturnValue", "remote", "manual"]
  - addsTo:
      pack: codeql/java-all
      extensible: sinkModel
    data:
      - ["org.http4k.core", "Response", True, "body", "(String)", "", "Argument[0]", "html-injection", "manual"]
      - ["org.http4k.core", "Response", True, "body", "(String)", "", "Argument[0]", "js-injection", "manual"]
      - ["org.http4k.core", "Response", True, "header", "(String,String)", "", "Argument[0..1]", "response-splitting", "manual"]
      - ["org.http4k.core", "Response", True, "header", "(String,String)", "", "Argument[1]", "request-forgery", "manual"]

http4k-format-jackson.model.yml:

extensions:
  - addsTo:
      pack: codeql/java-all
      extensible: sinkModel
    data:
      - ["org.http4k.format", "ConfigurableJackson", True, "asA", "(String,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]
      - ["org.http4k.format", "ConfigurableJackson", True, "asA", "(InputStream,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]

http4k-format-moshi.model.yml:

extensions:
  - addsTo:
      pack: codeql/java-all
      extensible: sinkModel
    data:
      - ["org.http4k.format", "ConfigurableMoshi", True, "asA", "(String,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]
      - ["org.http4k.format", "ConfigurableMoshi", True, "asA", "(InputStream,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]

http4k-format-gson.model.yml:

extensions:
  - addsTo:
      pack: codeql/java-all
      extensible: sinkModel
    data:
      - ["org.http4k.format", "ConfigurableGson", True, "asA", "(String,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]
      - ["org.http4k.format", "ConfigurableGson", True, "asA", "(InputStream,Class)", "", "Argument[0]", "unsafe-deserialization", "manual"]

http4k-multipart.model.yml:

extensions:
  - addsTo:
      pack: codeql/java-all
      extensible: sourceModel
    data:
      - ["org.http4k.core", "MultipartFormBody", True, "file", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.core", "MultipartFormBody", True, "fieldValue", "(String)", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.lens", "MultipartFormFile", True, "getFilename", "()", "", "ReturnValue", "remote", "manual"]
      - ["org.http4k.lens", "MultipartFormFile", True, "getContent", "()", "", "ReturnValue", "remote", "manual"]

Access Path Explanation

Model	Access Path	Rationale
`Request.query` → `ReturnValue`	Query param string returned to caller	Standard remote source pattern
`Request.header` → `ReturnValue`	HTTP header value returned to caller	Headers can be attacker-controlled
`Request.bodyString` → `ReturnValue`	Request body as string	Full request body is user-controlled
`Request.getUri` → `ReturnValue`	Full URI object	URI path/query is user-controlled
`Response.header` → `Argument[0..1]`	Both header name and value	Newline injection in either enables response splitting
`Response.header` → `Argument[1]`	Header value for request-forgery	e.g., `Location` header for SSRF/redirect
`Response.body` → `Argument[0]`	String written to HTTP response body	XSS vector
`ConfigurableJackson.asA` → `Argument[0]`	Untrusted input deserialized	Unsafe deserialization (CWE-502)
`MultipartFormBody.file` → `ReturnValue`	Uploaded file from multipart form	File uploads are classic attack vector
`MultipartFormFile.getFilename` → `ReturnValue`	Attacker-controlled filename	Path traversal risk
`MultipartFormFile.getContent` → `ReturnValue`	Attacker-controlled file content	Malicious file content

Note: The issue originally specified Argument[0] for header, but this was corrected to Argument[1] since the header value (second parameter) is what's exploitable for request-forgery. Response-splitting uses Argument[0..1] since newline injection in either name or value is dangerous.

🧪 Testing

Extension YAML resolves without errors
Database created with sample code (codeql database create or codeql test extract)
Single query verified with extension applied (codeql query run --additional-packs=<model-pack-dir>)
Unit tests pass with extension applied (codeql test run --additional-packs=<model-pack-dir>)
Positive test cases (vulnerable code detected)
Negative test cases (safe code not flagged)

📋 Checklist

Extension YAML is valid and properly formatted
Extension placed in correct location (languages/java/custom/src/)
qlpack.yml includes dataExtensions configuration
Documentation/comments included in YAML
One model file per external API artifact
Access paths verified via API graph queries
No false positives in test cases

🔗 References

https://www.http4k.org/
http4k core reference
http4k JSON format reference
http4k multipart forms guide
CWE-79 (XSS), CWE-89 (SQL Injection), CWE-502 (Deserialization), CWE-918 (SSRF), CWE-113 (HTTP Response Splitting), CWE-22 (Path Traversal)

Note: This data extension was developed following CodeQL Models as Data best practices.

⌨️ Start Copilot coding agent tasks without leaving your editor — available in VS Code, Visual Studio, JetBrains IDEs and Eclipse.

Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/c2b323fa-771e-4a4d-8876-fa6a463ca2bc Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

…[0] (header name) Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/c2b323fa-771e-4a4d-8876-fa6a463ca2bc Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

felickz · 2026-03-30T13:35:38Z

@copilot evaluate the docs for http4k core and model any additional remote/local sources and sinks for this library https://www.http4k.org/api/http4k-core/index.html

Also add models for http4k-format-jackson / http4k-format-moshi / http4k-format-gson for Deserialization sinks/sources. Untrusted JSON/XML is parsed

http4k-multipart for File uploads — a classic source of malicious input (path traversal, file type abuse, size bombs).

…, and multipart sources Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/08129f00-891a-47a8-b941-04ebad32a61f Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

…rgument[0..1]) Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/08129f00-891a-47a8-b941-04ebad32a61f Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

Copilot · 2026-03-30T14:12:16Z

@copilot evaluate the docs for http4k core and model any additional remote/local sources and sinks for this library https://www.http4k.org/api/http4k-core/index.html

Also add models...

Expanded the data extensions across three model files in commits c7b9b78 and b36259f:

http4k-core (http4k.model.yml):

Added remote sources: Request.header(String), Request.headerValues(String), Request.bodyString(), Request.queries(String), Request.getUri()
Added response-splitting sink on Response.header() covering both Argument[0..1]
Added summary models for immutable builder pattern taint propagation through method chains

http4k-format (http4k-format.model.yml):

ConfigurableJackson, ConfigurableMoshi, ConfigurableGson .asA() methods modeled as unsafe-deserialization sinks
Taint summary models for asA() and asJsonObject() so tainted JSON input propagates to deserialized objects

http4k-multipart (http4k-multipart.model.yml):

MultipartFormBody.field/fields/fieldValue/fieldValues/file/files as remote sources
MultipartFormFile.getFilename() and .getContent() as remote sources (path traversal, malicious content)
MultipartFormField.getValue() as remote source
MultipartFormBody.from() summary for taint propagation from HttpMessage

felickz · 2026-03-31T20:51:44Z

@copilot

The models should be stored in a series of YAML data extension files, one for each external API. For example:

    jmh-core.model.yml                  # models calls to jmh-core@1.20
    rocksdbjni.model.yml                # models calls to rocksdbjni@7.7.3

…-jackson, http4k-format-moshi, http4k-format-gson, http4k-multipart Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/845edef4-ae1e-46d7-a82b-6fbcfeb02871 Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

Copilot · 2026-03-31T20:58:34Z

@copilot

The models should be stored in a series of YAML data extension files, one for each external API. For example:
...

Reorganized in commit 3f4b098. The model files are now split one per external API artifact:

http4k-core.model.yml — core Request/Response models
http4k-format-jackson.model.yml — Jackson deserialization models
http4k-format-moshi.model.yml — Moshi deserialization models
http4k-format-gson.model.yml — Gson deserialization models
http4k-multipart.model.yml — multipart form/file upload models

Initial plan

1182930

Copilot AI assigned Copilot and felickz Mar 26, 2026

Copilot started work on behalf of felickz March 26, 2026 21:12 View session

Copilot AI linked an issue Mar 26, 2026 that may be closed by this pull request

[Data Extension Create]: Kotlin HTTP4k #25

Open

1 task

Copilot AI and others added 2 commits March 26, 2026 21:21

Add HTTP4k data extension with source and sink models

10ff588

Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/c2b323fa-771e-4a4d-8876-fa6a463ca2bc Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

Fix header sink to use Argument[1] (header value) instead of Argument…

e012e18

…[0] (header name) Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/c2b323fa-771e-4a4d-8876-fa6a463ca2bc Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add data extension for http4k in Java~~ [NEW DATA EXTENSION] Kotlin HTTP4k Mar 26, 2026

Copilot AI requested a review from felickz March 26, 2026 21:24

Copilot finished work on behalf of felickz March 26, 2026 21:24

Copilot started work on behalf of felickz March 30, 2026 13:35 View session

Copilot AI and others added 2 commits March 30, 2026 14:09

Extend response-splitting sink to cover both header name and value (A…

b36259f

…rgument[0..1]) Agent-Logs-Url: https://github.com/testing-felickz/codeql-development-template/sessions/08129f00-891a-47a8-b941-04ebad32a61f Co-authored-by: felickz <1760475+felickz@users.noreply.github.com>

Copilot AI changed the title ~~[NEW DATA EXTENSION] Kotlin HTTP4k~~ [NEW DATA EXTENSION] Kotlin HTTP4k (core, format, multipart) Mar 30, 2026

Copilot finished work on behalf of felickz March 30, 2026 14:17

Copilot started work on behalf of felickz March 31, 2026 20:51 View session

Copilot AI changed the title ~~[NEW DATA EXTENSION] Kotlin HTTP4k (core, format, multipart)~~ [NEW DATA EXTENSION] Kotlin HTTP4k (core, format-jackson, format-moshi, format-gson, multipart) Mar 31, 2026

Copilot finished work on behalf of felickz March 31, 2026 21:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NEW DATA EXTENSION] Kotlin HTTP4k (core, format-jackson, format-moshi, format-gson, multipart)#26

[NEW DATA EXTENSION] Kotlin HTTP4k (core, format-jackson, format-moshi, format-gson, multipart)#26
Copilot wants to merge 6 commits intomainfrom
copilot/add-http4k-data-extension

Copilot AI commented Mar 26, 2026 •

edited

Loading

Uh oh!

felickz commented Mar 30, 2026

Uh oh!

Copilot AI commented Mar 30, 2026

Uh oh!

felickz commented Mar 31, 2026

Uh oh!

Copilot AI commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📝 Data Extension Information

🎯 Description

What This Data Extension Models

Threat Model

Example Vulnerable Code

Example Safe Code

📦 Extension Details

Extension YAML

Access Path Explanation

🧪 Testing

📋 Checklist

🔗 References

Uh oh!

felickz commented Mar 30, 2026

Uh oh!

Copilot AI commented Mar 30, 2026

Uh oh!

felickz commented Mar 31, 2026

Uh oh!

Copilot AI commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Mar 26, 2026 •

edited

Loading