Added ADR for architectural standards by mrrajan · Pull Request #2358 · guacsec/trustify

mrrajan · 2026-05-14T12:21:25Z

Summary

Add ADR-00018 proposing a CONVENTIONS.md file and documenting pattern standards for the Trustify project
Catalogs recurring anti-patterns found across the codebase (N+1 queries, unbounded collections, sequential independent queries, missing database indexes, uncontrolled pagination, redundant deserialization) with specific file locations and severity ratings
Presents convention options for each anti-pattern category for the team to evaluate during PR review

Rendered version https://github.com/mrrajan/trustify/blob/TC-4289/docs/adrs/00018-conventions-file.md

Summary by Sourcery

Introduce an ADR proposing a centralized CONVENTIONS.md file and cataloging key performance and coding anti‑patterns in the Trustify codebase, along with candidate standards for addressing them.

Documentation:

Add ADR-00017 documenting the rationale, scope, and maintenance process for a repository-wide CONVENTIONS.md conventions file used by contributors and AI tools.
Document a detailed analysis of recurring performance and coding anti-patterns across the codebase, including their locations and proposed convention options for remediation.

sourcery-ai · 2026-05-14T12:21:51Z

Reviewer's Guide

Adds ADR-00017 describing the introduction, purpose, and lifecycle of a repository-level CONVENTIONS.md file and documenting a detailed catalog of existing performance and coding anti-patterns in the Trustify codebase, along with convention options the team should select during review.

File-Level Changes

Change	Details	Files
Introduce ADR-00017 defining a repository-level CONVENTIONS.md file as the canonical source of coding and architectural conventions, optimized for AI-assisted development workflows.	Specify scope, content structure, and maintenance process for CONVENTIONS.md at the repo root Clarify that conventions are prescriptive, derived from existing patterns, and kept minimal and actionable Describe how AI tools (e.g., Claude Code) will consume CONVENTIONS.md and how it relates to any CLAUDE.md configuration Define governance for updating conventions via PRs and linking significant changes to ADRs	`docs/adrs/00017-conventions-file.md`
Document a comprehensive analysis of recurring performance anti-patterns in the existing codebase, with concrete examples and convention options to adopt.	Catalog N+1 queries, unbounded queries, in-memory filtering, app-side counting, sequential DB calls, missing bulk operations, unbounded recursion, extra round-trips vs JOINs, validation-plus-follow-on queries, and missing indexes For each category, explain what it is, why it matters, where it occurs (file/line-level references), and propose option sets (A/B/C) for conventions Summarize performance anti-patterns in a table to guide reviewers in selecting preferred options that will later be codified in CONVENTIONS.md	`docs/adrs/00017-conventions-file.md`
Document a comprehensive analysis of recurring coding anti-patterns in the existing codebase, with concrete examples and convention options to adopt.	Catalog swallowed errors, stringly-typed APIs, code duplication, tight module coupling, oversized functions, inconsistent tracing, missing public docs, mixed logging frameworks, magic numbers, and raw SQL patterns that defeat parameterization For each category, provide rationale, specific occurrences, and option sets (A/B/C) indicating recommended conventions versus laxer alternatives Summarize coding anti-patterns in a final table and describe how selected options should be propagated into CONVENTIONS.md in a follow-up change	`docs/adrs/00017-conventions-file.md`

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it. You can also reply to a
review comment with @sourcery-ai issue to create an issue from it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time. You can also comment
@sourcery-ai title on the pull request to (re-)generate the title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time exactly where you
want it. You can also comment @sourcery-ai summary on the pull request to
(re-)generate the summary at any time.
Generate reviewer's guide: Comment @sourcery-ai guide on the pull
request to (re-)generate the reviewer's guide at any time.
Resolve all Sourcery comments: Comment @sourcery-ai resolve on the
pull request to resolve all Sourcery comments. Useful if you've already
addressed all the comments and don't want to see them anymore.
Dismiss all Sourcery reviews: Comment @sourcery-ai dismiss on the pull
request to dismiss all existing Sourcery reviews. Especially useful if you
want to start fresh with a new review - don't forget to comment
@sourcery-ai review to trigger a new review!

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

sourcery-ai

Hey - I've found 1 issue, and left some high level feedback:

The ADR is very long and mixes the high-level decision (introducing CONVENTIONS.md) with a detailed audit of anti-patterns; consider splitting the anti-pattern catalog into a separate document (or appendix) to keep the ADR focused on the architectural decision and process.
Many examples reference approximate line numbers and specific file locations that will quickly become stale; instead of line-based pointers, consider linking to symbols or providing higher-level file/section references that will age more gracefully.
You define option sets (A/B/C) for many categories but the process for capturing the final choice in CONVENTIONS.md is only described informally; consider adding a concise, explicit section on how and where the selected options are recorded and how future ADRs should evolve or override them.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The ADR is very long and mixes the high-level decision (introducing CONVENTIONS.md) with a detailed audit of anti-patterns; consider splitting the anti-pattern catalog into a separate document (or appendix) to keep the ADR focused on the architectural decision and process.
- Many examples reference approximate line numbers and specific file locations that will quickly become stale; instead of line-based pointers, consider linking to symbols or providing higher-level file/section references that will age more gracefully.
- You define option sets (A/B/C) for many categories but the process for capturing the final choice in CONVENTIONS.md is only described informally; consider adding a concise, explicit section on how and where the selected options are recorded and how future ADRs should evolve or override them.

## Individual Comments

### Comment 1
<location path="docs/adrs/00017-conventions-file.md" line_range="107" />
<code_context>
+
+## Anti-Pattern Analysis
+
+Analysis of the trustify codebase has identified recurring anti-patterns across
+multiple modules. This section catalogs each category, lists the occurrences found, and
+presents convention options for the team to evaluate. The selected conventions will be
</code_context>
<issue_to_address>
**nitpick (typo):** Capitalize the project name "Trustify" for consistency.

This line uses lowercase "trustify" while earlier references use "Trustify"; update this occurrence to match the proper noun capitalization.

```suggestion
Analysis of the Trustify codebase has identified recurring anti-patterns across
```
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

PhilipCattanach · 2026-05-14T13:30:19Z

Thanks @mrrajan Very interesting reading
I would suggest choosing the appropriate convention for some of the performance issue will probably need tests at scale. I'm concerned that some of the potential solutions might make things worse at scale.

I'd like us to come up with a way of prioritizing our next steps. As I would like to think there is some non contentious conventions we can adopt that will address some of these things.

And we might need some Spikes to evaluate the best convention for some of the issues highlighted.

mrizzi

A first review of the methodology before getting into the anti-patterns analysis with the maintainers.

mrizzi · 2026-05-14T16:02:19Z

+### Content Principles
+
+1. **Prescriptive, not descriptive**: each convention states what to do and what to avoid, with concrete code examples
+2. **Derived from existing code**: conventions are extracted from established patterns in the codebase, not invented


Maybe the initial version but not once stable because, once fully adopted, it should be other way around, i.e. code should be derived from existing conventions

Question is, how do we deal with current code that doesn't align to those newly defined patterns?

Good question 👍

Sending random and sometimes partial PRs without JIRAs?

side note: Yesterday I asked AI to read the instrumentation section and to produce a checklist for me, not changed the code yet as some parts are subjective (example -> the size of returned struct data) + still need to validate if all the checklist is 100% correct. Imagining all this checklist is correct, all the instrumentation-related-code from all files should be updated via a single PR? Should this initiative be separated by our current workspace-crates to avoid git conflicts and long PR reviews?

Question is, how do we deal with current code that doesn't align to those newly defined patterns?

It will mean having tech debt initiatives to enforce consistency in current code not yet aligned with expected patterns and code conventions.

mrizzi · 2026-05-14T16:14:54Z

+
+### Trade-offs
+
+- **Maintenance burden**: the file must be kept in sync with evolving practices — stale conventions are worse than no conventions


I don't think this exists because, as described in the above Maintenance Process, when a convention has to change, then an ADR arrives. Once merged, the impacted code has to refactored to be aligned with the new code convention. I don't see how "stale conventions" could happen other than implementing code not aligned with the conventions, which should be considered a mistake to prevent.

To my understanding, feedback from reviews might turn in PR changing the conventions file? If that's the case, then this scenario may happen. Unless of course, a change in the conventions file is accompanied by reworking the code base.

Changes from PRs are meant to be additive to improve code conventions for identified lacks. Changing them to "adhere" to the code in a PR should be blocked otherwise code will drift from conventions.

mrizzi · 2026-05-14T16:27:03Z

+- **Occurrences found** — specific locations in the codebase
+- **Convention options** — choices for the team to decide on (to be resolved via PR review)
+
+> **How to use this section**: During PR review, maintainers should mark their preferred


@jcrossley3 @ctron @ruromero @queria please review this

ctron · 2026-05-15T06:27:25Z

I'm not sure an ADR is the right approach to this. We already have the conventions file, and this ADR would need to be copied over to that file in a second step.

Why no simply make the modifications to the conventions file and discuss this on a PR? Because what would we review on that second PR (moving content over to the conventions file)? Feels like doppelmoppel.

ctron · 2026-05-15T06:30:03Z

+
+## Context
+
+Trustify uses AI-assisted development workflows (Claude Code, Copilot, and similar tools). These tools perform best when they have access to explicit, machine-readable project conventions — coding patterns, naming rules, error-handling idioms, testing practices, and architectural norms.


As we do have some documentation (for less then we should have) for humans. I believe that this file should be usable by humans as well. Replacing the existing documents. Which means, that this file should be structured and written to serve both.

Agree! Rephrased to,

Trustify needs a single, explicit reference for coding patterns, naming rules, error-handling idioms, testing practices, and architectural norms. Contributors and reviewers use it during implementation and review; the same clarity also helps AI-assisted workflows (Claude Code, Copilot, and similar tools) when those tools load project context.

ctron · 2026-05-15T06:39:40Z

+| **Entity Model Patterns** | ORM model conventions | `DeriveEntityModel`, relations, `Linked` structs |
+| **Migration Patterns** | Database migration conventions | Idempotency guards, naming, raw SQL loading |
+| **Rust Idioms** | Preferred Rust patterns | Type inference, iterator ownership, `.zip()`, capacity |
+| **SeaORM Query Patterns** | ORM query conventions | `.is_in()`, chunking, `IntoIterator` parameters |


IntoIterator wouldn't be a SeaORM thing, but a Rust thing.

Agree - Removed

ctron · 2026-05-15T06:41:03Z

+| **Rust Idioms** | Preferred Rust patterns | Type inference, iterator ownership, `.zip()`, capacity |
+| **SeaORM Query Patterns** | ORM query conventions | `.is_in()`, chunking, `IntoIterator` parameters |
+| **Observability** | Tracing and instrumentation | `#[instrument]` usage, span conventions, error levels |
+| **Shared Table Patterns** | Concurrent insert handling | Nested transaction duplicate-key and `.on_conflict` patterns |


Not sure I understand the scope of this section. I'd prefer to have some "database" section maybe. Defining how database resources/entities are created, column types, tables, enums, indexes, ...

Agree! We should have a dedicated database section.

ctron · 2026-05-15T06:58:16Z

+1. **Prescriptive, not descriptive**: each convention states what to do and what to avoid, with concrete code examples
+2. **Derived from existing code**: conventions are extracted from established patterns in the codebase, not invented
+3. **Minimal and actionable**: each entry should be short enough that a contributor (or AI tool) can apply it without reading surrounding prose
+4. **Reference implementations**: point to specific files in the codebase as canonical examples


I think that may be tricky. As those code locations will change over time. And that would mean updating the conventions file, or dead links.

Agree! We should have just the examples not the references to the specific files. Rephrased to share canonical examples on the conventions file.

ctron · 2026-05-15T07:05:28Z

+- Claude Code automatically loads `CONVENTIONS.md` from the repository root as part of its project context
+- The file uses markdown with code blocks, making it parseable by any LLM
+- Conventions are structured as clear rules with examples, optimizing for AI instruction-following
+- When a `CLAUDE.md` file is present (for tool-specific configuration), `CONVENTIONS.md` complements it — `CONVENTIONS.md` focuses on language and framework patterns that apply regardless of the tool


I wonder how that aligns with review tools? like sourcery.ai?

Sourcery.ai applies generic best practices and CONVENTIONS.md captures project specific ones. So, project specific practices might called out by the review tools. We have to explore how to align review tools to use CONVENTIONS.md file as reference.

ctron · 2026-05-15T07:10:06Z

+## Anti-Pattern Analysis
+
+Analysis of the Trustify codebase has identified recurring anti-patterns across
+multiple modules. This section catalogs each category, lists the occurrences found, and


I see the same problem as before. If we "point" to those locations, and those locations will be gone over time (which should be the case with anti-patterns) we have a document of dead links.

Agree! The references should use function/ symbol names or example snippets in the CONVENTIONS.md file

ctron · 2026-05-15T07:11:05Z

+
+Each category below follows this structure:
+- **What it is** — description of the anti-pattern
+- **Why it matters** — performance impact


I assume "performance impact" is an example. But in general, "why" can be all kind of reasons.

Agree! rephrased to performance impact, maintainability, consistency

ctron · 2026-05-15T07:11:20Z

+Each category below follows this structure:
+- **What it is** — description of the anti-pattern
+- **Why it matters** — performance impact
+- **Occurrences found** — specific locations in the codebase


See my comment about links above.

Agree! This should be references not specific locations in codebase.

ctron · 2026-05-15T07:13:40Z

+**What it is**: Database queries executed inside loops — fetching related data one entity
+at a time instead of loading all related data in a single batch query.
+
+**Why it matters**: For a list of N entities, this generates N additional queries instead of


On the "pro" side, it may reduce code complexity. In some cases dramatically. And when it is known that "N" will always be a small number, and the operation itself is fast. Then keeping the N+1 pattern would be a good thing IMHO. It depends on the case.

ctron · 2026-05-15T07:14:59Z

+1 batch query. On API read paths serving collections, this scales linearly with result size
+and dominates response latency.
+
+**Occurrences found (5)**:


I think there are a lot more in, especially in the "analysis" module.

ctron · 2026-05-15T07:15:19Z

+
+**Occurrences found (5)**:
+
+| # | File | Line(s) | Description | Severity |


How is "severity" evaluated here?

Severity is based on,

Whether the N is bounded or not

Code path is API or internal

ctron · 2026-05-15T07:18:16Z

+
+**Convention options**:
+
+- **Option A — Batch with JOINs/IN clauses**: All collection data access MUST use batch


Maybe we should consider this: https://pganalyze.com/blog/5mins-postgres-performance-in-vs-any

ctron · 2026-05-15T07:21:14Z

+- **Option C — Keep as-is**: No convention. Performance is addressed case-by-case when
+  bottlenecks are observed. N+1 patterns are acceptable if the loop iteration count is
+  expected to be small.
+


I would say "Option D": prefer option A, but fall back to "N+1" if there are good reasons.

"C" doesn't seem to cover this, as it says "no convention". I think the should try, and try hard to remediate N+1 patterns. But not at all cost.

Sounds good! Added option D

- **Option D — Prefer batch, allow exceptions**: Default to Option A — batch loading on collection paths and remediate existing N+1 patterns as tech debt. Per-entity DB calls inside loops are permitted only when batching is impractical or costly.

ctron · 2026-05-15T07:23:26Z

+
+| # | File | Line(s) | Description |
+|---|------|---------|-------------|
+| 1 | `modules/importer/src/service.rs` | ~133 | `importer::Entity::find().all()` fetches all importers, sorts in memory |


Which I don't see as a problem, as the number of importers is considered small (below 50). Sure it may be an improvement. But in some cases it's not. Having pagination on the HTTP API would be beneficial.

ctron · 2026-05-15T07:23:53Z

+**Why it matters**: As data grows, unbounded queries consume increasing memory and network
+bandwidth. A single unbounded query on a large table can cause OOM or timeout.
+
+**Occurrences found (3)**:


Again, there is more. Especially in the analysis module.

ctron · 2026-05-15T07:26:48Z

+
+### AP-2: Unbounded Queries
+
+**What it is**: Queries that fetch all rows from a table or relation without applying


In some cases we load all data, and then keep in memory to process it. Especially as mentioned above to do things like batch loading, pre-loading, ...

In such cases it doesn't seem to make sense to use limits, as we load and store the data in-memory anyway. In fact, it would be counter productive (even batch loading). Batch loading only makes sense to limit the number of postgres parameters. Having to page through data would mean to execute the query multiple times. Compared to one single query.

Agree! lets limit only for the public API endpoints

ctron · 2026-05-15T07:28:02Z

+  must use `.limit()` or chunked iteration. Exceptions only for tables with a known small
+  upper bound (e.g., importers, which are admin-configured).
+
+- **Option B — Limit only on public API endpoints**: Public API list endpoints must be


Option B 👍

ctron · 2026-05-15T07:31:35Z

+
+| # | File | Line(s) | Description |
+|---|------|---------|-------------|
+| 1 | `modules/fundamental/src/vulnerability/model/details/mod.rs` | ~54-68 | All scores fetched via `.all(tx)`, then filtered with `.filter(\|s\| s.advisory_id == ...)` |


I'd say that's a false positive. As the code does re-use that data. But eliminates a second query by re-using that data and manually filtering out information for the first use case of the data.

Agree! It is a false positive. I will remove this example section - But as a convention for this issue, shall we suggest option B?

ctron · 2026-05-15T09:12:29Z

+
+| # | File | Line(s) | Pattern | Severity |
+|---|------|---------|---------|----------|
+| 1 | `modules/importer/src/model/mod.rs` | ~331 | `serde_json::from_value(report).ok()` — importer run report deserialization silently dropped | High |


In this case, this provides additional diagnostic information, if possible. If the serialization of that fails, that would just not be possible. Logging would be fine, but throwing an error not.

Agree! It is additional noise

ctron · 2026-05-15T09:23:29Z

+| # | File | Line(s) | Pattern | Severity |
+|---|------|---------|---------|----------|
+| 1 | `modules/importer/src/model/mod.rs` | ~331 | `serde_json::from_value(report).ok()` — importer run report deserialization silently dropped | High |
+| 2 | `modules/importer/src/server/progress.rs` | ~48, 77 | `let _ = self.service.set_progress_message(...)` — DB write failures for progress silently discarded | High |


In this case it would actually be beneficial to add this to the instrumentation. Rather than an explicit log call.

ctron · 2026-05-15T09:25:17Z

+| 1 | `modules/importer/src/model/mod.rs` | ~331 | `serde_json::from_value(report).ok()` — importer run report deserialization silently dropped | High |
+| 2 | `modules/importer/src/server/progress.rs` | ~48, 77 | `let _ = self.service.set_progress_message(...)` — DB write failures for progress silently discarded | High |
+| 3 | `modules/importer/src/server/mod.rs` | ~186 | `serde_json::to_value(report).ok()` — importer run report serialization silently dropped on `update_finish`; report data lost from database | High |
+| 4 | `modules/importer/src/model/mod.rs` | ~65 | `OffsetDateTime::from_unix_timestamp_nanos(t).ok()` — heartbeat timestamp errors silently dropped | Medium |


Here it might actually be beneficial to remove the root cause of this. Using a proper timestamp in the database, if possible.

Agree! we should fix the root cause instead!

ctron · 2026-05-15T09:28:03Z

+| 2 | `modules/importer/src/server/progress.rs` | ~48, 77 | `let _ = self.service.set_progress_message(...)` — DB write failures for progress silently discarded | High |
+| 3 | `modules/importer/src/server/mod.rs` | ~186 | `serde_json::to_value(report).ok()` — importer run report serialization silently dropped on `update_finish`; report data lost from database | High |
+| 4 | `modules/importer/src/model/mod.rs` | ~65 | `OffsetDateTime::from_unix_timestamp_nanos(t).ok()` — heartbeat timestamp errors silently dropped | Medium |
+| 5 | `modules/ingestor/src/service/advisory/csaf/loader.rs` | ~44, 48 | `published`/`modified` timestamp conversions silently become `None` | Medium |


In this case it would be beneficial to report this back to the user, as we do have a mechanism for reporting such things as "warning" during the upload. This would also end up in the report. Logging this, or failing, wouldn't help.

Same for the other CSAF related ones.

Sounds good!

ctron · 2026-05-15T09:32:53Z

+
+- **Option C — Keep as-is**: No convention. Developers judge whether an error is worth
+  logging. `.ok()` is acceptable for non-critical conversions.
+


Option D:

if we have instrumentation wrapping it, that's ok, no need for extra processing

if we have a way to report this case back to the user, leverage this, but don't extra log it

if that case prevents us from processing the data, and the data is essential, propagate the error

if running into an error is an acceptable and expected case, that's fine, do nothing

if we can ignore the error, but have no other way to handle it, log it, but try to be compact doing it, e.g. like .inspect_err() rather than a full blown match.

ctron · 2026-05-15T09:37:12Z

+
+**Convention options**:
+
+- **Option A — Enums for fixed value sets**: Any value drawn from a fixed set (statuses,


ctron · 2026-05-15T09:37:58Z

+
+**Convention options**:
+
+- **Option A — Extract shared logic into generics or traits**: When two or more modules


ctron · 2026-05-15T09:38:49Z

+  composition. However, the `fundamental` layer MUST NOT depend on `ingestor` — shared
+  types like `Deprecation` must be moved to `common/`.
+
+- **Option C — Keep as-is**: No convention. Cross-module imports are acceptable within


ctron · 2026-05-15T09:40:02Z

+  phase into a private helper method with a descriptive name. The parent function becomes
+  a coordinator that calls the helpers in sequence.
+
+- **Option B — Maximum cyclomatic complexity**: No hard line limit, but functions with


Option B. And I'd say it is ok to use inner strucs, fns, ... for doing this.

ctron · 2026-05-15T09:43:45Z

+
+- **Option C — Keep as-is**: No convention. Instrumentation is added when needed for
+  debugging. Partial coverage is acceptable.
+


Option D: Option A, but with a twist. I think this is already well described in the tracing/logging markdown file. In a nutshell, everything that is calling out to external things (DB, storage, ... ) should be wrapped in instrumentation. Everything that does not, as long as it is considered trivial, must not be wrapped. Even if things cannot be wrapped (multiple sections of loading where loading functions don't have instrumentation) use .instrument or other constructs amending this. But don't do it twice (inner vs outer). Not limited to pub, but based on logic blocks and the fact that external factors are in play.

ctron · 2026-05-15T09:44:36Z

+
+**Convention options**:
+
+- **Option A — Document all public items**: Every public struct, enum, trait, function,


Option A.

In addition, also add non-verbose comments inside functions.

ctron · 2026-05-15T09:45:11Z

+and `tracing::instrument`/`tracing::Instrument` attributes, creating inconsistent
+observability output.
+
+**Why it matters**: `tracing` and `log` have different span-awareness behavior. `log::*`


Not sure this is true, as log actually forwards to tracing.

Referring to https://docs.rs/tracing-log/latest/tracing_log/, we need logTracer::init() to have logs and tracing connected. But in our project only the test file, test-context/src/flame.rs has this configuration for setup_global_subscriber.

we are using a tracing-log feature here: https://github.com/guacsec/trustify/blob/main/common/infrastructure/Cargo.toml#L33

log actually forwards to tracing

this explains why even using log::info here https://gist.github.com/helio-frota/171ac4ac8d362827e22b66ce165f32cf?permalink_comment_id=6121576#gistcomment-6121576, I was able to see the correlation with traces using opentelemetry-appender-tracing instead of opentelemetry-appender-log 👍

Thanks for the clarification @helio-frota! updated this section.

ctron · 2026-05-15T09:45:37Z

+
+**Convention options**:
+
+- **Option A — Use `tracing::` exclusively**: All logging MUST use `tracing::` macros


Option A sounds reasonable.

ctron · 2026-05-15T09:51:39Z

+
+- **Option C — Keep as-is**: No convention. Inline values are acceptable with explanatory
+  comments.
+


Option D: It depends. In general we should have Option A. However, there are cases where it causes confusion or has other side effects.

Assume the example from above. Having 13 replaced by a const value, would require to render the string every time used. Rather than having a const struct. This seems counter productive. It would be better to keep that 13 in the string, but document around it "why" there's a 13 in there.

Same for structs:

ImporterConfiguration::Cve(CveImporter { common: CommonImporter { disabled: true, period: Duration::from_secs(300), description: Some(description.into()), labels: Default::default(), }, source: DEFAULT_SOURCE_CVEPROJECT.into(), years: HashSet::default(), start_year, }),

Having DEFAULT_SOURCE_CVEPROJECT here, imported from another module. Only being used once. Doesn't make much sense to me. It would be clearer if that would be in the same file. Unless, it would indeed be re-used. Then it could be refactored.

ctron · 2026-05-15T09:58:40Z

+
+| # | File | Line(s) | Description |
+|---|------|---------|-------------|
+| 1 | `modules/fundamental/src/vulnerability/service/mod.rs` | ~455-657 | `build_vulnerabilities_query_string` (~455-521) uses `format!()` to interpolate SQL fragments; `build_query` (~523-657) constructs sub-queries with `Statement::from_sql_and_values`, then `.to_string()`s them and concatenates with `UNION ALL` into a `Statement::from_string` |


This example might not be ideal. But it seems like a decent trade off handling the complexity. Having that replaced by something else, the question would be if the alternative is actually better,

There is no SQL injection in play here, as that's gets sorted out before.

Sounds good!

ctron · 2026-05-15T09:59:39Z

+  SeaORM's `UnionType` builder or a single parameterized CTE. `Statement::from_string`
+  is only acceptable for static SQL with no dynamic values.
+
+- **Option B — Allow with review gate**: `Statement::from_string` with dynamic content


Option B: There should be a good reason to not use Option A, and a discussion around that. But to me the example shows a valid reason to deviate from the norm.

Sounds good!

mrrajan

Thanks @ctron for your review, I have added a new section for database and addressed the review comments. I have added a new section Preferred option: under each category based on the review.

mrrajan · 2026-05-18T06:40:18Z

+| **Entity Model Patterns** | ORM model conventions | `DeriveEntityModel`, relations, `Linked` structs |
+| **Migration Patterns** | Database migration conventions | Idempotency guards, naming, raw SQL loading |
+| **Rust Idioms** | Preferred Rust patterns | Type inference, iterator ownership, `.zip()`, capacity |
+| **SeaORM Query Patterns** | ORM query conventions | `.is_in()`, chunking, `IntoIterator` parameters |


Agree - Removed

mrrajan · 2026-05-18T06:41:14Z

+| **Rust Idioms** | Preferred Rust patterns | Type inference, iterator ownership, `.zip()`, capacity |
+| **SeaORM Query Patterns** | ORM query conventions | `.is_in()`, chunking, `IntoIterator` parameters |
+| **Observability** | Tracing and instrumentation | `#[instrument]` usage, span conventions, error levels |
+| **Shared Table Patterns** | Concurrent insert handling | Nested transaction duplicate-key and `.on_conflict` patterns |


Agree! We should have a dedicated database section.

mrrajan · 2026-05-18T06:42:49Z

+1. **Prescriptive, not descriptive**: each convention states what to do and what to avoid, with concrete code examples
+2. **Derived from existing code**: conventions are extracted from established patterns in the codebase, not invented
+3. **Minimal and actionable**: each entry should be short enough that a contributor (or AI tool) can apply it without reading surrounding prose
+4. **Reference implementations**: point to specific files in the codebase as canonical examples


Agree! We should have just the examples not the references to the specific files. Rephrased to share canonical examples on the conventions file.

mrrajan · 2026-05-18T06:47:14Z

+- Claude Code automatically loads `CONVENTIONS.md` from the repository root as part of its project context
+- The file uses markdown with code blocks, making it parseable by any LLM
+- Conventions are structured as clear rules with examples, optimizing for AI instruction-following
+- When a `CLAUDE.md` file is present (for tool-specific configuration), `CONVENTIONS.md` complements it — `CONVENTIONS.md` focuses on language and framework patterns that apply regardless of the tool


Sourcery.ai applies generic best practices and CONVENTIONS.md captures project specific ones. So, project specific practices might called out by the review tools. We have to explore how to align review tools to use CONVENTIONS.md file as reference.

mrrajan · 2026-05-18T10:39:24Z

+  SeaORM's `UnionType` builder or a single parameterized CTE. `Statement::from_string`
+  is only acceptable for static SQL with no dynamic values.
+
+- **Option B — Allow with review gate**: `Statement::from_string` with dynamic content


Sounds good!

mrrajan · 2026-05-18T12:55:54Z

+
+## Context
+
+Trustify uses AI-assisted development workflows (Claude Code, Copilot, and similar tools). These tools perform best when they have access to explicit, machine-readable project conventions — coding patterns, naming rules, error-handling idioms, testing practices, and architectural norms.


Agree! Rephrased to,

Trustify needs a single, explicit reference for coding patterns, naming rules, error-handling idioms, testing practices, and architectural norms. Contributors and reviewers use it during implementation and review; the same clarity also helps AI-assisted workflows (Claude Code, Copilot, and similar tools) when those tools load project context.

mrrajan · 2026-05-18T12:58:30Z

+
+### AP-2: Unbounded Queries
+
+**What it is**: Queries that fetch all rows from a table or relation without applying


Agree! lets limit only for the public API endpoints

mrrajan · 2026-05-18T12:59:00Z

+| 1 | `modules/fundamental/src/vulnerability/model/details/mod.rs` | ~54-68 | All scores fetched via `.all(tx)`, then filtered with `.filter(\|s\| s.advisory_id == ...)` |
+| 2 | `modules/fundamental/src/advisory/model/summary.rs` | ~40-67 | Bulk scores fetched, then O(advisories x vulns x scores) in-memory iteration |
+
+**Convention options**:


mrrajan · 2026-05-18T13:33:53Z

+and `tracing::instrument`/`tracing::Instrument` attributes, creating inconsistent
+observability output.
+
+**Why it matters**: `tracing` and `log` have different span-awareness behavior. `log::*`


Referring to https://docs.rs/tracing-log/latest/tracing_log/, we need logTracer::init() to have logs and tracing connected. But in our project only the test file, test-context/src/flame.rs has this configuration for setup_global_subscriber.

Signed-off-by: mrrajan <86094767+mrrajan@users.noreply.github.com.>

…ntions Signed-off-by: mrrajan <86094767+mrrajan@users.noreply.github.com.>

helio-frota · 2026-05-19T18:56:19Z

+- **Option A — Extract shared logic into generics or traits**: When two or more modules
+  implement the same logic pattern (e.g., label CRUD, license filtering), extract it into
+  a shared generic function, trait, or macro parameterized by the entity type. Duplicated
+  logic blocks longer than ~20 lines are prohibited.


informational only:

I quickly modified my personal duplicate code segment tool to focus on 20+ lines and found only these: https://gist.github.com/helio-frota/87f1a9ab12fe84480f2574bc7b92e8a7
( I'm skipping tests,test,entity,migration,benches )

ctron

Looks good to me. With the exception that I still see the actual content (AP, …) My understanding was, that his would be removed from this ADR, but then added in another PR to the conventions file.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

github-project-automation Bot added this to Trustify May 14, 2026

mrrajan marked this pull request as ready for review May 14, 2026 12:22

sourcery-ai Bot reviewed May 14, 2026

View reviewed changes

Comment thread docs/adrs/00017-conventions-file.md Outdated

mrrajan force-pushed the TC-4289 branch 2 times, most recently from 156ce43 to 79bfc3b Compare May 14, 2026 12:32

mrizzi reviewed May 14, 2026

View reviewed changes

ctron reviewed May 15, 2026

View reviewed changes

Comment thread docs/adrs/00018-conventions-file.md Outdated

ctron reviewed May 15, 2026

View reviewed changes

Comment thread docs/adrs/00018-conventions-file.md

ctron reviewed May 15, 2026

View reviewed changes

mrrajan commented May 18, 2026

View reviewed changes

mrrajan force-pushed the TC-4289 branch 2 times, most recently from 00239e8 to 9c838c3 Compare May 18, 2026 13:40

mrrajan added 3 commits May 19, 2026 17:31

Added ADR for architectural standards

be32fa5

Signed-off-by: mrrajan <86094767+mrrajan@users.noreply.github.com.>

Rename ADR file

f12c45f

Signed-off-by: mrrajan <86094767+mrrajan@users.noreply.github.com.>

Addressed review comments, Added preferred options and database conve…

cb3dee5

…ntions Signed-off-by: mrrajan <86094767+mrrajan@users.noreply.github.com.>

mrrajan mentioned this pull request May 19, 2026

docs: Update CONVENTIONS.md file #2366

Merged

helio-frota reviewed May 19, 2026

View reviewed changes

ctron reviewed May 20, 2026

View reviewed changes

Comment thread docs/adrs/00018-conventions-file.md Outdated

ctron requested changes May 20, 2026

View reviewed changes

Fix false positives, stale conventions text, and log/tracing accuracy

9f54f00

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

mrrajan force-pushed the TC-4289 branch from 9c838c3 to 9f54f00 Compare May 21, 2026 13:32

mrrajan requested a review from ctron May 28, 2026 09:36

mrrajan enabled auto-merge May 28, 2026 09:36


		### Trade-offs

		- Maintenance burden: the file must be kept in sync with evolving practices — stale conventions are worse than no conventions


		## Context

		Trustify uses AI-assisted development workflows (Claude Code, Copilot, and similar tools). These tools perform best when they have access to explicit, machine-readable project conventions — coding patterns, naming rules, error-handling idioms, testing practices, and architectural norms.


		Occurrences found (5):

		\| # \| File \| Line(s) \| Description \| Severity \|


		Convention options:

		- Option A — Batch with JOINs/IN clauses: All collection data access MUST use batch


		### AP-2: Unbounded Queries

		What it is: Queries that fetch all rows from a table or relation without applying


		- Option C — Keep as-is: No convention. Developers judge whether an error is worth
		logging. `.ok()` is acceptable for non-critical conversions.


		Convention options:

		- Option A — Enums for fixed value sets: Any value drawn from a fixed set (statuses,


		Convention options:

		- Option A — Extract shared logic into generics or traits: When two or more modules


		- Option C — Keep as-is: No convention. Instrumentation is added when needed for
		debugging. Partial coverage is acceptable.


		Convention options:

		- Option A — Document all public items: Every public struct, enum, trait, function,


		Convention options:

		- Option A — Use `tracing::` exclusively: All logging MUST use `tracing::` macros


		- Option C — Keep as-is: No convention. Inline values are acceptable with explanatory
		comments.

Conversation

mrrajan commented May 14, 2026 • edited by mrizzi Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Summary by Sourcery

Uh oh!

sourcery-ai Bot commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviewer's Guide

File-Level Changes

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

PhilipCattanach commented May 14, 2026

Uh oh!

mrizzi left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mrizzi May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ctron commented May 15, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

mrrajan commented May 14, 2026 •

edited by mrizzi

Loading

sourcery-ai Bot commented May 14, 2026 •

edited

Loading

mrizzi May 20, 2026 •

edited

Loading