Add docket number component parsing to PACER parsers#1681
Draft
Add docket number component parsing to PACER parsers#1681
Conversation
Add parsing for docket number components (office code, case type, judge initials, and defendant number) to DownloadConfirmationPage parser. - Add BaseDocketReport inheritance to access parsing methods - Extract and parse docket number into components - Update 21 test JSON files with new fields - Appellate courts return null for all component fields Addresses #1093 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Add extraction and parsing of docket number components to MobileQuery parser. - Add _get_docket_number() method to extract from HTML - Parse docket number into components (office code, case type, judge initials, defendant number) - Update test JSON file with new fields Addresses #1093 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Add extraction and parsing of docket number components to ListOfCreditors parser. - Add _get_docket_number() method to extract from receipt table - Parse docket number into components (office code, case type, judge initials, defendant number) - Update 3 test JSON files with new fields Addresses #1093 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Add docket number fields to AttachmentPage parser output for API consistency, though attachment pages don't contain docket numbers. - Add BaseDocketReport inheritance - Add _get_docket_number() method (returns None) - Return null for all docket number component fields - Update 188 test JSON files with null docket fields Addresses #1093 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
Add changelog entry for new PACER parser features that extract and parse docket number components. Addresses #1093 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds parsing for docket number components (office code, case type, judge initials, and defendant number) to PACER parsers that were previously missing this functionality.
Changes
This PR adds 6 new fields to the output of 4 PACER parsers:
docket_number- The full docket number stringfederal_dn_office_code- Office/division code (e.g., "3")federal_dn_case_type- Case type (e.g., "cv", "cr", "bk")federal_dn_judge_initials_assigned- Assigned judge initialsfederal_dn_judge_initials_referred- Referred judge initials (if any)federal_defendant_number- Defendant number (for criminal cases)Modified Parsers
Test Updates
Updated 212 test JSON files:
Implementation Details
_parse_dn_components()method fromBaseDocketReportclassTesting
All existing tests pass with the updated expected output:
Closes #1093
Generated with Claude Code