Fix: Handle malformed CSV lines in WikiInfo parser#833
Fix: Handle malformed CSV lines in WikiInfo parser#833Annu881 wants to merge 1 commit intodbpedia:masterfrom
Conversation
…ield count mismatch or invalid lang codes)
📝 WalkthroughWalkthroughThe WikiInfo.scala parser has been modified to gracefully handle malformed CSV lines by replacing exception-throwing logic with guarded early returns. When encountering lines with fewer than 15 fields or invalid language codes, the parser now logs warnings and returns None, allowing processing to continue rather than aborting. Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes 🚥 Pre-merge checks | ✅ 5✅ Passed checks (5 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing touches🧪 Generate unit tests (beta)
No actionable comments were generated in the recent review. 🎉 🧹 Recent nitpick comments
Tip Issue Planner is now in beta. Read the docs and try it out! Share your feedback on Discord. Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|



Fixes #831
Changes
WikiInfo.scalato validate the field count before parsing each line.Noneinstead of throwing exceptions, allowing the extraction pipeline to continue gracefully.Testing
download.test.properties(Yiddish wiki).Summary by CodeRabbit