GroupDocs.Metadata is a document metadata management API built for precision and compliance. Read, edit, enrich, or purge metadata across documents, images, audio, video, emails, archives, fonts, and CAD files to automate governance, privacy, and metadata cleanup workflows.
- Explore refreshed .NET examples and showcases in the GroupDocs.Metadata-for-.NET repository.
- Updated tutorials and how-tos in the GroupDocs.Metadata documentation.
- Recent articles on metadata cleaning, inspection, and governance on the GroupDocs.Metadata blog.
High-performance APIs for metadata viewing, editing, search, and sanitization.
- GroupDocs.Metadata-for-.NET: Core API for reading, updating, and cleaning metadata in C# and VB.NET.
- Examples: Ready-to-run samples with input files.
- Showcases: UI demos for quick evaluation.
// Quick .NET Metadata Sanitization Example
using System;
using GroupDocs.Metadata;

using (Metadata metadata = new Metadata("input.pdf"))
{
    // Strip detected metadata packages; the return value is the number of affected properties
    var removed = metadata.Sanitize();
    Console.WriteLine($"Properties removed: {removed}");
    metadata.Save("clean.pdf");
}

Native Java library to inspect, update, and purge metadata across formats.
- GroupDocs.Metadata-for-Java: Standard Java API and examples.
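// Quick Java Metadata Inspection
import com.groupdocs.metadata.Metadata;
import com.groupdocs.metadata.core.IDocumentInfo;

try (Metadata metadata = new Metadata("input.docx")) {
    // getDocumentInfo() returns the IDocumentInfo interface
    IDocumentInfo info = metadata.getDocumentInfo();
    System.out.println(info.getPageCount());
}

Fast Node.js integration that wraps the Java engine for cross-platform metadata workflows.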
- GroupDocs.Metadata-for-Node.js-via-Java: Runnable samples for metadata reading and cleanup.
// Quick Node.js Metadata Example
const groupdocs = require('@groupdocs/groupdocs.metadata');
(async () => {
  const metadata = new groupdocs.Metadata('input.xlsx');
  const info = metadata.getDocumentInfo();
  console.log(info.getFileType());
})();

Python scripts powered by the .NET engine to automate metadata discovery and removal.
- GroupDocs.Metadata-for-Python-via-.NET: Efficient Python samples for PDF, Word, images, and more.
# Quick Python Metadata Removal Example
from groupdocs.metadata import Metadata
with Metadata("input.docx") as metadata:
    metadata.sanitize()
    metadata.save("clean.docx")

Practical metadata workflows teams ship to production:
- Metadata compliance: Inspect, redact, and sanitize metadata before sharing documents externally.
- Digital forensics: Extract EXIF, XMP, IPTC, ID3, APE, and other tags for audit trails.
- Batch processing: Clean or enrich metadata in bulk within CI/CD pipelines.
- Content governance: Standardize titles, keywords, authors, and custom tags across repositories.
- Metadata read/write: Inspect, edit, and add metadata across 60+ formats (PDF, DOCX, XLSX, PPTX, images, audio, video, email, CAD, fonts, archives).
- Metadata removal & sanitization: Strip hidden properties, comments, digital signatures, revisions, and embedded packages.
- Search & filter: Query metadata properties by tags, categories, types, and values for precise governance.
- Security & compliance: Clean sensitive metadata to prevent leakage before distribution.
- Performance & scalability: Optimized for batch workflows and server-side processing.
- Format autodetection: Detect file format and MIME type by internal structure.
- Content insights: Extract technical info (EXIF, XMP, IPTC, Image Resources), word/character statistics, and page details.
- Documentation: Comprehensive Guides and Tutorials.
- Support: Expert help at the GroupDocs Free Support Forum.
- Evaluation: Get a Temporary License for full feature testing.
- Live Demo: Try metadata viewing and cleaning online at GroupDocs.Metadata apps.
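
The format autodetection feature listed above identifies a file by its internal structure rather than by its extension. As a rough illustration of that idea only, the sketch below checks a file's leading bytes against a few well-known signatures using plain Python. It does not use or reproduce the GroupDocs.Metadata API; the names `SIGNATURES`, `sniff_mime`, and `sniff_file` are hypothetical, and the signature table is deliberately incomplete.

```python
from typing import Optional

# A few well-known leading byte signatures ("magic numbers").
# Illustrative only: real detectors cover many more formats and look
# deeper into the file than the first few bytes.
SIGNATURES = [
    (b"%PDF-", "application/pdf"),
    (b"\x89PNG\r\n\x1a\n", "image/png"),
    (b"\xff\xd8\xff", "image/jpeg"),
    (b"GIF87a", "image/gif"),
    (b"GIF89a", "image/gif"),
    # ZIP is also the container for DOCX, XLSX, and PPTX, so telling
    # those apart would require inspecting the archive contents.
    (b"PK\x03\x04", "application/zip"),
]


def sniff_mime(header: bytes) -> Optional[str]:
    """Return a MIME type guessed from leading bytes, or None if unknown."""
    for magic, mime in SIGNATURES:
        if header.startswith(magic):
            return mime
    return None


def sniff_file(path: str) -> Optional[str]:
    """Read the first bytes of a file and guess its MIME type."""
    with open(path, "rb") as fh:
        return sniff_mime(fh.read(16))
```

With this sketch, a PDF saved as `report.txt` would still be reported as `application/pdf` by `sniff_file`, because the extension is never consulted.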
document-metadata metadata-extraction metadata-editor metadata-cleaning metadata-sanitization exif-metadata xmp-metadata iptc-metadata pdf-metadata docx-metadata metadata-governance metadata-removal metadata-automation metadata-management-api