A small library for detecting the type of a file based on header signature (also known as magic number) rather than file extension. It is designed with extensibility in mind, so that recognised formats can be added easily.
FileSignatures is available on NuGet, so can be installed via the Package Manager:
Install-Package FileSignatures
Create an instance of the FileFormatInspector class, then pass it a stream to your file:
var inspector = new FileFormatInspector();
var format = inspector.DetermineFileFormat(stream);
This will return a FileFormat instance which contains the signature and media type of the recognised format, or null if a matching format could not be determined.
Because the formats are defined as a type hierarchy, you can either check for a specific type if you want to work with a particular format, or the base type if you are interested in multiple formats.
var format = inspector.DetermineFileFormat(stream);
if(format is Pdf) {
// Just matches Pdf
}
if(format is OfficeOpenXml) {
// Matches Word, Excel, Powerpoint
}
if(format is Image) {
// Matches any image format
}
See the examples for a sample web application and a console application which demonstrate how to filter uploads by a particular format and retrieve the signature details for a file.
Currently, the following formats are built-in:
Name | Media-Type | Extension |
---|---|---|
Bitmap | image/bitmap | .bmp |
Excel | application/vnd.openxmlformats-officedocument.spreadsheetml.sheet | .xlsx |
Excel 97-2003 | application/vnd.ms-excel | .xls |
Windows Executable | application/octet-stream | .exe |
GIF | image/gif | .gif |
JPEG | image/jpeg | .jpeg |
application/pdf | ||
PNG | image/png | .png |
PowerPoint | application/vnd.openxmlformats-officedocument.presentationml.presentation | .pptx |
Powerpoint 97-2003 | application/vnd.ms-powerpoint | .ppt |
Rich Text Format | application/rtf | .rtf |
TIFF | image/tiff | .tif |
Word | application/vnd.openxmlformats-officedocument.wordprocessingml.document | .docx |
Word 97-2003 | application/msword | .doc |
Xps | application/vnd.ms-xpsdocument | .xps |
Zip | application/zip | .zip |
Create a new class (or many classes) which inherit from FileFormat
to implement a custom format. Next, pass a collection of recognised formats to the constructor of FileFormatInspector
, being sure to include your custom format.
The FileFormatLocator
class can be used to load all custom formats located within an assembly:
var assembly = typeof(CustomFileFormat).GetTypeInfo().Assembly;
// Just the formats defined in the assembly containing CustomFileFormat
var customFormats = FileFormatLocator.GetFormats(assembly);
// Formats defined in the assembly and all the defaults
var allFormats = FileFormatLocator.GetFormats(assembly, true);
Using this method, you can continue to create custom formats and they will automatically be included into the recognised formats without any additional configuration.
This project is licensed under the MIT license.