Output Formats Reference¶
This section is under development.
Comprehensive guide to all output formats supported by PyForge CLI.
Coming Soon¶
Detailed output format specifications will be available in a future release.
Current Output Formats¶
Text (.txt)¶
- Used for: PDF conversion
 - Encoding: UTF-8
 - Features: Preserves line breaks and basic formatting
 
Parquet (.parquet)¶
- Used for: Excel, XML, MDB/ACCDB, DBF, CSV conversion
 - Compression: SNAPPY (default), GZIP, LZ4, ZSTD
 - Features: Column-oriented, highly compressed, fast read/write
 - Data Types: String-based conversion (Phase 1 implementation)
 - Schemas: Automatically inferred from source structure
 
Format Details¶
For detailed information about each output format, see:
- PDF to Text Converter
 - Excel to Parquet Converter
 - XML to Parquet Converter
 - Database Files Converter
 - DBF Files Converter
 - CSV to Parquet Converter
 
Next Steps¶
- CLI Reference - Complete command documentation
 - Converters - Format-specific conversion guides