Output Formats Reference¶
This section is under development.
Comprehensive guide to all output formats supported by PyForge CLI.
Coming Soon¶
Detailed output format specifications will be available in a future release.
Current Output Formats¶
Text (.txt)¶
- Used for: PDF conversion
- Encoding: UTF-8
- Features: Preserves line breaks and basic formatting
Parquet (.parquet)¶
- Used for: Excel, XML, MDB/ACCDB, DBF, CSV conversion
- Compression: SNAPPY (default), GZIP, LZ4, ZSTD
- Features: Column-oriented, highly compressed, fast read/write
- Data Types: String-based conversion (Phase 1 implementation)
- Schemas: Automatically inferred from source structure
Format Details¶
For detailed information about each output format, see:
- PDF to Text Converter
- Excel to Parquet Converter
- XML to Parquet Converter
- Database Files Converter
- DBF Files Converter
- CSV to Parquet Converter
Next Steps¶
- CLI Reference - Complete command documentation
- Converters - Format-specific conversion guides