Best CSV Compression Tool
Up to 99% smaller than LZMA. 128 wins, 0 losses across real-world CSV files. Lossless, verified, free to try.
How PZIP Compresses CSV
PZIP rotates your table 90 degrees — compressing each column independently with a type-specific strategy. Timestamps, numbers, categorical fields, and strings each get specialized treatment. The result: structure that generic compressors miss is captured and compressed far more efficiently.
Codec: Columnar codecHistory of CSV
CSV (Comma-Separated Values) originated in early IBM mainframe systems for data interchange. It became the universal tabular data exchange format, standardized as RFC 4180 in 2005. Today CSV remains the #1 format for open data portals, scientific datasets, and financial data exports.
Compression Timeline
CSV format emerges on IBM mainframes
Lotus 1-2-3 popularizes CSV for spreadsheets
RFC 4180 standardizes CSV format
zstd created at Facebook (general-purpose)
Brotli released by Google (web-focused)
PZIP achieves 71.5% better than LZMA on CSV with columnar codec
Real-World Benchmark Results
Every file tested with LZMA-9 (maximum compression) as baseline. Round-trip correctness verified on every file.
| File | Size | PZIP vs LZMA | Result | Download |
|---|---|---|---|---|
| Classifications.csv | 427.7 KB | -99% | WIN | Source |
| Constituents.csv | 1.9 MB | -96.7% | WIN | Source |
| Media_primary.csv | 1.8 MB | -96.3% | WIN | Source |
| mushrooms.csv | 365.2 KB | -96.1% | WIN | Source |
| covid_us_counties.csv | 1.9 MB | -95.4% | WIN | Source |
| quant.csv | 729.8 KB | -95.1% | WIN | Source |
| bob_ross.csv | 207.4 KB | -94.7% | WIN | Source |
| families.csv | 671.8 KB | -94.7% | WIN | Source |
| consumer_complaints_26k.csv | 4.5 MB | -94.6% | WIN | Source |
| nyc_baby_names.csv | 1.9 MB | -94.6% | WIN | Source |
| Medium.csv | 451.2 KB | -94.4% | WIN | Source |
| Geography.csv | 545.8 KB | -94.3% | WIN | Source |
| census_income.csv | 3.7 MB | -93.9% | WIN | Source |
| fivethirtyeight_steak_survey.csv | 61.2 KB | -93.9% | WIN | Source |
| jsonl2csv_ecb_exchange_rates.csv | 1.4 MB | -93.5% | WIN | Source |
| exchange_rates.csv | 1.9 MB | -93.4% | WIN | Source |
| titanic_seaborn.csv | 55.7 KB | -93.4% | WIN | Source |
| adult_income.csv | 1.9 MB | -93.3% | WIN | Source |
| ssa_names_national.csv | 1.9 MB | -93.1% | WIN | Source |
| lemurs.csv | 1.9 MB | -92.6% | WIN | Source |
| jsonl2csv_fivethirtyeight_bob_ross.csv | 65.2 KB | -92.5% | WIN | Source |
| bob_ross_538.csv | 64.8 KB | -92.4% | WIN | Source |
| olympics.csv | 1.9 MB | -92.3% | WIN | Source |
| jsonl2csv_hadley_baby_names.csv | 1.2 MB | -92.2% | WIN | Source |
| natural_gas_prices.csv | 120.5 KB | -92.2% | WIN | Source |
| jsonl2csv_fivethirtyeight_congress_age.csv | 1.3 MB | -92.1% | WIN | Source |
| fivethirtyeight_congress.csv | 1.3 MB | -92% | WIN | Source |
| jhu_covid_deaths_global.csv | 1.2 MB | -92% | WIN | Source |
| nytimes_covid_states.csv | 2.1 MB | -92% | WIN | Source |
| covid_us_states.csv | 1.9 MB | -91.7% | WIN | Source |
| stress_10k.csv | 1.3 MB | -91.2% | WIN | Source |
| jsonl2csv_gold_prices.csv | 36.9 KB | -90.6% | WIN | Source |
| taxis.csv | 849.0 KB | -90.6% | WIN | Source |
| global_temp_monthly.csv | 82.0 KB | -90.5% | WIN | Source |
| gold_prices_monthly.csv | 34.7 KB | -90.5% | WIN | Source |
| coffee_ratings.csv | 589.5 KB | -90.3% | WIN | Source |
| penguins_raw.csv | 51.9 KB | -89.8% | WIN | Source |
| population.csv | 526.0 KB | -89.6% | WIN | Source |
| nba_players.csv | 162.5 KB | -89.5% | WIN | Source |
| inflation_us.csv | 28.0 KB | -89.4% | WIN | Source |
| daily-temperatures.csv | 66.3 KB | -89.3% | WIN | Source |
| Titles.csv | 1.2 MB | -89.2% | WIN | Source |
| comic_characters.csv | 1.1 MB | -89.2% | WIN | Source |
| tornados.csv | 1.9 MB | -88.8% | WIN | Source |
| uber_rides_raw.csv | 1.9 MB | -88.6% | WIN | Source |
| jsonl2csv_fivethirtyeight_comic_marvel.csv | 2.3 MB | -88.4% | WIN | Source |
| slaughter_house.csv | 1.9 MB | -88.4% | WIN | Source |
| diamonds.csv | 1.9 MB | -88.3% | WIN | Source |
| fivethirtyeight_marvel_wikia.csv | 2.3 MB | -88.3% | WIN | Source |
| volcano_eruptions.csv | 1.2 MB | -87.9% | WIN | Source |
| us_zip_codes.csv | 1.2 MB | -87.5% | WIN | Source |
| drugs_fda.csv | 1.7 MB | -87.4% | WIN | Source |
| Cxx17Issues.csv | 63.0 KB | -87.3% | WIN | Source |
| pset_definitions.csv | 117.6 KB | -87.3% | WIN | Source |
| numbats.csv | 182.7 KB | -87.1% | WIN | Source |
| Cxx23Issues.csv | 63.0 KB | -87% | WIN | Source |
| plastic_pollution.csv | 724.6 KB | -86.8% | WIN | Source |
| water_access.csv | 1.9 MB | -86.8% | WIN | Source |
| astronauts.csv | 213.9 KB | -86.5% | WIN | Source |
| Cxx20Issues.csv | 62.3 KB | -86.4% | WIN | Source |
| jsonl2csv_jhu_covid_confirmed_global.csv | 1.7 MB | -85.8% | WIN | Source |
| jhu_covid_confirmed.csv | 1.7 MB | -85.7% | WIN | Source |
| Cxx2cIssues.csv | 51.8 KB | -85.6% | WIN | Source |
| Cxx2cPapers.csv | 32.0 KB | -84.9% | WIN | Source |
| fips_codes.csv | 77.5 KB | -83.9% | WIN | Source |
| gps_track_10k.csv | 546.2 KB | -83.9% | WIN | Source |
| ufo_sightings.csv | 1.9 MB | -83.8% | WIN | Source |
| wapo_police_shootings.csv | 1.8 MB | -83.6% | WIN | Source |
| co2_emissions.csv | 1.9 MB | -83.1% | WIN | Source |
| us_broadband.csv | 104.1 KB | -83.1% | WIN | Source |
| wine_quality_red.csv | 98.6 KB | -82.9% | WIN | Source |
| cpuid.csv | 86.2 KB | -82.8% | WIN | Source |
| carseats.csv | 20.8 KB | -82.6% | WIN | Source |
| snakes_count_10000.csv | 86.9 KB | -82.6% | WIN | Source |
| gdp.csv | 563.2 KB | -82.3% | WIN | Source |
| mlb_players.csv | 55.6 KB | -82.3% | WIN | Source |
| world_cities.csv | 1.2 MB | -81.4% | WIN | Source |
| jsonl2csv_fivethirtyeight_daily_show.csv | 126.3 KB | -81.3% | WIN | Source |
| energy_data.csv | 1.9 MB | -81.2% | WIN | Source |
| runways.csv | 3.8 MB | -81.1% | WIN | Source |
| nasdaq_listing.csv | 246.8 KB | -81% | WIN | Source |
| uci_abalone.csv | 187.4 KB | -80.4% | WIN | Source |
| imdb_top250.csv | 1.3 MB | -80.1% | WIN | Source |
| beer_reviews.csv | 155.8 KB | -80% | WIN | Source |
| usgs_earthquakes.csv | 1.9 MB | -80% | WIN | Source |
| iso3166_countries.csv | 20.2 KB | -79.9% | WIN | Source |
| tesla_stock.csv | 54.1 KB | -79.9% | WIN | Source |
| wine-quality-red.csv | 82.2 KB | -79.9% | WIN | Source |
| senators_twitter.csv | 1.9 MB | -79.8% | WIN | Source |
| usgs_quakes_2024h1.csv | 2.3 MB | -79.7% | WIN | Source |
| animal_crossing.csv | 53.1 KB | -79.2% | WIN | Source |
| pokemon.csv | 72.3 KB | -78.7% | WIN | Source |
| jsonl2csv_fivethirtyeight_riddler.csv | 384.9 KB | -78.1% | WIN | Source |
| earthquake_data.csv | 814.0 KB | -77.9% | WIN | Source |
| california_housing.csv | 1.4 MB | -77.8% | WIN | Source |
| airport_codes.csv | 1.9 MB | -77.7% | WIN | Source |
| cars.csv | 20.7 KB | -77.7% | WIN | Source |
| life_expectancy.csv | 157.4 KB | -77.4% | WIN | Source |
| sp500.csv | 52.4 KB | -77.4% | WIN | Source |
| jsonl2csv_fivethirtyeight_avengers.csv | 27.2 KB | -77.3% | WIN | Source |
| fivethirtyeight_avengers.csv | 27.0 KB | -77.2% | WIN | Source |
| vega_zipcodes.csv | 1.9 MB | -75.9% | WIN | Source |
| bechdel.csv | 1.0 MB | -75.7% | WIN | Source |
| country_codes.csv | 131.2 KB | -74.9% | WIN | Source |
| jsonl2csv_country_codes.csv | 131.4 KB | -74.9% | WIN | Source |
| kaggle_titanic.csv | 58.9 KB | -74.7% | WIN | Source |
| languages.csv | 1.9 MB | -74.6% | WIN | Source |
| transit_cost.csv | 102.2 KB | -74.4% | WIN | Source |
| childcare_costs.csv | 1.9 MB | -74.3% | WIN | Source |
| sp500_data.csv | 115.3 KB | -74.1% | WIN | Source |
| scientific_measurements.csv | 272.2 KB | -73.9% | WIN | Source |
| finance_google.csv | 58.8 KB | -73.7% | WIN | Source |
| spam.csv | 466.7 KB | -73% | WIN | Source |
| netflix_titles.csv | 1.9 MB | -72.6% | WIN | Source |
| jsonl2csv_fivethirtyeight_bechdel.csv | 204.6 KB | -72.2% | WIN | Source |
| plotly_precipitation.csv | 388.5 KB | -72.2% | WIN | Source |
| spotify_songs.csv | 1.9 MB | -72.1% | WIN | Source |
| fivethirtyeight_police_killings.csv | 125.9 KB | -69.9% | WIN | Source |
| vega_airports.csv | 205.4 KB | -69.1% | WIN | Source |
| hw_25000.csv | 618.5 KB | -69% | WIN | Source |
| stock_data.csv | 148.1 KB | -68.9% | WIN | Source |
| breast_cancer.csv | 117.1 KB | -68.8% | WIN | Source |
| board_games.csv | 1.9 MB | -68.5% | WIN | Source |
| big_mac_index.csv | 301.6 KB | -68.1% | WIN | Source |
| movies.csv | 91.1 KB | -68% | WIN | Source |
| brain_networks.csv | 1.0 MB | -65.4% | WIN | Source |
| fivethirtyeight_recent_grads.csv | 26.2 KB | -64.8% | WIN | Source |
| jsonl2csv_fivethirtyeight_nba_raptor.csv | 1.2 MB | -60.9% | WIN | Source |
Frequently Asked Questions
How much smaller does PZIP make CSV files?
+
On our 124-file benchmark of real-world datasets (US Census, OWID, NOAA, NYC TLC), PZIP achieves 14.8% median savings and up to 68.9% savings vs LZMA-9.
Is PZIP CSV compression lossless?
+
Yes — every compressed file is byte-exact verified. decode(encode(X)) = X, always. PZIP runs round-trip correctness verification on every file.
How is PZIP different from gzip or zstd for CSV?
+
Gzip and zstd treat CSV as generic bytes. PZIP understands columns, data types, and patterns — compressing each column with the best strategy for its type. That's why PZIP beats LZMA by up to 71.5% on structured data.
Can PZIP compress very large CSV files?
+
Yes. The CLI can compress very large CSV files. The free web demo currently supports uploads up to 30 MB per file due to the hosting request-size limit.
PZIP vs Other Compressors for CSV
| Feature | PZIP | LZMA / xz | gzip | zstd |
|---|---|---|---|---|
| Type-Aware | Yes | No | No | No |
| Lossless | Yes | Yes | Yes | Yes |
| Never-Worse Guarantee | Yes | N/A | N/A | N/A |
| Best CSV Saving | 99% | Baseline | Worse | ~Similar |
| Round-Trip Verified | Every file | Manual | Manual | Manual |
Try PZIP on Your CSV Files
Upload any CSV file up to 30 MB. Free during beta — no signup required. See how much smaller PZIP makes it.
Baseline: LZMA-9 (maximum compression)
Competitors: gzip-9, bz2-9, brotli-11, zstd-19, PPMd 2-24
Verification: Byte-exact round-trip on every file
Guarantee: Never-worse (PZIP <= LZMA, always)
Test files: 128 real-world CSV files
Updated: 2026-02-15