Project I: File Compression with Huffman

- Name: El Mehdi Hamte - MrFall
- Date: 26-11-2026

1. Introduction

This project implements file compression and decompression using the Huffman algorithm in the C programming language.

The goal is to minimize the number of bits needed to represent characters by assigning shorter codes to frequent characters and longer codes to rare characters.

2. Huffman Algorithm Explanation

1. Character Frequency Calculation

We count how many times each character appears in the input file.
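The counting step can be sketched as follows. This is a minimal illustration, not the code in `huffman.c`: the function name `count_frequencies`, the `SYMBOLS` constant, and the choice to count raw bytes from an in-memory buffer are all assumptions.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define SYMBOLS 256  /* one slot per possible byte value (assumed alphabet) */

/* Count how often each byte value occurs in buf[0..len).
   freq must have room for SYMBOLS entries; it is zeroed first. */
void count_frequencies(const unsigned char *buf, size_t len,
                       unsigned long freq[SYMBOLS])
{
    assert(buf != NULL || len == 0);
    memset(freq, 0, SYMBOLS * sizeof freq[0]);
    for (size_t i = 0; i < len; i++)
        freq[buf[i]]++;
}
```

For the input `ABRACADABRA` this gives A = 5, B = 2, R = 2, C = 1, D = 1.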

2. Building the Huffman Tree

- Leaves hold a character and its frequency
- Each internal node holds the sum of its children's frequencies
- The two lowest-frequency nodes are always merged first, using a Min-Heap, until a single root remains
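The merge loop described above might look like the sketch below. The `Node` and `MinHeap` layouts and the names `heap_push`, `heap_pop`, and `build_tree` are hypothetical; the project's own `minheap.c` and `huffman.c` may be organized differently.

```c
#include <assert.h>
#include <stdlib.h>

typedef struct Node {
    unsigned long freq;
    int symbol;                  /* byte value for a leaf, -1 for an internal node */
    struct Node *left, *right;
} Node;

typedef struct {
    Node *items[256];            /* never holds more than 256 nodes at once */
    int size;
} MinHeap;

static Node *new_node(int symbol, unsigned long freq, Node *l, Node *r)
{
    Node *n = malloc(sizeof *n);
    assert(n != NULL);
    n->symbol = symbol; n->freq = freq; n->left = l; n->right = r;
    return n;
}

static void heap_push(MinHeap *h, Node *n)
{
    assert(h->size < 256);
    int i = h->size++;
    /* sift up: move larger parents down until n fits */
    while (i > 0 && h->items[(i - 1) / 2]->freq > n->freq) {
        h->items[i] = h->items[(i - 1) / 2];
        i = (i - 1) / 2;
    }
    h->items[i] = n;
}

static Node *heap_pop(MinHeap *h)
{
    assert(h->size > 0);
    Node *min = h->items[0];
    Node *last = h->items[--h->size];
    int i = 0;
    /* sift down: pull the smaller child up until last fits */
    for (;;) {
        int c = 2 * i + 1;
        if (c >= h->size) break;
        if (c + 1 < h->size && h->items[c + 1]->freq < h->items[c]->freq) c++;
        if (h->items[c]->freq >= last->freq) break;
        h->items[i] = h->items[c];
        i = c;
    }
    h->items[i] = last;
    return min;
}

/* Build a Huffman tree from a 256-entry frequency table.
   Returns NULL for an empty input; the caller owns the nodes. */
Node *build_tree(const unsigned long freq[256])
{
    MinHeap h = { .size = 0 };
    for (int s = 0; s < 256; s++)
        if (freq[s] > 0)
            heap_push(&h, new_node(s, freq[s], NULL, NULL));
    if (h.size == 0) return NULL;
    while (h.size > 1) {
        Node *a = heap_pop(&h);   /* smallest */
        Node *b = heap_pop(&h);   /* second smallest */
        heap_push(&h, new_node(-1, a->freq + b->freq, a, b));
    }
    return heap_pop(&h);
}
```

An input with only one distinct byte produces a lone leaf, which has an empty code; a complete encoder has to handle that case separately.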

3. Generating Huffman Codes

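Each path from the root to a leaf spells out that leaf's code, one bit per branch:

- Left → 0
- Right → 1

Because characters sit only at the leaves, no code is a prefix of another, so the bit stream can be decoded without separators.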
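A recursive traversal is one common way to collect the codes. The sketch below stores each code as a string of `'0'` and `'1'` characters; the `Node` layout, the `MAX_CODE_LEN` limit, and the name `generate_codes` are assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_CODE_LEN 256   /* 256 symbols give codes of at most 255 bits, plus '\0' */

typedef struct Node {
    int symbol;                 /* byte value for a leaf, -1 for an internal node */
    struct Node *left, *right;
} Node;

/* Walk the tree depth-first. prefix holds the bits chosen so far
   ('0' for left, '1' for right); at a leaf it is copied into codes[]. */
void generate_codes(const Node *n, char *prefix, int depth,
                    char codes[256][MAX_CODE_LEN])
{
    if (n == NULL) return;
    if (n->left == NULL && n->right == NULL) {
        prefix[depth] = '\0';
        strcpy(codes[n->symbol], prefix);
        return;
    }
    assert(depth < MAX_CODE_LEN - 1);
    prefix[depth] = '0';
    generate_codes(n->left, prefix, depth + 1, codes);
    prefix[depth] = '1';
    generate_codes(n->right, prefix, depth + 1, codes);
}
```

The caller passes a scratch buffer of `MAX_CODE_LEN` characters as `prefix` and starts with `depth` set to 0.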

4. File Encoding

- Each character is replaced with its Huffman code
- The resulting bits are grouped into bytes
- The last byte may be padded with unused bits; the padding information is stored in a .meta file
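The grouping and padding step can be illustrated with the sketch below. It assumes bits are written most significant bit first and that padding bits are zeros at the end of the last byte; the project may use a different bit order, and `pack_bits` is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>

/* Pack a string of '0'/'1' characters into bytes, most significant bit first.
   Returns the number of bytes written to out and stores the number of
   unused trailing bits of the last byte in *padding. */
size_t pack_bits(const char *bits, unsigned char *out, size_t out_cap, int *padding)
{
    size_t nbytes = 0;
    unsigned char cur = 0;
    int used = 0;                      /* bits collected in cur so far */
    for (const char *p = bits; *p != '\0'; p++) {
        assert(*p == '0' || *p == '1');
        cur = (unsigned char)((cur << 1) | (*p - '0'));
        if (++used == 8) {
            assert(nbytes < out_cap);
            out[nbytes++] = cur;
            cur = 0;
            used = 0;
        }
    }
    *padding = 0;
    if (used > 0) {
        /* shift the partial byte left so the padding sits in the low bits */
        *padding = 8 - used;
        assert(nbytes < out_cap);
        out[nbytes++] = (unsigned char)(cur << *padding);
    }
    return nbytes;
}
```

A 23-bit stream, for example, fills three bytes with one padding bit, and that padding count is the value the decoder needs to read back.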

5. File Decoding

- Rebuild the Huffman tree from codes.map
- Decode the compressed bit stream
- Remove the padding
- Restore the original text exactly
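The decoding steps can be sketched as below. The on-disk format of `codes.map` is not described here, so the example starts from (symbol, code) pairs already parsed into memory; `insert_code` and `decode_bits` are hypothetical names, and the bit order (most significant bit first) is an assumption. The sketch also assumes at least two distinct symbols.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

typedef struct Node {
    int symbol;                 /* byte value for a leaf, -1 otherwise */
    struct Node *left, *right;
} Node;

static Node *new_node(void)
{
    Node *n = calloc(1, sizeof *n);
    assert(n != NULL);
    n->symbol = -1;
    return n;
}

/* Add one code (a string of '0'/'1') to the tree, creating nodes on the way. */
void insert_code(Node *root, int symbol, const char *code)
{
    Node *cur = root;
    for (; *code != '\0'; code++) {
        Node **next = (*code == '1') ? &cur->right : &cur->left;
        if (*next == NULL) *next = new_node();
        cur = *next;
    }
    cur->symbol = symbol;
}

/* Decode nbytes of packed data, skipping the last `padding` bits.
   Returns the number of symbols written to out. */
size_t decode_bits(const Node *root, const unsigned char *data, size_t nbytes,
                   int padding, unsigned char *out, size_t out_cap)
{
    assert(root != NULL && padding >= 0 && padding < 8);
    size_t total_bits = nbytes * 8 - (nbytes ? (size_t)padding : 0);
    size_t n_out = 0;
    const Node *cur = root;
    for (size_t i = 0; i < total_bits; i++) {
        int bit = (data[i / 8] >> (7 - i % 8)) & 1;
        cur = bit ? cur->right : cur->left;
        assert(cur != NULL);    /* a well-formed stream never leaves the tree */
        if (cur->left == NULL && cur->right == NULL) {
            assert(n_out < out_cap);
            out[n_out++] = (unsigned char)cur->symbol;
            cur = root;         /* start the next code at the root */
        }
    }
    return n_out;
}
```

With codes A = 0, B = 10, C = 11, the single byte `0x58` with 2 padding bits decodes to `ABCA`.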

3. Project Files and Their Functions

| File | Description |
| --- | --- |
| `huffman.h` | Node structure + Huffman function prototypes |
| `huffman.c` | Huffman tree creation, code generation, encode/decode logic |
| `minheap.h` | Heap structure and prototypes |
| `minheap.c` | Min-Heap implementation (insert, extract-min, build-heap) |
| `main.c` | CLI interface: `encode` and `decode` commands |
| `Makefile` | Automates compilation (`make`) |

4. How to Use the Program

1. Create your input file

printf 'ABRACADABRA' > input.txt

(`printf` writes exactly 11 bytes; `echo` would append a newline and make the file 12 bytes.)

2. Compile the project

gcc main.c huffman.c minheap.c -o huff

(or simply)

make

3. Compress a file

./huff encode input.txt compressed.huf codes.map

This generates:

- compressed.huf → compressed binary file
- codes.map → Huffman codes used
- compressed.huf.meta → padding info

4. Decompress

./huff decode compressed.huf compressed.huf.meta codes.map mrfall.txt
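The two command lines take different numbers of arguments, so the command dispatch has to check both the mode and the argument count. The sketch below shows one way to do that; the `Command` enum and `parse_command` are hypothetical and are not taken from `main.c`.

```c
#include <assert.h>
#include <string.h>

typedef enum { CMD_ENCODE, CMD_DECODE, CMD_INVALID } Command;

/* Map argv to a command, checking the argument count each mode expects:
   encode <input> <output> <codes>               -> argc == 5
   decode <compressed> <meta> <codes> <output>   -> argc == 6 */
Command parse_command(int argc, char *const argv[])
{
    assert(argv != NULL);
    if (argc == 5 && strcmp(argv[1], "encode") == 0) return CMD_ENCODE;
    if (argc == 6 && strcmp(argv[1], "decode") == 0) return CMD_DECODE;
    return CMD_INVALID;
}
```

A `main` function could call this first and print a usage message when it returns `CMD_INVALID`.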

5. Check output

cat mrfall.txt

Expected:

ABRACADABRA

5. Test Results

| File | Original Size | Compressed Size | Space Saved |
| --- | --- | --- | --- |
| `input.txt` (ABRACADABRA) | 11 bytes | 3 bytes | about 73% |

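Compression becomes more effective with larger and more repetitive input data. The sizes above appear to count only `compressed.huf`; `codes.map` and the `.meta` file add overhead that weighs most on small inputs such as this one.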
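The 3-byte figure can be checked by hand: the encoded length in bits is the sum of frequency times code length over all characters, rounded up to whole bytes. The sketch below does that arithmetic; `encoded_bits` and `bytes_needed` are hypothetical helper names, and the code lengths used in the example (A = 1 bit, B, R, C, D = 3 bits each) are one possible optimal assignment for `ABRACADABRA`, not necessarily the one the program produces.

```c
#include <assert.h>
#include <stddef.h>

/* Total encoded length in bits: sum over all symbols of freq * code length. */
unsigned long encoded_bits(const unsigned long freq[256], const int code_len[256])
{
    unsigned long bits = 0;
    for (int s = 0; s < 256; s++) {
        assert(code_len[s] >= 0);
        bits += freq[s] * (unsigned long)code_len[s];
    }
    return bits;
}

/* Whole bytes needed to hold the given number of bits. */
size_t bytes_needed(unsigned long bits)
{
    return (size_t)((bits + 7) / 8);
}
```

With those lengths the total is 5·1 + 2·3 + 2·3 + 1·3 + 1·3 = 23 bits, which fits in 3 bytes with 1 padding bit, so 8 of the original 11 bytes are saved.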

6. Educational Videos

Learn Huffman Coding visually:
