Mini Project 2

Huffman coding is an entropy encoding algorithm used for lossless data compression. It was developed by David A. Huffman while he was a Ph.D. student at MIT. With this utility, we can compress a regular file or uncompress a compressed file.

Copyright: Attribution Non-Commercial (BY-NC)

Uploaded by Nutan Kesarkar

Mini Project 2 ------ Practice on C programming

Please email to my TA:

Due: 2010-12-15

In computer science and information theory, Huffman coding is an entropy encoding
algorithm used for lossless data compression. The term refers to the use of a
variable-length code table for encoding a source symbol (such as a character in a file)
where the variable-length code table has been derived in a particular way based on the
estimated probability of occurrence for each possible value of the source symbol. It
was developed by David A. Huffman while he was a Ph.D. student at MIT, and
published in the 1952 paper "A Method for the Construction of
Minimum-Redundancy Codes". For more detailed information see
http://en.wikipedia.org/wiki/Huffman_coding.

In this project, we put Huffman coding into practice by implementing a
compress/uncompress utility. The utility can compress a regular file, or
uncompress a file that it previously compressed.
For example:

Basic idea:
1. Scan the file and count how many times each character occurs in it.
2. Build the Huffman tree from these counts.
3. Compute the Huffman code of each character from the tree built in step 2.
4. Encode the source file: write the Huffman code of each character into the
compressed file. For example:
The binary value of the character 'A' is 01000001, which has 8 bits. Suppose the
Huffman code of 'A' is 0110. Then we only need to write 4 bits instead of 8 to
represent 'A', so we save 4 bits. Since each byte has 8 bits, we use the 4 saved
bits to store the Huffman code of the next character. Suppose the Huffman code of
'B' is 110 and the binary value of 'B' is 01000010. The original file needs
two bytes to store 'A' and 'B', but the compressed file needs only 7 bits. However,
since each byte has 8 bits, one more bit is needed to fill the byte. It is either the
first bit of the Huffman code of the following character, or a 0 if the end of the
file has been reached.

Original file:    01000001 01000010 ...
                  A        B

Compressed file:  0110 110 x ...
                  A    B
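
The bit packing described above can be sketched in C as follows. This is a minimal illustration, not part of the original handout: the `BitWriter` type and the `bw_*` function names are hypothetical, codes are passed as strings of '0' and '1' characters for readability, and bits are assumed to be packed most significant bit first.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical bit writer: collects bits MSB-first into a byte buffer. */
typedef struct {
    unsigned char *buf;  /* output bytes */
    size_t cap;          /* capacity of buf in bytes */
    size_t nbytes;       /* complete bytes written so far */
    unsigned char cur;   /* partially filled byte */
    int nbits;           /* number of bits currently held in cur (0..7) */
} BitWriter;

void bw_init(BitWriter *w, unsigned char *buf, size_t cap) {
    w->buf = buf;
    w->cap = cap;
    w->nbytes = 0;
    w->cur = 0;
    w->nbits = 0;
}

/* Append one Huffman code, given as a string of '0'/'1' characters. */
void bw_put_code(BitWriter *w, const char *code) {
    for (; *code; code++) {
        assert(*code == '0' || *code == '1');
        w->cur = (unsigned char)((w->cur << 1) | (*code - '0'));
        if (++w->nbits == 8) {           /* byte is full: emit it */
            assert(w->nbytes < w->cap);
            w->buf[w->nbytes++] = w->cur;
            w->cur = 0;
            w->nbits = 0;
        }
    }
}

/* At end of input, pad the last byte with 0 bits.
   Returns the total number of bytes written. */
size_t bw_flush(BitWriter *w) {
    if (w->nbits > 0) {
        assert(w->nbytes < w->cap);
        w->buf[w->nbytes++] = (unsigned char)(w->cur << (8 - w->nbits));
        w->cur = 0;
        w->nbits = 0;
    }
    return w->nbytes;
}
```

Writing the codes 0110 ('A') and 110 ('B') and then flushing yields the single byte 01101100: seven code bits followed by one padding 0.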

When you create the compressed file, you need to store the encoding information in
it, so that the information is available when the file is uncompressed.

When uncompressing the file, read the encoding information first and
reconstruct the Huffman tree. Then decode the file using the Huffman tree.
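
The handout does not prescribe a format for the encoding information. One simple possibility, sketched here under that assumption, is to store the 256 character counts at the start of the compressed file so that the decoder can rebuild the same tree. The names `header_write`, `header_read`, `NSYM` and `HEADER_BYTES` are hypothetical, as is the choice of 4-byte big-endian counts.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define NSYM 256                 /* one count per possible byte value */
#define HEADER_BYTES (NSYM * 4)  /* each count stored in 4 bytes */

/* Serialize the frequency table as 256 big-endian 32-bit counts.
   Returns the number of header bytes written. */
size_t header_write(const uint32_t freq[NSYM], unsigned char *out, size_t cap) {
    assert(cap >= HEADER_BYTES);
    for (int i = 0; i < NSYM; i++) {
        out[4 * i]     = (unsigned char)(freq[i] >> 24);
        out[4 * i + 1] = (unsigned char)(freq[i] >> 16);
        out[4 * i + 2] = (unsigned char)(freq[i] >> 8);
        out[4 * i + 3] = (unsigned char)(freq[i]);
    }
    return HEADER_BYTES;
}

/* Read the frequency table back.
   Returns the number of header bytes consumed. */
size_t header_read(const unsigned char *in, size_t len, uint32_t freq[NSYM]) {
    assert(len >= HEADER_BYTES);
    for (int i = 0; i < NSYM; i++) {
        freq[i] = ((uint32_t)in[4 * i] << 24)
                | ((uint32_t)in[4 * i + 1] << 16)
                | ((uint32_t)in[4 * i + 2] << 8)
                | (uint32_t)in[4 * i + 3];
    }
    return HEADER_BYTES;
}
```

With this layout the sum of the counts also tells the decoder how many characters to emit, so any padding bits at the end of the file can be ignored. The decoder must build the tree with exactly the same tie-breaking rules as the encoder.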

Huffman Code: Example

The following example is based on a data source that uses a set of five different
symbols. The symbols' frequencies are:

Symbol  Frequency
A       24
B       12
C       10
D       8
E       8
----> total: 186 bits
(at 3 bits per code word)

The two rarest symbols, 'E' and 'D', are connected first, followed by 'C' and 'B'. The
new parent nodes have the frequencies 16 and 22 respectively and are joined
in the next step. The resulting node and the remaining symbol 'A' become
children of the root node, which is created in the final step.

Code Tree according to Huffman

Symbol  Frequency  Code  Code length  Total length
A       24         0     1            24
B       12         100   3            36
C       10         101   3            30
D       8          110   3            24
E       8          111   3            24
--------------------------------------------------
Total with a fixed 3-bit code: 186 bits
Total with the Huffman code:   138 bits
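
The tree construction in this example can be sketched in C as follows. This is an illustrative sketch, not the required implementation: the `HuffNode` type and the `huffman_lengths` function are hypothetical names, and the two lightest nodes are found with a simple linear scan rather than a priority queue. The function computes only the code length of each symbol. The exact bit patterns depend on how ties and left/right children are chosen, but the lengths for this example do not.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_SYMS 256
#define MAX_NODES (2 * MAX_SYMS - 1)

typedef struct {
    unsigned long weight;  /* frequency of the symbol or subtree */
    int parent;            /* index of the parent node, -1 if none yet */
} HuffNode;

/* Build a Huffman tree over n symbols with the given frequencies and
   store each symbol's code length in len_out[i].
   Returns the total number of bits needed to encode the whole source. */
unsigned long huffman_lengths(const unsigned long *freq, int n, int *len_out) {
    HuffNode nodes[MAX_NODES];
    int count = n;
    unsigned long total = 0;

    assert(n >= 2 && n <= MAX_SYMS);
    for (int i = 0; i < n; i++) {
        nodes[i].weight = freq[i];
        nodes[i].parent = -1;
    }

    /* n - 1 merges: each joins the two lightest parentless nodes. */
    for (int m = 0; m < n - 1; m++) {
        int a = -1, b = -1;  /* a: lightest, b: second lightest */
        for (int i = 0; i < count; i++) {
            if (nodes[i].parent != -1) continue;
            if (a == -1 || nodes[i].weight < nodes[a].weight) {
                b = a;
                a = i;
            } else if (b == -1 || nodes[i].weight < nodes[b].weight) {
                b = i;
            }
        }
        nodes[count].weight = nodes[a].weight + nodes[b].weight;
        nodes[count].parent = -1;
        nodes[a].parent = count;
        nodes[b].parent = count;
        count++;
    }

    /* A symbol's code length is its depth: the number of steps to the root. */
    for (int i = 0; i < n; i++) {
        int depth = 0;
        for (int p = nodes[i].parent; p != -1; p = nodes[p].parent) depth++;
        len_out[i] = depth;
        total += freq[i] * (unsigned long)depth;
    }
    return total;
}
```

Called with the frequencies 24, 12, 10, 8, 8, the function reports code lengths 1, 3, 3, 3, 3 and a total of 138 bits, matching the table.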

Basic Requirement:
1. Derive the Huffman code from the frequency statistics of all the characters in a file.
2. Output the file encoded with the Huffman code; you may output the plain Huffman code
of each character.
3. Decode an "encoded file".
Example:
Text:
Abbdc
Huffman code:
'A': 10
'b': 01
'c': 11
'd': 00
Your "compressed" file should show:
1001010011

If the input is 101010110000,
your output should be:
AAAcdd
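
The example above can be reproduced with a short C sketch. It assumes the code table is already known and hard-codes it rather than deriving it from a Huffman tree. The names `CodeEntry`, `TABLE`, `encode_text` and `decode_bits` are hypothetical, and the output is the plain code as '0' and '1' characters, as the basic requirement allows.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef struct {
    char sym;          /* source character */
    const char *code;  /* its code as a string of '0'/'1' */
} CodeEntry;

/* Code table copied from the example. */
static const CodeEntry TABLE[] = {
    {'A', "10"}, {'b', "01"}, {'c', "11"}, {'d', "00"}
};
static const size_t TABLE_LEN = sizeof TABLE / sizeof TABLE[0];

/* Encode text into a NUL-terminated '0'/'1' string.
   Returns 0 on success, -1 on an unknown character or a full buffer. */
int encode_text(const char *text, char *out, size_t cap) {
    size_t used = 0;
    assert(text != NULL && out != NULL);
    for (; *text; text++) {
        const char *code = NULL;
        for (size_t i = 0; i < TABLE_LEN; i++) {
            if (TABLE[i].sym == *text) { code = TABLE[i].code; break; }
        }
        if (code == NULL) return -1;
        size_t n = strlen(code);
        if (used + n + 1 > cap) return -1;
        memcpy(out + used, code, n);
        used += n;
    }
    if (used + 1 > cap) return -1;
    out[used] = '\0';
    return 0;
}

/* Decode a '0'/'1' string by matching codes at the current position.
   Because no code is a prefix of another, at most one entry matches.
   Returns 0 on success, -1 on invalid input or a full buffer. */
int decode_bits(const char *bits, char *out, size_t cap) {
    size_t used = 0;
    assert(bits != NULL && out != NULL);
    while (*bits) {
        size_t i;
        for (i = 0; i < TABLE_LEN; i++) {
            size_t n = strlen(TABLE[i].code);
            if (strncmp(bits, TABLE[i].code, n) == 0) {
                if (used + 2 > cap) return -1;  /* room for char + NUL */
                out[used++] = TABLE[i].sym;
                bits += n;
                break;
            }
        }
        if (i == TABLE_LEN) return -1;  /* no code matches here */
    }
    if (used + 1 > cap) return -1;
    out[used] = '\0';
    return 0;
}
```

Encoding "Abbdc" with this table gives 1001010011, and decoding 101010110000 gives AAAcdd. A full solution would replace the linear table lookup in the decoder with a walk down the Huffman tree.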

Bonus (extra 10 points on your overall credits):

1. Implement the real compress/uncompress utility, which can compress and uncompress
files.
2. Compare the compression rate across different types of files, for
example text files, image files, and so on (at least three types).
