WSD using Dictionaries, Thesauri, and Supervised Methods
In human language, a word is often used in more than one way. Understanding these different usage patterns is important for many Natural Language Processing applications.
Depending on the situation, the same word can mean different things. As a vast majority of the information online is in English, for the sake of simplicity, let us deal with examples in the English language only.
Let us take the example of the word “bark”:
One meaning of the word refers to the outer covering of a tree. The other meaning refers to the sound made by a dog. So here, the same word has two different meanings.
Let us take a piece of text:
“Cinnamon comes from the bark of the Cinnamon tree.”
“The dog barked at the stranger.”
Let us now try a sentence that uses both senses:
“The dog was scratching the bark of the tree; when the man approached
the dog to make it stop, the dog barked.”
Suppose this sentence is passed to an algorithm for sentiment analysis: to the algorithm, “bark” and “barked” might appear to carry the same meaning.
So we can see that the same word can mean different things based on how it is used in a particular sentence. The usage of a word tells us a lot about its meaning. The problem is that, in NLP, while dealing with text data, we need some way to interpret the same word under its different meanings.
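To make this concrete, a lexical database such as WordNet lists the different recorded senses of a word. The short sketch below uses NLTK's WordNet interface (an assumption; any machine-readable dictionary would illustrate the same point) to list the senses of “bark”.

# A minimal sketch, assuming NLTK and its WordNet data are installed
# (pip install nltk, then nltk.download('wordnet')).
import nltk
from nltk.corpus import wordnet as wn

nltk.download("wordnet", quiet=True)

# Print every sense WordNet records for the word "bark", with its definition.
for synset in wn.synsets("bark"):
    print(synset.name(), "-", synset.definition())
# The output includes, among others, a noun sense for the covering of a tree
# and senses for the sound a dog makes.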
What is Word Sense Disambiguation?
Word Sense Disambiguation is an important NLP method by which the meaning with which a word is used in a particular context is determined. NLP systems often face the challenge of properly identifying word senses, and determining the specific usage of a word in a particular sentence has many applications.
Word Sense Disambiguation basically solves the ambiguity that arises in
determining the meaning of the same word used in different situations.
Word Sense Disambiguation Applications
WSD has many applications in various text processing and NLP fields.
WSD can be used alongside lexicography. Much of modern lexicography is corpus-based, and WSD, used in lexicography, can provide significant textual indicators of word senses.
WSD can also be used in Text Mining and Information Extraction tasks. As the major purpose of WSD is to accurately understand the meaning of a word in a particular usage or sentence, it can be used for the correct labeling of words.
For example, from a security point of view, a text system should be able to
understand the difference between a coal “mine” and a land “mine”.
While the former serves industrial purposes, the latter is a security threat.
So a text mining application must be able to determine the difference
between the two.
Similarly, WSD can be used for Information Retrieval purposes. Information Retrieval systems search through text data primarily on the basis of textual information, so knowing which sense of a word is used in any sentence will surely help.
Challenges in Word Sense Disambiguation
WSD faces a lot of challenges and problems.
The most common problem is the difference between various dictionaries or text corpora. Different dictionaries divide words into different sets of meanings, which means the sense of a word can be perceived differently depending on the resource. There is also a huge amount of text out there, and it is often not possible to process everything properly.
Different applications need different algorithms, and that is often a challenge for WSD.
Another problem is that words cannot always be divided into discrete meanings. Words often have closely related meanings, and this causes a lot of problems.
How to implement WSD?
There are four main ways to implement WSD.
These are:
Dictionary- and knowledge-based methods:
These methods rely on lexical resources such as dictionaries and thesauri. They are based on the fact that words that are related to each other can be found in each other's definitions. The popularly used Lesk method, which we shall discuss in more detail later, is a seminal dictionary-based method.
Supervised methods:
In this type, sense-annotated corpora are used to train machine learning models. A problem that may arise, however, is that such corpora are very laborious and time-consuming to create.
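As a rough illustration of the supervised approach, the sketch below trains a simple bag-of-words classifier to pick the sense of “bark” from its surrounding sentence. The tiny sense-annotated dataset and the use of scikit-learn are assumptions made only for this example; real systems need far larger annotated corpora.

# A minimal supervised WSD sketch, assuming scikit-learn is installed.
# The sense-annotated sentences below are invented purely for illustration.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

contexts = [
    "cinnamon comes from the bark of the cinnamon tree",
    "the trunk was covered in rough brown bark",
    "the dog barked at the stranger",
    "we heard a loud bark from the neighbour's dog",
]
senses = ["tree_covering", "tree_covering", "dog_sound", "dog_sound"]

# Bag-of-words features fed into a Naive Bayes classifier over the sense labels.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(contexts, senses)

print(model.predict(["the dog gave a loud bark"]))            # expected: dog_sound
print(model.predict(["the bark of the old tree was rough"]))  # expected: tree_covering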
Semi-supervised Methods:
Due to the lack of such corpora, many word sense disambiguation algorithms use semi-supervised methods. The process starts with a small amount of data, which is often created manually.
This data is used to train an initial classifier. The classifier is then applied to an untagged part of the corpus to create a larger training set. Basically, this method involves bootstrapping from the initial data, which is referred to as the seed data.
Semi-supervised methods thus use both labeled and unlabeled data.
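The bootstrapping step can be sketched roughly as follows: train on the seed data, label the untagged sentences, and keep only the confident predictions to grow the training set. The sentences, the 0.6 confidence threshold, and the use of scikit-learn are assumptions made purely for illustration.

# A minimal semi-supervised (bootstrapping) WSD sketch, assuming scikit-learn.
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB

seed_texts = [
    "cinnamon comes from the bark of the cinnamon tree",
    "the dog barked loudly at the stranger",
]
seed_labels = ["tree_covering", "dog_sound"]

unlabeled = [
    "moss covered the bark of the old tree",
    "we heard the dog bark at night",
    "the rough bark of the oak tree",
]

vectorizer = CountVectorizer()
X_all = vectorizer.fit_transform(seed_texts + unlabeled)
X_seed, X_unlabeled = X_all[: len(seed_texts)], X_all[len(seed_texts):]

# 1. Train an initial classifier on the small, manually created seed set.
clf = MultinomialNB().fit(X_seed, seed_labels)

# 2. Label the untagged sentences and keep only confident predictions.
probs = clf.predict_proba(X_unlabeled)
confident = probs.max(axis=1) >= 0.6          # confidence threshold (assumed)
new_labels = clf.classes_[probs.argmax(axis=1)]

# 3. Retrain on the enlarged (seed + self-labeled) training set.
X_grown = np.vstack([X_seed.toarray(), X_unlabeled[confident].toarray()])
y_grown = list(seed_labels) + list(new_labels[confident])
clf = MultinomialNB().fit(X_grown, y_grown)

print(dict(zip(unlabeled, new_labels)))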
Unsupervised Methods:
Unsupervised methods pose the greatest challenge to researchers and NLP professionals. A key assumption of these models is that similar senses occur in similar contexts. They do not depend on manual annotation efforts and can therefore overcome the knowledge acquisition bottleneck.
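A rough sketch of the unsupervised idea: represent each occurrence of “bark” by its surrounding context and cluster the contexts, so that each cluster stands for one induced sense. The sentences, the choice of TF-IDF features, and the use of scikit-learn's KMeans are assumptions made only for illustration.

# A minimal unsupervised sense-induction sketch, assuming scikit-learn.
# No sense labels are used; the contexts are invented for illustration.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

contexts = [
    "cinnamon comes from the bark of the cinnamon tree",
    "rough brown bark covered the old oak tree",
    "the dog barked loudly at the stranger",
    "we heard a sharp bark from the neighbour's dog",
]

# Represent each context as a TF-IDF vector and group similar contexts;
# each cluster is taken to correspond to one induced sense of "bark".
X = TfidfVectorizer(stop_words="english").fit_transform(contexts)
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

for sentence, cluster in zip(contexts, clusters):
    print(cluster, sentence)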
Lesk Algorithm
Lesk Algorithm is a classical Word Sense Disambiguation algorithm
introduced by Michael E. Lesk in 1986.
The Lesk algorithm is based on the idea that words in a given region of text will share a common meaning. In the Simplified Lesk algorithm, the correct meaning of each word in a context is found by choosing the sense whose dictionary definition overlaps the most with the given context.
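To make the overlap idea concrete, here is a rough sketch of Simplified Lesk over WordNet glosses. The use of NLTK's WordNet interface and plain whitespace tokenization are simplifying assumptions; NLTK also ships a ready-made implementation in nltk.wsd.lesk.

# A minimal Simplified Lesk sketch, assuming NLTK and its WordNet data
# are installed (nltk.download('wordnet')).
from nltk.corpus import wordnet as wn

def simplified_lesk(word, sentence):
    """Pick the WordNet sense of `word` whose gloss overlaps most
    with the words of the surrounding sentence."""
    context = set(sentence.lower().split())
    best_sense, best_overlap = None, -1
    for sense in wn.synsets(word):
        # Words in the dictionary definition (gloss) plus its usage examples.
        signature = set(sense.definition().lower().split())
        for example in sense.examples():
            signature |= set(example.lower().split())
        overlap = len(context & signature)
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense

sense = simplified_lesk("bark", "the dog barked at the stranger")
print(sense, "-", sense.definition() if sense else "no sense found")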