-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathSearcher.EditDistace.html
82 lines (75 loc) · 5.04 KB
/
Searcher.EditDistace.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html><head><title>Python: module EditDistace</title>
<meta charset="utf-8">
</head><body bgcolor="#f0f0f8">
<table width="100%" cellspacing=0 cellpadding=2 border=0 summary="heading">
<tr bgcolor="#7799ee">
<td valign=bottom> <br>
<font color="#ffffff" face="helvetica, arial"> <br><big><big><strong>EditDistace</strong></big></big></font></td
><td align=right valign=bottom
><font color="#ffffff" face="helvetica, arial"><a href=".">index</a><br><a href="file:/home/vamshi/PycharmProjects/InformationRetrieval/Searcher/EditDistace.py">/home/vamshi/PycharmProjects/InformationRetrieval/Searcher/EditDistace.py</a></font></td></tr></table>
<p><tt>This module contains the <a href="#EditDistance">EditDistance</a> class.</tt></p>
<p>
<table width="100%" cellspacing=0 cellpadding=2 border=0 summary="section">
<tr bgcolor="#aa55cc">
<td colspan=3 valign=bottom> <br>
<font color="#ffffff" face="helvetica, arial"><big><strong>Modules</strong></big></font></td></tr>
<tr><td bgcolor="#aa55cc"><tt> </tt></td><td> </td>
<td width="100%"><table width="100%" summary="list"><tr><td width="25%" valign=top><a href="operator.html">operator</a><br>
</td><td width="25%" valign=top><a href="shelve.html">shelve</a><br>
</td><td width="25%" valign=top></td><td width="25%" valign=top></td></tr></table></td></tr></table><p>
<table width="100%" cellspacing=0 cellpadding=2 border=0 summary="section">
<tr bgcolor="#ee77aa">
<td colspan=3 valign=bottom> <br>
<font color="#ffffff" face="helvetica, arial"><big><strong>Classes</strong></big></font></td></tr>
<tr><td bgcolor="#ee77aa"><tt> </tt></td><td> </td>
<td width="100%"><dl>
<dt><font face="helvetica, arial"><a href="__builtin__.html#object">__builtin__.object</a>
</font></dt><dd>
<dl>
<dt><font face="helvetica, arial"><a href="EditDistace.html#EditDistance">EditDistance</a>
</font></dt></dl>
</dd>
</dl>
<p>
<table width="100%" cellspacing=0 cellpadding=2 border=0 summary="section">
<tr bgcolor="#ffc8d8">
<td colspan=3 valign=bottom> <br>
<font color="#000000" face="helvetica, arial"><a name="EditDistance">class <strong>EditDistance</strong></a>(<a href="__builtin__.html#object">__builtin__.object</a>)</font></td></tr>
<tr bgcolor="#ffc8d8"><td rowspan=2><tt> </tt></td>
<td colspan=2><tt>Just a small class to calculate edit distance between two words, and find<br>
the top 5 corrections based on the edit distance and the number of times the<br>
word was used in a query.<br>
The source_word comes from query entered by the user, and the target_word<br>
is from the query corpus which is a dictionary consisting of all the<br>
query_words so far that have df != zero.<br> </tt></td></tr>
<tr><td> </td>
<td width="100%">Methods defined here:<br>
<dl><dt><a name="EditDistance-top_corrections"><strong>top_corrections</strong></a>(self, source_word)</dt><dd><tt>Checks edit distance of source_word from every word in query_corpus<br>
and returns the top 5 corrections. It retrieves the top 5 based on<br>
pure edit_distance and then sorts them again based on number of times<br>
the query_word was used prior<br>
:param source_word: (String) query_word entered by user which has zero<br>
document frequency.<br>
:return:</tt></dd></dl>
<hr>
Static methods defined here:<br>
<dl><dt><a name="EditDistance-edit_distance"><strong>edit_distance</strong></a>(source_word, target_word)</dt><dd><tt>A very basic algorithm to calculate levenshtein distance between two<br>
words.<br>
:param source_word : (String) The word entered by user in the query.<br>
This<br>
function is called only if the df of the source_word is zero.<br>
:param target_word: (String)The word whose edit distance from<br>
source_word is<br>
to be checked<br>
:return: edit_distance (integer) between source_word and target_word</tt></dd></dl>
<hr>
Data descriptors defined here:<br>
<dl><dt><strong>__dict__</strong></dt>
<dd><tt>dictionary for instance variables (if defined)</tt></dd>
</dl>
<dl><dt><strong>__weakref__</strong></dt>
<dd><tt>list of weak references to the object (if defined)</tt></dd>
</dl>
</td></tr></table></td></tr></table>
</body></html>