Opened 12 years ago
Last modified 8 years ago
#82 assigned new-feature
cool comparison tools -- find similar
Reported by: | Fred T. Hamster | Owned by: | bugdock |
---|---|---|---|
Priority: | minor | Milestone: | |
Component: | feistymeow-nucleus | Version: | |
Keywords: | Cc: |
Description
- Examines a directory tree and builds an index of file name portions. *
- The files with similar names are reported. *
- text comparison tool for supporting nechung:
- it would be nice to have a tool that could check that there are no duplicate fortunes in nechung database.
- more generally, this should be able to do a fuzzy compare of quotes against each other and report ones that incorporate a large amount of another quote or which is too similar to another quote.
idea could encompass both...
==============
generalized text comparison giving a score based on:
words used in common.
similar word orders.
similar sizes for words.
sizes for words distributed similarly.
(long here in both, etc)
how many large chunks are the same.
must work on text as a single stream--no crs.
probably most important is how similar the word choices are.
basic idea...
comparator:
can take like a file system tree and spit out the names that are similar
or same within it. should operate on any list.
Note:
See TracTickets
for help on using tickets.