SoLiXG:Greece’s recovery and resilience plan: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
Word Frequency | == Word Frequency == | ||
=== Most frequent words === | === Most frequent words === | ||
Revision as of 17:18, 15 February 2023
Word Frequency
Most frequent words
Script
<syntaxhighlight lang="python"> import re
from wordfreq import word_frequency
- this is a script to find the most frequent words in a textfile
lines = open('gr-policy.txt', 'r') text=lines.read() text_list=text.replace('\n', ' ').split(".")
lines.close()
sep_words=[] new_list=[] all_freq={}
frequency={} with open("output.txt", "a") as f: for l in text_list: for w in l.split(): sep_words.append(w) for word in sep_words: freq = sep_words.count(word) frequency={word:freq} all_freq.update(frequency) # all_freq.append(frequency) new_list=sorted(all_freq.items(), key=lambda item: item[1], reverse=True ) print(*new_list, sep = "\n", file=f)
</syntaxhighlight>