SoLiXG:Greece’s recovery and resilience plan

From titipi

Revision as of 17:18, 15 February 2023

Word Frequency

Most frequent words

Script

<syntaxhighlight lang="python">
# This is a script to find the most frequent words in a text file.
from collections import Counter

# Read the whole policy text; the file is closed automatically.
with open('gr-policy.txt', 'r') as lines:
    text = lines.read()

# Treat newlines as spaces, then split the text into sentences at full stops.
text_list = text.replace('\n', ' ').split(".")

# Collect every whitespace-separated word from every sentence.
sep_words = []
for l in text_list:
    for w in l.split():
        sep_words.append(w)

# Count each word in a single pass instead of calling list.count() per word.
all_freq = Counter(sep_words)

# Sort the (word, count) pairs by count, highest first.
new_list = sorted(all_freq.items(), key=lambda item: item[1], reverse=True)

# Append the sorted pairs to output.txt, one pair per line.
with open("output.txt", "a") as f:
    print(*new_list, sep="\n", file=f)
</syntaxhighlight>