-
Notifications
You must be signed in to change notification settings - Fork 4.9k
/
WordFrequency.sh
38 lines (34 loc) · 1.18 KB
/
WordFrequency.sh
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
# Source : https://leetcode.com/problems/word-frequency/
# Author : Hao Chen
# Date : 2015-03-31
##################################################################################
#
# Write a bash script to calculate the frequency of each word in a text file words.txt.
#
# For simplicity sake, you may assume:
#
# words.txt contains only lowercase characters and space ' ' characters.
# Each word must consist of lowercase characters only.
# Words are separated by one or more whitespace characters.
#
# For example, assume that words.txt has the following content:
# the day is sunny the the
# the sunny is is
#
# Your script should output the following, sorted by descending frequency:
#
# the 4
# is 3
# sunny 2
# day 1
#
# Note:
# Don't worry about handling ties, it is guaranteed that each word's frequency count is unique.
#
# [show hint]
# Hint:
# Could you write it in one-line using Unix pipes?
##################################################################################
#!/bin/sh
# Read from the file words.txt and output the word frequency list to stdout.
cat words.txt | tr [:space:] "\n" | sed '/^$/d' | tr '[:upper:]' '[:lower:]'|sort|uniq -c|sort -nr | awk '{ print $2,$1}'