Linux Find Large Files

by Vivek Gite on December 16, 2008 · 29 comments

Q. How do I find out all large files in a directory?

A. There is no single command that can be used to list all large files. But, with the help of find command and shell pipes, you can easily list all large files.

Linux List All Large Files

To finds all files over 50,000KB (50MB+) in size and display their names, along with size, use following syntax:

Syntax for RedHat / CentOS / Fedora Linux

find {/path/to/directory/} -type f -size +{size-in-kb}k -exec ls -lh {} \; | awk '{ print $9 ": " $5 }'
Search or find big files Linux (50MB) in current directory, enter:
$ find . -type f -size +50000k -exec ls -lh {} \; | awk '{ print $9 ": " $5 }'
Search in my /var/log directory:
# find /var/log -type f -size +100000k -exec ls -lh {} \; | awk '{ print $9 ": " $5 }'

Syntax for Debian / Ubuntu Linux

find {/path/to/directory} -type f -size +{file-size-in-kb}k -exec ls -lh {} \; | awk '{ print $8 ": " $5 }'
Search in current directory:
$ find . -type f -size +10000k -exec ls -lh {} \; | awk '{ print $8 ": " $5 }'
Sample output:

./.kde/share/apps/akregator/Archive/http___blogs.msdn.com_MainFeed.aspx?Type=AllBlogs.mk4: 91M
./out/out.tar.gz: 828M
./.cache/tracker/file-meta.db: 101M
./ubuntu-8.04-desktop-i386.iso: 700M
./vivek/out/mp3/Eric: 230M

Above commands will lists files that are are greater than 10,000 kilobytes in size. To list all files in your home directory tree less than 500 bytes in size, type:
$ find $HOME -size -500b
OR
$ find ~ -size -500b

To list all files on the system whose size is exactly 20 512-byte blocks, type:
# find / -size 20

Perl hack: To display large files

Jonathan has contributed following perl code print out stars and the length of the stars show the usage of each folder / file from smallest to largest on the box:

 du -k | sort -n | perl -ne 'if ( /^(\d+)\s+(.*$)/){$l=log($1+.1);$m=int($l/log(1024)); printf  ("%6.1f\t%s\t%25s  %s\n",($1/(2**(10*$m))),(("K","M","G","T","P")[$m]),"*"x (1.5*$l),$2);}'

ls command: finding the largest files in a directory

You can also use ls command:
$ ls -lS
$ ls -lS | less
$ ls -lS | head +10

ls command: finding the smallest files in a directory

Use ls command as follows:
$ ls -lSr
$ ls -lSr | less
$ ls -lSr | tail -10

You can also use du command as pointed out georges in the comments.

See more find command examples and usage here and here.

Featured Articles:

Share this with other sys admins!
Facebook it - Tweet it - Print it -

{ 29 comments… read them below or add one }

1 Scott Carlson December 16, 2008

I use a script with this…

find . -xdev -printf ‘%s %p\n’ |sort -nr|head -20

Reply

2 Vivek Gite December 16, 2008

Excellent scott!

Reply

3 georges December 16, 2008

What I use is much simpler and efficient I’m afraid:
du -xak .|sort -n|tail -50

it lists the 50 biggest files or directories sorted by size

Reply

4 Jonny August 16, 2011

Why are you afraid?

Reply

5 Warren September 23, 2011

Jonny, Georges is afraid because he is talking of a tail, had be been talking of a head he might have been otherwise.

Reply

6 Shatnanu Oak December 17, 2008

Poor man’s command.
ls -lhR | grep 'G '
Not perfect but let me know the big files more than 1 GB

Reply

7 Topper December 17, 2008

ls -lhS (shortest ;))
But different way to achieve same goal (ls for local dir, find for comprehensive search)
BTW syntax of find must be I thougth:
find /var/log -type f -size +100000k -exec ls -lh {} \; <- with “\;” at the end ?

Reply

8 Vivek Gite December 17, 2008

Topper,

Dam html… thanks for the heads up.

Reply

9 Jonathan Jiang December 17, 2008

I prefer this perl script feeding from a du -k :

du -k | sort -n | perl -ne 'if ( /^(\d+)\s+(.*$)/){$l=log($1+.1);$m=int($l/log(1024)); printf                 ("%6.1f\t%s\t%25s  %s\n",($1/(2**(10*$m))),(("K","M","G","T","P")[$m]),"*"x (1.5*$l),$2);}'

It’ll print out stars and the length of the stars show the usage of each folder / file from smallest to largest on the box. Enjoy!

Reply

10 Jonathan Jiang December 17, 2008
du -k | sort -n | perl -ne 'if ( /^(\d+)\s+(.*$)/){$l=log($1+.1);$m=int($l/log(1024)); printf                 ("%6.1f\t%s\t%25s  %s\n",($1/(2**(10*$m))),(("K","M","G","T","P")[$m]),"*"x (1.5*$l),$2);}'

Reply

11 Vivek Gite December 17, 2008

Jonathan,

Thanks for sharing your nifty perl code. The faq has been updated with your code.

Reply

12 Chris Giordano January 27, 2009

If using RedHat 6.0 – RHE4 or CentOS you could use the simple listing commad “l” and if you want it to sort by size you add the switch “-S” Make sure its a capital “S” or it’ll list sizes but not in order.

l -S
this will return everything in that directory from largest to smallest.

if you want to do listing in a directory and need to figure out the switch you could also do “l –help” this will bring up the help file for the listing command.

Reply

13 ronald kriwelz simanjuntak January 14, 2010

how bout using this :
find /var -size +10000k -print0 | xargs -0 ls -lSh

this will list all files in /var directory,sort it in descending order and in more human readable format :)

Reply

14 Scott Carlson January 14, 2010

@ronald

Your find doesn’t work across subdirectories. It’s definitely very clean for a leaf directory though.

Reply

15 ronald kriwelz simanjuntak January 15, 2010

wut do you mean by it doesnt work across subdirectories ? i tried it on my ubuntu box and it show files in the subdirectories.
-rw-rw—- 1 mysql mysql 412M Jan 15 10:18 /var/lib/mysql/darta/namefile.MYD
-rw-rw—- 1 mysql mysql 173M Jun 9 2009 /var/lib/mysql/flyingfight/dbacomment.MYD
-rw-rw—- 1 mysql mysql 165M Jan 15 10:40 /var/lib/mysql/interndba/post.MYI
-rw-rw—- 1 mysql mysql 159M Jan 15 10:40 /var/lib/mysql/interndba/post.MYD
-rw——- 1 root root 105M Jan 10 03:31 /var/log/messages.1

those files are in different subdirectories right?

Reply

16 Scott Carlson January 15, 2010

@ronald

Interesting. I dug a bit. My use case is find the largest files in a directory and not just those over 10M. So I had removed the size restriction, but the same problem occurs with a smaller size restriction. Even with “-size +100k” find was returning directories as well as files. This messed up the expected results as I previously saw.

So for me, this one works as expected.
find . -type f -print0 | xargs -0 ls -lSh | head -20

Thanks.

Reply

17 ronald kriwelz simanjuntak January 15, 2010

owh yes, i forgot to say that it will list all the files bigger than 10MB,since wut i ned is to list biggest files, and yeah ur addition to the command does the thing :)
or u can add “more” to the command
the power of command line, the beauty of linux :)

Reply

18 Keith White April 7, 2010

I find the following works rather well…

du -xak . | sort -n | awk '{size=$1/1024; path=""; for (i=2; i 50) { printf("%dMb %s\n", size,path); } }'

It lists all files or directories bigger than 50MB (just change size>50 to alter that) in the current directory (change the “.” to a directory path to specify another one) in a friendly, human-readable way and happily plays with spaces (I used it on an NTFS mount in fact).

Reply

19 Deepankar April 9, 2010

Try
cd
du -h | grep [0-9]G

This will list all files that are in GB.
Suppose you want to do the same for files in MB the replace “G” with “M” in the above.

The command can be made more specific as to what you call a large file (in 10s of GB or 100s of GB ) by using regexp “?” instead of “[0-9]“

Reply

20 Deepankar April 9, 2010

Try
cd (directory path)
du -h | grep [0-9]G

This will list all files that are in GB.
Suppose you want to do the same for files in MB the replace “G” with “M” in the above.

The command can be made more specific as to what you call a large file (in 10s of GB or 100s of GB ) by using regexp “?” instead of “[0-9]“

Reply

21 Michael April 15, 2010

Jonathan,

That Perl one-liner is a work of art. Thank you for sharing it with the world!

Reply

22 Lesle Boyd December 22, 2010

You guys are the greatest!
I sure enjoyed reading this thread and the information is extremely useful in my job.
Thanks to all who posted. My head is swimming!

Reply

23 Albert December 22, 2010

Hi everyone!!
i have a litle problem, i have this

find /home/dir -exec grep -H -w “op1″ {} \; | grep -w “op2″

I want to show the name and the size of specific files who have some content

ls -l (filename) | awk ‘{sum = sum + $5} END {print sum}’

i been trying put this together but no luck

Reply

24 gunjankapoor December 28, 2010

To finds files = 50,000KB (50MB+) in size and display their names, along with size.
(The size should be exact).
What will be the command?

Reply

25 Binu January 21, 2011

find -size +50M -printf “%s %p\n”

‘man find’ will tell you other printf options.

Reply

26 Pejman May 23, 2011

tnx to everyone. great sharing :)

here is the same command but has filter for just *.log files.
to find huge log files on linux:

find . -size +1000k -name *.log -print0 | xargs -0 ls –lSh

good luck.

Reply

27 skater July 1, 2011

My tips that put together some of the above

#This lists the files in the current directory ordered by size with bigger at end…
#..so you do not have to scroll up ;)
ls -alSr

#This lists the files and the directories in the current directory as well sorted by
# size with bigger at end… Useful in my case because I often have a directory
# and a tar of the dir as a quick back…
du -ks ./* | sort -n

bis
S

Reply

28 Erwin July 19, 2011

super awesome ;)

Reply

29 Tom scott November 16, 2011

How would i delete a directory that has gone above say 10GB?
-size shows differently for files and directories.

Reply

Leave a Comment

You can use these HTML tags and attributes for your code and commands: <strong> <em> <ol> <li> <u> <ul> <blockquote> <pre> <a href="" title="">
What is 4 + 6 ?
Please leave these two fields as-is:
IMPORTANT! To be able to proceed, you need to solve the simple math so we know that you are a human and not a script.




Previous post:

Next post: